By clock per clock I mean say take an operon @ 2.0 GHz and compare its power draw to a presscot and northwood also @ 2.0GHz. Then maybe throw in C&C as well as the Intel version (i forget the name) to see how much they can save.
This doesn't make much sense because prescott needs to run at a higher clock in order to get the same performance. This is due to the larger pipeline. That is why AMD went away from using frequency to label processors. A 1.2GHz AMD had performance similar to a 1.5GHz Intel Northwood. This is why AMD advertised it as a 1500+, even though it was a 1.2GHz CPU. Therefore you need to compare performance/watt based off of benchmarks and power at the wall. This is the best way to compare performance, especially with conroe.
No, he is simply looking for an answer to the question -- if you pit the processors clock tick for clock tick, which consumes more energy?
I am assuming he wishes to compare things like CPU leakage, power draw, etc. under the same conditions physically and not by trying to match up performance.
You are quite correct, the long pipeline requires extra power and is less efficient IPC (instructions per second wise), thus the need to ramp high clocks to achieve similar computational performance.
We can estimate it though without going into great detail. The formulation for dynamic power in a processor can be calculated by:
Power = C * V* V * F (C is cumulative capacitance at the gate level, V is drive voltage, and F is frequency).
Thus if you were to scale a 3.6 GHz P4 with a dissipation of 105 watts from 3.6 to say FX-62 speed which is 2.8, then the power should scale roughly about 2.8/3.6 or 0.77 which would put power at 81.7 Watts. Lower than an FX-62 of 125 watts, and even lower than a 4800+ at 89 watts.
This rather simple and straight forward analysis, though, is not entirely correct --- it is back of the envelop pure and simple. Don't take it as absolute, the analysis will require much further digging and explanation.
Jack
While i understand what he is going after, i am just saying that this analysis has little purpose since the cores are so different. Comparing frequency/power accomplishes little because every part of the pipeline is smaller on a P4 CPU. With a transistor of a constant speed will consume X amount of power. If a logic device is Y transistors deep, then it will consume XY Watts/clock. If a different logic device is twice as deep, it will consume 2XY Watts/clock, but will run at 1/2 of the speed but do approximately the same amount of work. (based off of simple logic, not actual power equations).