Tom's Hardware > Forum > CPU & Components > CPUs > Why the scaling of my core i7 920 is so poor, even worse than Q6600?

Why the scaling of my core i7 920 is so poor, even worse than Q6600?

Forum CPU & Components : CPUs - Why the scaling of my core i7 920 is so poor, even worse than Q6600?

Tom's Hardware: Over 1.4 million members in 6 different countries available to answer all your high-tech questions. Sign up now! Its free!
Word :    Username :           
 

Update:

Some of your guys are pathetic. I don't give a damn who make the CPU. In fact, I built three Intel-CPU-computers in last two years. I posted my test results in some forums to share my info and hope to find why the scaling my core i7 computer is so poor. I really had a high expectation for core i7.

OK. I redid my tests on windows XP, with/without HT. I used similar compiler flags as SPEC 2006.
I also overclock Core i7 from 2.66G to 3.5G. Below are the setup and comparison.

Core i7 computer
CPU: core i7 920 at 2.66G
MB: ASUS P6T
memory: 6G DDR3 1600
OS: Fedora 10 64bit
Compiler: Intel Fortran Compiler Professional 11.0.069
Compiler flags: -r8 -xSSE4.2 -ip -openmp -opt-prefetch

OS: Window XP 64bit
Compiler: Intel Visual Fortran Compiler Professional 11.0.066
Compiler flags: /Qautodouble /QxSSE4.2 /Qip /Qopenmp /Qopt-prefetch /F1000000000

Core 2 Quad computer
CPU: Core 2 Q6600 at 2.4G
MB: gigabyte p35
memory: 4G DDR2 800
OS: fedora 9 64bit
Compiler: Intel Fortran Compiler Professional 11.0.069
Compiler flags: -r8 -ip -openmp

Dual Xeon computer
CPU: dual Xeon E5410 at 2.33G
MB: Tyan S5396A2NRF
memory: 8G DDR2 667 ECC Fully Buffered
OS: fedora 9 64bit
Compiler: Intel Fortran Compiler Professional 11.0.069
Compiler flags: -r8 -ip -openmp

The testing code is a openmp CFD(computational fluid dynamics) code. The code used about 500MB memory.

http://www.amdzone.com/phpbb3/download/file.php?id=86

*For Core i7, Turbo mode is OFF.

For four threads, Core i7 920@3.55G is slower than Q6600@2.4G!
The scaling of Xeon E5410 is the best.



---------------------------------------------------------------------------------------------------------------------------------------
I read a lot reviews and forums about the new Intel CPU core i7. The new Intel CPU looks great. For example, according to spec cfp2006,
http://www.amdzone.com/phpbb3/view [...] 2&t=135802, Core i7 is about 100% faster than core 2 for float point performance. Although someone suggested that Intel might optimize the CPU for the benchmark codes, I still decided to upgrade my computer from core 2(Q6600) to core i7(920). Because I thought the better bandwidth of core i7 would help the multithread computation. Well, I was wrong. My testing results made me very disappoint to the new core i7 CPU.

My new computer
CPU: core i7 920 at 2.66G
MB: asus p6t
memory: 6G DDR3 1600
OS: fedora 10 64bit

My old computer
CPU: core 2 Q6600 at 2.4G
MB: gigabyte p35
memory: 4G DDR2 800
OS: fedora 9 64bit

compiler: Inter ifort 11

The testing code is a openmp CFD(computational fluid dynamics) code. The code used about 500MB memory. The testing results:

1 thread , 202.7s (core i7 920) , 213.1s (q6600), 5.13%(core i7 advantage)
2 threads, 109.0s (core i7 920) , 109.3s (q6600), 0.27%(core i7 advantage)
4 threads, 96.1s (core i7 920) , 68.3s (q6600), -28.93%(core i7 advantage)

(core i7 HT is off, it is slower if HT is on. the clock of core i7 920 is about 10% higher than core 2 Q6600)

The scaling of core i7 for 4 threads is really bad. Considering the bandwidth of core i7 920 is about twice of core 2 Q6600, this result is really strange. I don't know why, maybe the OS didn't use all the potentials of the new CPU?

Message quoted 1 times
Message edited by jsmith001 on 12-05-2008 at 08:51:19 AM
Sponsored Links
Register or log in to remove.

Is turbo mode causing the chip to not use all 4 cores :)?

Reply to DFGum

Might be Fedora itself. The drivers may not be mature enough for it to fully utilize the CPUs power.

------------------------------ http://www.steamcalculator.com/76561197970703804/camo_sig.png
Reply to jimmysmitty

Wouldn't that be funny if i7's only advantage over a Quad turned out to be its disadvantage? Still, its probably Fedora and not i7 itself. Its worth looking into though.

Reply to gamerk316
- 0 +

Or it could be the processor overheating; did you check temperatures for the 4-core run?

Turbo mode would not explain a 25%+ discrepancy.

I am curious what 2-core and 4-core run times would be if you took out the last stick of 2G RAM, to force the i7 into dual-channel. It could be that the CFD code you have does not need the bandwidth.

The AMDZone benchmarks you linked to are of bandwidth, not core computation. They are the same benchmarks AMD used to advertise when K10 had a weak core but strong bandwidth. That worked well for servers, but flopped for DT/workstation.

Reply to Wr

Its either the App itself or like another poster said turbo mode is running just on core.

Reply to roadrunner197069

Its either the App itself or like another poster said turbo mode is running just on core.

Reply to roadrunner197069

For 1 you are using 2 different OS compiles . Try running Fedora 10 on the CoreI7 machine.

------------------------------ I aint signing nothing!!!
Reply to Rick_Criswell

he aske the same question in anand forum and got the same answer, because of the 2 different versions of fedora. by the way he is an amdzoner. he's been forum hoping.


Message edited by kg4icg on 12-04-2008 at 03:08:36 PM
Reply to kg4icg

Oh probably some AMD fans trying to bash on i7,but bad attempt :)

Reply to tim924

jimmysmitty wrote :

Might be Fedora itself. The drivers may not be mature enough for it to fully utilize the CPUs power.




thats exactly what it is. i ran ubuntu and it was noticeably slower than windows with my E8500.


but ill give fedora credit and just say its still a silly quad core in a single threaded world :kaola:

Reply to werxen

Same thread on AMD zone as well. Viral fanboys FFS.

Reply to gallag

Well, from what we can see in benchmarks, i7 isn't as good as Core 2 quad overall. It's good in some apps only. It's a stepping stone and the second version of the architecture will probably get things right.

Like B3 Phenom fixed B2 errata issues and added a bit more clock speed; and a die shrink with Deneb will improve things even further. It's not just AMD fanboys, it's the nature of a new architecture. Core 2 wasn't as new as people thought, it built upon Core architecture found in notebooks. It was highly successful because Netburst was so lame in it's last days.

Anyone who expected i7 to be to Core 2 as Core 2 was to Netburst was in for a disappointment. There's nothing wrong with i7 that the next modification won't fix. As is, I don't see it as a good upgrade for anyone with a Q6600 or better Intel quad.

------------------------------ Phenom 8750, ASUS M3A78T
4 gigs Kingston DDR2 800 two 1T SAMSUNG HD103UI
Sapphire 4870x2, Sony BDU-X10S BD-ROM
Antec Neo 650 PSU Antec Nine Hundred, Acer H213H 1080p LCD
Reply to yipsl

gallag wrote :

Same thread on AMD zone as well. Viral fanboys FFS.



Yeah, I noticed it too. Whatta bozo...

Reply to fazers_on_stun

yipsl wrote :

Well, from what we can see in benchmarks, i7 isn't as good as Core 2 quad overall. It's good in some apps only. It's a stepping stone and the second version of the architecture will probably get things right.




What benchies are you talking about? All the ones I've seen show an improvement over Penryn in most areas, esp. encoding and gaming where the system is not GPU bound.

Reply to fazers_on_stun

I dont know whats worse. The OP or yipsl's last post.

Reply to someguy7

considering guys that went from the Qx series are seeing gains, that makes this guy look even less genuine. either he is a troll or he really screwed up something when he built his system.

Reply to roofus

Plus the fact that he turned simultaneous multithreading off, despite the fact that his app is multithreaded. Something is seriously wrong if that degrades performance, because that's one of i7's biggest advantages over core 2.


Message edited by cjl on 12-05-2008 at 08:49:06 PM
Reply to cjl

werxen wrote :

thats exactly what it is. i ran ubuntu and it was noticeably slower than windows with my E8500.


but ill give fedora credit and just say its still a silly quad core in a single threaded world :kaola:



Thats the problem with Open Software really. I mean its great because its free and normally much more stable. But the drivers needed to run everything at its best normally take much longer to get because they don't have the giant buildings worth of people working on them or the update support like Windows gives.

Reply to jimmysmitty

I noticed the OP edited the original post to call us "pathetic" - guess he knows we can't downgrade the original post since if it falls too negative it'll get hidden :).

Reply to fazers_on_stun

jsmith, not sure what to tell you, but there is two posts with excellent analysis in this thread by Idontcare in the Anantech forums here:


http://forums.anandtech.com/messag [...] erthread=y

Who? knows, maybe there's more to this cache coherancy and TLB issue than meets the eye.

Reply to piesquared

jsmith001 wrote :

Update:

Some of your guys are pathetic. I don't give a damn who make the CPU. In fact, I built three Intel-CPU-computers in last two years. I posted my test results in some forums to share my info and hope to find why the scaling my core i7 computer is so poor. I really had a high expectation for core i7.

OK. I redid my tests on windows XP, with/without HT. I used similar compiler flags as SPEC 2006.
I also overclock Core i7 from 2.66G to 3.5G. Below are the setup and comparison.

Core i7 computer
CPU: core i7 920 at 2.66G
MB: ASUS P6T
memory: 6G DDR3 1600
OS: Fedora 10 64bit
Compiler: Intel Fortran Compiler Professional 11.0.069
Compiler flags: -r8 -xSSE4.2 -ip -openmp -opt-prefetch

OS: Window XP 64bit
Compiler: Intel Visual Fortran Compiler Professional 11.0.066
Compiler flags: /Qautodouble /QxSSE4.2 /Qip /Qopenmp /Qopt-prefetch /F1000000000

Core 2 Quad computer
CPU: Core 2 Q6600 at 2.4G
MB: gigabyte p35
memory: 4G DDR2 800
OS: fedora 9 64bit
Compiler: Intel Fortran Compiler Professional 11.0.069
Compiler flags: -r8 -ip -openmp

Dual Xeon computer
CPU: dual Xeon E5410 at 2.33G
MB: Tyan S5396A2NRF
memory: 8G DDR2 667 ECC Fully Buffered
OS: fedora 9 64bit
Compiler: Intel Fortran Compiler Professional 11.0.069
Compiler flags: -r8 -ip -openmp

The testing code is a openmp CFD(computational fluid dynamics) code. The code used about 500MB memory.

http://www.amdzone.com/phpbb3/download/file.php?id=86

*For Core i7, Turbo mode is OFF.

For four threads, Core i7 920@3.55G is slower than Q6600@2.4G!
The scaling of Xeon E5410 is the best.



---------------------------------------------------------------------------------------------------------------------------------------
I read a lot reviews and forums about the new Intel CPU core i7. The new Intel CPU looks great. For example, according to spec cfp2006,
http://www.amdzone.com/phpbb3/view [...] 2&t=135802, Core i7 is about 100% faster than core 2 for float point performance. Although someone suggested that Intel might optimize the CPU for the benchmark codes, I still decided to upgrade my computer from core 2(Q6600) to core i7(920). Because I thought the better bandwidth of core i7 would help the multithread computation. Well, I was wrong. My testing results made me very disappoint to the new core i7 CPU.

My new computer
CPU: core i7 920 at 2.66G
MB: asus p6t
memory: 6G DDR3 1600
OS: fedora 10 64bit

My old computer
CPU: core 2 Q6600 at 2.4G
MB: gigabyte p35
memory: 4G DDR2 800
OS: fedora 9 64bit

compiler: Inter ifort 11

The testing code is a openmp CFD(computational fluid dynamics) code. The code used about 500MB memory. The testing results:

1 thread , 202.7s (core i7 920) , 213.1s (q6600), 5.13%(core i7 advantage)
2 threads, 109.0s (core i7 920) , 109.3s (q6600), 0.27%(core i7 advantage)
4 threads, 96.1s (core i7 920) , 68.3s (q6600), -28.93%(core i7 advantage)

(core i7 HT is off, it is slower if HT is on. the clock of core i7 920 is about 10% higher than core 2 Q6600)

The scaling of core i7 for 4 threads is really bad. Considering the bandwidth of core i7 920 is about twice of core 2 Q6600, this result is really strange. I don't know why, maybe the OS didn't use all the potentials of the new CPU?



So XP uses 4 cores now when did MS add that?

Word, Playa.

Reply to spud
- 0 +

Can you use the same non-SSE 4.2 non-prefetch binary code to bench both the Q6600 and i7 920? As well, could you show a 4-thread i7 run in Task Manager with the cores on individual graphs? And as I mentioned before, a CoreTemp shot somewhere around the middle of the run, to rule out load-induced throttling.

As usual I would like to rule out compiler optimizations that turn into speed bumps for specific source code, bottlenecks outside the CPU, and heat-induced throttling or hardware misconfiguration.

Reply to Wr

spud wrote :

So XP uses 4 cores now when did MS add that?

Word, Playa.



Since quite a while. It just doesn't allocate programs across each core as well as Vista does. It leaves that up to the program.

But Vista trys to allocate all processes across all the cores available to keep the load per core lighter and try to get the work done faster.

Xp, if I remember correctly, will load the first core all with processes until it is at about 100% then start loading the next core and so on and so forth.

Reply to jimmysmitty

jimmysmitty wrote :

Since quite a while. It just doesn't allocate programs across each core as well as Vista does. It leaves that up to the program.

But Vista trys to allocate all processes across all the cores available to keep the load per core lighter and try to get the work done faster.

Xp, if I remember correctly, will load the first core all with processes until it is at about 100% then start loading the next core and so on and so forth.



You are correct I got the licensing mixed up. My bad.

Word, Playa.

Reply to spud
Tom's Hardware > Forum > CPU & Components > CPUs > Why the scaling of my core i7 920 is so poor, even worse than Q6600?
Go to:

There are 1136 identified and unidentified users. To see the list of identified users, Click here.

Please mind

You are about to answer a thread that has been inactive for more than 6 months.
If you still wish to proceed, please ensure that your posting is original and does not duplicate or overlap any prior responses to this thread.

Add a reply Cancel
Sponsored links
  • Ask the community now
  • Publish
Ad
They won a badge
Join us in greeting them