Ads
Ads
All about CPU
 Latest CPU articles
Tuning Cool'n'Quiet: Maximize Power And Performance, Part 1

Tuning Cool'n'Quiet: Maximize Power And Performance, Part 1
Think your Athlon or Phenom processor is already tuned to deliver the best balance between performance and power consumption? Think again. We show you how to tweak Cool'n'Quiet for even more aggressive speed at maximum efficiency using several AMD CPUs. Read More

  • AMD Phenom II X4 965 BE: Same Speed, Less Power
    Today AMD is introducing a revision of its flagship Phenom II X4 965 processor rated at 125W, replacing the 140W part, as well as a new 3.1 version of its Overdrive overclocking software. We take a quick look at both to see what advantages they offer. Read More
  • Overclocked On Air: Intel's Core i5-750
    Intel's new quad-core i5 and i7 CPUs for LGA 1156 deliver plenty of performance and impressive efficiency. But how far can they be overclocked? We take the entry-level model Core i5-750 as far as it'll go with a modest air cooler and benchmark it. Read More
All CPU articles

Newsletters


  • Ask your question about IT issues
  • Post

Partners

The Games selection

adventure : Ray Adventure game, South Park style. Pick the way the story goes by picking an answer among those offered.
violent : Interactive Buddy Unwind on your interactive buddy: Do anything you want to him, it will earn you money, and you can buy other stuff to torture him with.
Ads

Sponsored links

FSB Limits Exposed: Intel CPUs Don't Scale Very Well In UC Berkeley Test

Next news
1:20 PM - April 29, 2008 by Theo Valich

 

Berkeley (CA) - Researchers from the Computer Science Division at UC Berkeley and Lawrence Berkeley National Laboratories (CRD/NERSC) recently submitted a paper to the IEEE, highlighting the subject of scaling an optimized Lattice Boltzmann Simulation on popular supercomputer architectures. TG Daily was told that the paper was good enough to prompt the IEEE to issue an award. However Intel may not be completely happy with the findings: At least in this very specific environment, the Xeon and Itanium 2 processors did not scale very well, while Sony’s Cell BE came out on top.

The paper itself was published as "Lattice Boltzmann Simulation Optimization on Leading Multicore Platforms" and tries to shed some light on a specific area of socket-per-socket HPC (High-Performance-Computing) scaling in supercomputer environments. The scientists evaluated AMD’s Opteron (Santa Rosa), Intel’s Itanium 2 and Xeon (Clovertown), as well as the Sony-Toshiba-IBM Cell BE and Sun’s Niagara 2 processors. The researchers apparently spent quite some time on optimizing the application itself, rather than the hardware. This optimization was claimed to have resulted in a 14x improvement over the original LBMHD code (Lattice Boltzmann magneto-hydrodynamics).


According to the paper, the best scaling was delivered by the STI Cell BE system, followed by Sun’s Niagara 2, AMD’s Opteron, Intel’s Xeon and Itanium 2.

We contacted Intel to discuss UC Berkeley’s findings, but Intel declined to comment as the company said it wasn’t familiar with the content in the paper.

However, Lattice Boltzmann applications are known to have a high demand for system memory bandwidth and this fact may have put Intel’s system at a disadvantage in this specific test: Intel uses FB-DIMM 667, AMD DDR2-667, Niagara 2 FB-DIMM 667 and Cell the ultra-fast (and Rambus-based) XDR technology. Until Nehalem (Bloomfield core) and its integrated triple DDR3 memory controller comes along, Intel is likely to trail the pack in such tests. Regardless of the name of your Xeon processor, whether it is Cloverfield, Harpertown or Tigerton, any bandwidth-intensive application will cause a poor scaling performance on a FSB-burdened platform. In this UC Berkeley test, Intel’s Xeon and Itanium 2 followed the pack with a substantial distance.

It is interesting to note that even AMD’s Opteron processors were scaling almost in a linear fashion when additional CPUs were added. The Xeons scaled only by 43% on a socket-per-socket basis.

The lesson learned? Obviously, there are different benchmarks out there, most of them stressing a particular discipline. This specific test indicates that you should not run a memory bandwidth-intensive application through a Xeon or Itanium 2 system, if you have the luxury of having an Opteron, Niagara 2 or Cell system available as well. But does it mean that Xeons and Itanium generally scale worse than other architectures? No. There is more to supercomputers than memory bandwidth and Intel certainly has the edge on pure processing horsepower at this time.

Source : Tom's Hardware US

Talkback
Add your comment
TripGun 04/30/2008 2:05 AM
Hide
-0+

If the test is so dependent on memory bandwidth then it seems they should have submitted a few GPU's in their test as they are so well known for their throughput. These synthetic benchmarks are optimized for certain core architecture and should be taken with a grain of salt. What are they teaching our kids in school?

Kari 04/30/2008 12:45 PM
Hide
-0+

I wouldn't call LBMHD as a 'synthetic benchmark', they actually use it in their research and stuff.. :P

D_Kuhn 04/30/2008 4:42 PM
Hide
-0+

Intel processors have never scaled well... which makes sense since they come from a single processor PC background. AMD's entire Athlon64/Opteron line is based on a processor design that was originally built for multiprocessing, acquired from a company that built servers - so of course they scale better. Sony's design also... ground up for massively parallel implementation. Intel is out of it's application space here, hence the relatively poor performance.

Don't expect that advantage to last though, every successive generation of Intel architecture has seen advances in multiprocessing design. The Core Duo architecture has not caught up to the Opteron in this regards (though it far surpasses it in area's that count more for the PC market), but it's getting close.

StenLi 05/01/2008 4:25 PM
Hide
-0+

"..AMD?s Opteron (Santa Rosa).." r u sure? :)

traviso 05/01/2008 10:07 PM
Hide
-0+

But did they test the old 65nm Xeons or the new 45nm ones, as they have a much higher FSB (800mhz vs 1033mhz and 1666mhz). I'm suspecting they probably didn't.

Also, Intel has a new chipset coming out in the end of 2008 that totally gets rid of the northbridge/southbridge concept and is promising (no benchmarks yet, but over the past 1.5yrs, Intel has been delivery on promises, unlike the past) to make big improvements in bus limitations for PCs.

Comments are closed on this page.

Sponsored links