Tom's Hardware Forums » Overclocking » General Discussions » Memory bandwidth tests... any real differences (part 2)
 

Memory bandwidth tests... any real differences (part 2)




Word :   Username :  
 
Bottom
Author
 Thread : Memory bandwidth tests... any real differences (part 2)
 
Profile: addict
More Information

About 7 months ago I posted data comparing two memory dividers (1:1 and 3:5 @ 333 MHz) on my then Q6600/P965 based system and concluded that for the 67 % increase in memory bandwidth, the marginal gains in actual performance weren't worth the extra voltage/heat.

Since then I've upgraded my hardware to an X3360/P35 setup and wanted to revisit this issue. Again, two dividers were looked at: one pair running 8.5x333=2.83 GHz, and another running @ 8.5x400=3.40 GHz:

333 MHz FSB:
1:1 a.k.a. PC2-5300 (667 MHz)
5:8 a.k.a. PC2-8500 (1,067 MHz)

400 MHz FSB:
1:1 a.k.a. PC2-6400 (800 MHz)
4:5 a.k.a. PC2-8000 (1,000 MHz)

I figured there would be a much greater difference in the 333 FSB case since the memory bandwidth increased by 60 % vs. 25 % in the 400 MHz FSB case. All other BIOS settings were held constant with the exception of the divider (and the strap) and the given FSB. Subtimings were set to auto and as such could vary as managed by the board which I found out, was required since manually settings some of the subtimings lead to either an incomplete POST, or an unstable system.

The benchmarks were broken down into three categories:
1) "Real-World" Applications
2) 3D Games
3) Synthetic Benchmarks

The following "real-world" apps were chosen: x264, winrar, and the trial version of Photohop CS3. All were run on a freshly installed version of Windows XP Pro x64 SP2 w/ all relevant hotfixes. The 3D games were just Doom3 (an older game) and Crysis (a newer game). Finally, I threw in some synthetic benchmarks consisting of the Winrar self test, Super Pi-mod, and Everest's synthetic memory benchmark. Here is an explanation of the specifics:

Trial of Photoshop CS3 – The batch function in PSCS3 v10.0.1 was used process a total of fifty-six, 10.1 MP jpeg files (226 MB totally):

1) bicubic resize 10.1 MP to 2.2 MP (3872x2592 --> 1800x1200) which is the perfect size for a 4x6 print @ 300 dpi.
2) smart sharpen (120 %, 0.9 px radius, more accurate, lens blur setting)
3) auto levels
4) saved the resulting files as a quality 10 jpg.

Benchmark results are an average of two runs timed with a stopwatch.

RAR version 3.71 – rar.exe ran my standard backup batch file which generated about 955 MB of rars containing 5,210 files totally. Here is the commandline used:

Code :
  1. rar a -m3 -md4096 -v100m -rv40p -msjpg;mp3;tif;avi;zip;rar;gpg;jpg "f:\Backups\Backup.rar" @list.txt

where list.txt a list of all the target files/dirs included in back up set. Benchmark results are an average of two runs timed with a stopwatch.

x264 Benchmark HD – Automatically runs a 2-pass encode on the same 720p MPEG-2 (1280x720 DVD source) file four times totally. It contains two versions of x264.exe and runs it on both. The benchmark is the best three of four runs (FPS) converted to total encode time.

Shameless promotion --> you can read more about the x264 Benchmark HD at this URL which contains results for hundreds of systems. You can also download the benchmark and test your own machine.

3D Games Based Benchmarks

Doom3 - Ran timeddemo demo1 a total of three times and averaged the fps as the result. Settings were 1,280x1,024, ultra quality with 8x AA.

Crysis - Ran the included "Benchmark_CPU.bat" and "Benchmark_GPU.bat" both of which runs the pre-defined timedemo, looped four times. I took the best three of four (average FPS) and averaged them together as the benchmark. Settings were 1,024x768, very high for all (used the DX9 very high settings hack, and 2x AA.

"Synthetic" Application Based Tests

WinRAR version 3.71 – If you hit alt-B in WinRAR, it'll run a synthetic benchmark. This was run twice (stopped after 150 MB) and is the average of four runs.

SuperPI / mod1.5 XS – The 16M test was run twice, and the average of the two are the benchmark.

Everest v4.50.1330 Memory Benchmark - Ran this benchmark a total of three times and averaged the results.

Hardware specs:

Code :
  1. D.F.I. LP LT P35-TR2 (BIOS: LP35D317)
  2. Intel X3360 @ 8.5x400=3.40 GHz
  3. Corsair Dominator DDR2-1066 (TWIN2X4096-8500C5DF)
  4.   2x 2Gb @ 5-5-5-15 (all subtimings on auto)
  5. (tRD=8) @ 667 MHz (1:1) @ 2.100V
  6. (tRD=7) @ 1,066 MHz (5:8) @ 2.100V
  7. (tRD=8) @ 800 MHz (1:1) @ 2.100V
  8. (tRD=6) @ 1,000 MHz (4:5) @ 2.100V
  9. EVGA Geforce 8800GTS (G92) w/ 512 meg
  10. Core=770 MHz
  11. Shader=1,923 MHz
  12. Memory=2,000 MHz


Note: the performance levels (tRD) are set automatically by the board which wouldn't POST if I manually tweaked them. Even though they're different, I still feel the data are valid since this is the only way I can run them. In other words, if I'm going to run the higher dividers, it'll be as such or it won't POST!

Without further ado, here are the data starting first with a 333 MHz FSB comparing the 1:1 vs. 5:8 divider (DDR2-667 vs. DDR-1066):
http://img231.imageshack.us/img231/1902/333vb8.gif

Here are the averaged data visualized graphically:
http://img231.imageshack.us/img231/3518/333graphbx2.gif

Now on to the 400 MHz FSB comparing the 1:1 vs. 4:5 divider (DDR2-800 vs. DDR2-1000):
http://img128.imageshack.us/img128/5392/400wb6.gif

And graphically:
http://img231.imageshack.us/img231/3334/400graphgl9.gif

As you can see, there way nothing spectacular in either the real-world category, or the 3D games category in comparison to the massive increase in memory bandwidth (shown on the graphs in red). In fact, I was surprised to see that there were really no gains by Doom3 and minimal gains by Crysis. This is probably due to the fact that the video card shoulders the burden of these games with Doom3 being the light-weight of the two. As expected, the synthetic benchmarks did pick-up on the larger bandwidth, but only in the case of the 400 MHz FSB did I see anything approaching the theoretical increase (14 % of 25 % vs 15 % of 60 %).

If you read my first memory bandwidth post, perhaps the same conclusions can be drawn from these new data. One thing I'll add is that this new MB doesn't require extra voltage like my older P5B-Deluxe did to run the higher dividers, so it's not producing that much more heat. That said, I'm actually running the system with the 4:5 divider, since things seem to feel faster to me (windows opening, responsiveness, etc.) which are all unfortunately intangibles I can't measure.


Message edited by graysky on 05-11-2008 at 08:36:47 PM

---------------
http://encoding.n3.net <--- for all your CD and DVD "back-up" needs!

http://valid.x86-secret.com/cache/banner/359444.png
Related Product

Register or log in to remove.

Master-de-bater
Profile: Eternal Poster
More Information

Thanks as always, graysky. Much appreciated.

I think I remember that you did a test with different FSB speeds but same cpu speeds. Could you do that test again with your newer system? I don't think anything has changed, but it's always good to know.

Cheers! [:mousemonkey:3]


---------------
"Nvidia, the Way It's Meant to be PAID Played! - Corrado
*Lesbian Lover Club* - founder Assman
Profile: addict
More Information

@evilonigiri - maybe later dude... I'm pretty burnt out of tests after this one.. they are pretty tedious and time consuming as you can imagine :(

Master-de-bater
Profile: Eternal Poster
More Information

Of course. I did some tests with the PCI-E frequency before and that was quite tiring.

And of course, I would be referring to this in the future. You have my thanks again.


---------------
"Nvidia, the Way It's Meant to be PAID Played! - Corrado
*Lesbian Lover Club* - founder Assman
bc4
Profile: addict
More Information

very nice

thanks for your hard work.

Profile: addict
More Information

I edited the first post switching the highest 400 MHz FSB run from 5:6 to 4:5 (960 MHz vs. 1,000 MHz) and included some info about subtimings to make things more clear.

Profile: journeyman
More Information

Quote :

Subtimings were set to auto and as such could vary as managed by the board which I found out, was required since manually settings some of the subtimings lead to either an incomplete POST, or an unstable system.



Great Post.. Interesting.
Would like to see something similiar but with the memory timings optimized. Unfortunately it sounds like you had a problem there.
Currently running my memory 1:1 at 667mhz (underclocked), but the timings are 3-3-3-8 1T. I wonder how it would compare to higher mhz with looser timings. Like 800mhz 4-4-4-12 2T, or even 900+ mhz.


---------------
P5N32-E-SLI, e4300@3.2, 4x1gig tracers 1:1 , 8800gts-320 OC ?/?, antec 850w psu, antec 900 case
Vista64 Ultimate
Profile: addict
More Information

Yeah, good question. I haven't tried backing off on the main timings with these chips.


  Tom's Hardware Forums » Overclocking » General Discussions » Memory bandwidth tests... any real differences (part 2)

Go to:
 

Google Ads
Ad
News

SGI claims 4.35 TB memory bandwidth on Itanium 2 server

Published on July 17, 2006

SGI said that it has built 1024-processor Altix 4700 system that has achieved a sustained memory bandwidth of 4.35 TB/s. Read more

Corsair ships Xpert memory modules

Published on February 01, 2005

Corsair said it is now shipping its blinking and text-scrolling memory modules, dubbed Xpert. Read more

CES 2007: Dell wants more internet bandwidth

Published on January 09, 2007

At his CES keynote this morning, Michael Dell challenged the telecoms industry to roll out more fiber more quickly in order to help fuel the digital age - jumping on the bandwagon of praising "You" (and Co) for driving the digital age with bandwidth consuming content. Read more

Samsung ships "majority" of GDDR3 memory for Xbox 360

Published on November 29, 2005

Samsung today said that it is the main supplier of the graphics memory for Microsoft's recently launched game console Xbox 360. Running at 700 MHz, the 512 Mbit GDDR3 memory, organized in a 16 Mbit x 32 configuration, is rated at a bandwidth of 5.6 GByte per second, which is about 3.5 times faster than the memory used in the original Xbox, Samsung said. Read more

Latest Reviews & Articles

Part 4: Avivo HD vs. PureVideo HD

Published on September 29, 2008

The 780G chipset/Radeon HD 3200 and the MCP78S chipset/GeForce 8200 provide the first integrated graphics solutions that can accelerate Blu-ray playback. We dig deep into how well they work with high-quality Blu-ray 1080p video playback. Read more

Four GeForce 9600 GT Cards Compared

Published on September 26, 2008

Manufacturers really love the first Geforce 9. The graphic chip is fast, the cards are inexpensive, and some retailers offer more than ten variations. Read more

Maxtor's Shared Storage Does NAS At Home

Published on September 25, 2008

What do you do with all the data you collect at home? Network attached storage is the solution. We test Maxtor's Shared Storage II and find that it is also suitable for use in small businesses. Read more

SLI & Centrino 2: Gaming Laptops Battle

Published on September 24, 2008

Take four gaming laptops. Arm two of them with SLI and make the others Centrino 2-compatible. You're looking at a high-end collection of the latest mobile technology battling it out for benchmark supremacy and your hard-earned dollars. Read more