Shop for Motherboards
DX48BT2 Motherboard DX48BT2 Motherboard

Compare the top 5 lowest prices by hovering your mouse over the product names on the left

$299.99
Rampage Formula Motherboard Rampage Formula Motherboard $359.99
P5Q Deluxe Motherboard P5Q Deluxe Motherboard $229.99
GA-MA78GM-S2H Motherboard GA-MA78GM-S2H Motherboard $94.99
M2A-VM HDMI Motherboard M2A-VM HDMI Motherboard $79.99

See More Products...

Motherboards Reviews
11-Way P45 Motherboard Shootout

11-Way P45 Motherboard Shootout
Two formerly high-end features—PCI Express 2.0 transfers and native Crossfire support—separate Intel’s new mainstream P45 Express chipset from its earlier P35 Express. Eleven motherboards challenge the market with features, performance and value. Read More

$5K PC Takes On $4.6M Supercomputer

1:50 PM - May 30, 2008 by Wolfgang Gruener
Source: Tom's Hardware US – Category : Motherboards
35 comments

 

Antwerp (Belgium) - Recent advances in general purpose GPU computing are beginning to shift perceptions in supercomputing applications. Belgian researchers have assembled a relatively simple enthusiast PC with an emphasis on graphics processing capability, which beats a multi-million dollar supercomputer in its target application.

The desktop PC, called Fastra, was built with a focus on the development of new computational methods for tomography. Tomography is a technique used in medical scanners to create three-dimensional images of the internal organs of patients, based on a large number of x-ray photos that are acquired over a range of angles. As these 3D images can be quite large, advanced reconstruction techniques can sometimes require weeks of computation time on a regular PC. Which means that supercomputers are usually required to process computer tomography (CT) images.

The research group Vision Lab at the University of Antwerp came up with a different solution and constructed a PC that integrates four GeForce 9800 GX2 graphic cards (with a total of eight GPU cores) that runs CUDA-optimized tomography applications. The specifications include a MSI K9A2 Platinum motherboard, an AMD Phenom 9850 CPU, 4 x 2 GB Corsair TWINX DDR2 PC6400 memory, a Samsung Spinpoint F1 750 GB hard drive, a Thermaltake Toughpower 1500W Modular power supply unit as well as four MSI 9800GX2 cards. The researchers said that the cost of the system was less than 4000 Euro or about $5300.

It isn’t quite a tricked-out gaming system and the 3DMark06 score is just above what your average PC can manage to come with today (12,603 points). However, it is the CUDA application where this PC really shines. Compared to the 512-processor, $4.6M CalcUA supercomputer purchased in 2005, the PC can be more than a match: The projection of image slices took 23.4 seconds on the supercomputer and 35.1 seconds on the PC. The reconstruction of the slices was displayed after 67.4 seconds on the supercomputer systems and after just 52.2 seconds on the PC. The Vision Lab crew now believes that a real-time construction is possible through GPUs and is now building a cluster of such systems.

While it is an impressive example how GPUs can be applied in non-traditional ways, there are a few notes to be added. Of course, GPUs cannot replace traditional supercomputers, which still can be applied to applications with a broader range. Also, supercomputers usually carry huge memories, often in the Terabyte range, which cannot be matched by today’s GPU clusters. When we talk to scientists working with supercomputers and GPUs, they typically believe that future supercomputers will not completely transition to GPU clusters, but may develop into systems that consist of a traditional supercomputer structure as well as GPU capability.

An interesting side note about the Fastra PC is its motherboard. Eagle-eyed readers may have noticed that the MSI K9A2 Platinum board is not an Nvidia SLI-based board, but uses AMD Crossfire (780 chipset). The simple reason to choose this board may have been cost, but it is unlikely to impact the performance of the system: CUDA does not support SLI at this time, which means that the GPUs have to communicate with each other as well as with the CPU via PCI Express. The researchers claim that they have not seen any impact on performance and the GPUs apparently are scaling well.


React! Return to news index
Add to your Google homepage Add to My Yahoo!
Talkback
blackopsninja 05/30/2008 8:23 AM
Hide
-0+
blackopsninja
sweet
dittopb 05/30/2008 8:29 AM
Hide
-2+
dittopb
I think the reason why they choose MSI K9A2 Platinum board is the PCIe slot spacing. I checked Asus and Gigabyte boards with quad PCIe 2.0 support and found to be unevenly spaced and would be hard to fit 4 dual-slotted GPUs.
hixbot 05/30/2008 8:42 AM
Hide
-1+
hixbot
Ok, now we know how well 8 GPU cores handle supercomputer-tasks. Lets see how the $4.6M supercomputer can handle Crysis!
hughyhunter 05/30/2008 8:56 AM
Hide
-0+
hughyhunter
It's about time someone realizes we build for shear power!!!
lopopo 05/30/2008 10:13 AM
Hide
-0+
lopopo
It's about time someone other than folding did this. Only hope others will do the same.
dobby 05/31/2008 12:13 PM
Hide
--3+
dobby
interesting choice of CPU, i know that this doesnt use the CPU directly for it Processing power, but you would think that they would go the full hog and use a skull trial system
khaydin 05/31/2008 1:26 AM
Hide
-0+
khaydin
8GB of ram on Windows XP? Hopefully it's 64bit XP otherwise they're wasting more than half of that ram...
customisbetter 05/31/2008 1:43 AM
Hide
-0+
customisbetter
how does one, let alone 4 9800 gx2s get 12k in 3dmark?
hughyhunter 05/31/2008 2:29 AM
Hide
-0+
hughyhunter
customisbetter wrote :

how does one, let alone 4 9800 gx2s get 12k in 3dmark?



I agree... I'm interested to hear a reason why. Also... I didnt know there was support for four GX2's. How do you run 4 of them? Their is only 1 connector!
khaydin 05/31/2008 5:24 AM
Hide
-0+
khaydin
hughyhunter :
I agree... I'm interested to hear a reason why. Also... I didnt know there was support for four GX2's. How do you run 4 of them? Their is only 1 connector!



They're not setup in SLI since the motherboard and CUDA doesn't support SLI. So if they were doing 3dmark only 1 of those 4 would be getting benched.
navvara 05/31/2008 5:45 AM
Hide
-0+
navvara
customisbetter :
how does one, let alone 4 9800 gx2s get 12k in 3dmark?



They are not running SLI. They are just plugged in the PCI-X and accessed by the app indepentently.

romulus47plus1 05/31/2008 7:36 AM
Hide
-0+
romulus47plus1
Yea! Fastra!
drho 05/31/2008 7:35 AM
Hide
-0+
drho


Higher memory bandwidth with the AMD versus Intel until Intel implements a real memory bus. CPU memory bandwidth to GPU memory can be a significant factor.
apache_lives 05/31/2008 9:11 AM
Hide
--2+
apache_lives
gawd MSI, atleast the $4.6M will last longer then a few months :D

skulltrail with the quad channel memory would have been a wiser and more reliable choice although the cost would have become alot higher, and when CUDA starts to support SLi, perhaps the motherboard choice would come back to bite them.
martel80 05/31/2008 11:35 AM
Hide
-0+
martel80
navvara :
They are not running SLI. They are just plugged in the PCI-X and accessed by the app indepentently.


PCI-X is something completely different from PCI-express.
http://en.wikipedia.org/wiki/PCI-X

And if CUDA doesn't support SLI, I wonder if this computer even uses both parts of the GX2 since this single card already runs in SLI.
Anonymous 05/31/2008 2:13 AM
Hide
-0+
It can. CUDA API will reports 2 GPUs presence when a 9800 GX2 is attached.
navvara 05/31/2008 3:43 AM
Hide
-0+
navvara
ismailfaruqi :
It can. CUDA API will reports 2 GPUs presence when a 9800 GX2 is attached.



My point exactly. The reason they use Nvidia video cards on intel chipsets is the extra pci-x (or express who cares about wording) lanes. CUDA supports multiple GPU's (it will detect all compatible Nvidia GPU's attached to a system) but not SLI.

Although this news is interesting I wonder why they didn't use tesla? It would certainly scale better with tesla solutions being rack mountable (though more expensive to get the same crunching power).
Wild9 05/31/2008 8:31 AM
Hide
-0+
Wild9
Quite an astonishing story, in my opinion. A super-computer in your home..now just imagine a whole planet full of these systems working on cancer cures. I'm sure there are some applications that need more memory than a desktop can handle, but this is astounding.
hughyhunter 05/31/2008 8:35 AM
Hide
--1+
hughyhunter
navvara wrote :

CUDA supports multiple GPU's (it will detect all compatible Nvidia GPU's attached to a system) but not SLI.



I dont get it... what's the difference?
Jaevric 05/31/2008 11:55 AM
Hide
-0+
Jaevric
Is it bad that my first thought was "I hope they bought EVGA so they can step-up in a couple of months?"

Still, definitely really cool.
LkS 06/01/2008 12:22 PM
Hide
-0+
LkS
We as enthusiasts should be proud that our constant upgrading in search of ultimate gaming performance has funded the development of consumer affordable technology so advanced that it can benefit important applications such as this.
schouwla 06/01/2008 1:52 AM
Hide
-0+
schouwla
The is very interesting.
Intel and AMD are working hard now to make it easier to program to their multi-core monsters, but right now Nvidia has the upper hand.


Lars
mantech 06/01/2008 2:07 AM
Hide
-0+
mantech
Well it is very interesting to know the facts about this Fastra motherboard
I believe there are quite few other board out there that haven't been tested for various combination. Is impresive how technology has evolved over time. With such tech so advance we might some day have a Super Desktop PC in our Home.
Anonymous 06/01/2008 2:16 AM
Hide
-0+
Why the inherent bias against AMD? They chose AMD to save money? So instead of paying an outrageous $5500 for a Core2Quad system, they paid a more reasonable $5300 for a Phenom CPU + mobo? LMAO. As the above poster said, AMD have had an advantange on memory and system bandwidth ever since the day the Athlon 64 first came out. This is exactly why there are so many more supercomputers running Opterons than Xeons...
schouwla 06/01/2008 4:25 AM
Hide
-0+
schouwla
This disussion it not so much about CPU architecture as it is using the GPU obvious vector adavntages over the CPU.

I agree that AMD firestream API might be a little smarter at using CPUs and GPUs for the same job.

Looking forward to some major progress in this field really soon.

Lars
schouwla 06/01/2008 6:22 AM
Hide
-0+
schouwla
As someone already pointed out why did they not buy a Tesla
e.g. NVIDIA Tesla™ D870 $5000
Achieve up to 700 GFLOPS of performance (1 TeraFLOP peak) with both C870 GPUs
hughyhunter 06/01/2008 7:26 AM
Hide
-1+
hughyhunter
schouwla wrote :

As someone already pointed out why did they not buy a Tesla
e.g. NVIDIA Tesla™ D870 $5000
Achieve up to 700 GFLOPS of performance (1 TeraFLOP peak) with both C870 GPUs



They wanted to compare consumer level (gaming)... The average joe doesnt buy a Tesla... They buy 8800GT's.
Anonymous 06/01/2008 8:30 AM
Hide
-1+
Jay-D :
Why the inherent bias against AMD? They chose AMD to save money? So instead of paying an outrageous $5500 for a Core2Quad system, they paid a more reasonable $5300 for a Phenom CPU + mobo? LMAO. As the above poster said, AMD have had an advantange on memory and system bandwidth ever since the day the Athlon 64 first came out. This is exactly why there are so many more supercomputers running Opterons than Xeons...



Well, the reason in this case is... currently only MSI K9NA2 Platinum -which is an AMD mainboard- that has 4 PCI-express 16x slots with spacing evenly (2 slots each), which allows 4 9800 gx2 combination. The Intel platform only has 3 PCI-express slots at most (either nvidia or skulltrail).

schouwla :
As someone already pointed out why did they not buy a Teslae.g. NVIDIA Tesla? D870 $5000Achieve up to 700 GFLOPS of performance (1 TeraFLOP peak) with both C870 GPUs



Of course for price reason. With 4x 9800 gx2, this $5300 system has theoretical peak of 4 TFLOPS, compared to $5000 Tesla one.
dittopb 06/01/2008 10:37 AM
Hide
-0+
dittopb
Quad CrossFire (8X+8X+8X+8X)}; 4X per GPU
Do you think there's enough bandwidth to feed the GPUs?
schouwla 06/01/2008 1:46 AM
Hide
-0+
schouwla
Also ATI seems to be double precission floating points where as Nvidia does not. So it sucks for Monte Carlo simulations.

Lars
Anonymous 06/01/2008 5:16 AM
Hide
-0+
[QUOTE]Well, the reason in this case is... currently only MSI K9NA2 Platinum -which is an AMD mainboard- that has 4 PCI-express 16x slots with spacing evenly (2 slots each), which allows 4 9800 gx2 combination. The Intel platform only has 3 PCI-express slots at most (either nvidia or skulltrail).[/QUOTE]

But the article suggests that it was chosen to save an amount of money equal to < 5% of the total cost of the system, which is just absurd... Aside from the additional PCI-E slot, AMDs integrated memory controller more than makes up for the nominal loss of IPC. Regardless, I have no doubt that there was some serious thought put into the components the team used, and that cost was not a factor.

Note You are going to post a comment as anonymous.