Download the Tom's Hardware App from the App Store
The reference for current tech news
Yes No
Ads
Tom's Hardware > Forum > Graphics & Displays > Nvidia > [Solved] Simulating Neural Networks using GPGPU computing

[Solved] Simulating Neural Networks using GPGPU computing

Forum Graphics & Displays : Nvidia [Solved] Simulating Neural Networks using GPGPU computing

Best answer from EXT64.

Word :    Username :           
 

Hi all,

I have a few questions regarding GPGPU computing that I am very hopeful you all will be able to help me with. I am a graduate student getting my PhD in neuroscience and a major component of my work is simulating large scale neural networks that represent recordings I have performed on real neurons. Our lab is currently investing in a few Tesla systems that will be used to perform highly parallel simulations that we used to outsource to larger computational resources on campus.

My major goal here is to build myself a new system where I will be able to test my CUDA code before I run it on the bigger system. I would also like to not spend my entire life in the lab, so having a rig at home where I can do some of my work would be favorable. In addition, I occasionally have time to play games and would like my system to be able to perform decently with most modern titles.

I am going to lay out a couple of the options that I have put together so far and would really appreciate any comments the community could provide on the viability of any of the combinations. I am mostly torn between deciding on the GTX 400 or 500 series and determining whether I want to throw more money down for an Intel CPU or to stick with the lower cost AMD solution.

Theoretical System 1:

2 x EVGA GeForce GTX 460 (Fermi) SSC+ 1GB 256-bit GDDR5

1 x ASUS P8P67 WS REVOLUTION LGA 1155 Intel P67 / NVIDIA NF200 (16x, 16x, 8x, 8x)

1 x Intel Core i5-2500K Sandy Bridge 3.3GHz (3.7GHz Turbo Boost)

1 x G.SKILL Ripjaws X Series 16GB (4 x 4GB) 240-Pin DDR3 SDRAM DDR3 1600

1 x CORSAIR CMPSU-750TX 750W


Theoretical System 2:

2 x EVGA SuperClocked GeForce GTX 560 Ti (Fermi) 1GB 256-bit GDDR5

1 x ASUS M4N98TD EVO AM3 NVIDIA nForce 980a SLI ATX AMD Motherboard *listed wrong mobo

1x AMD Phenom II X6 1090T Black Edition 3.2GHz

1x G.SKILL Ripjaws X Series 16GB (4 x 4GB) 240-Pin DDR3 SDRAM DDR3 1600

1 x CORSAIR CMPSU-750TX 750W




Those are pretty much the two options that I am trying to choose between. I will most likely have 1 6 GB/s SATA drive in either system.

I would really appreciate any suggestions anyone might have based on experience with components of GPGPU computing in general. Thanks everyone!

Harrison


Message edited by Harrison_013 on 05-08-2011 at 12:41:12 AM
Reply to Harrison_013
Register or log in to remove.

I don't think cuda works with sli well; wouldn't it be better to get a faster single card?

------------------------------ Always be on the lookout for criminals who make PC's without PCI-e slots!
Reply to math1337

I'd go with the Phenom II if you need more physical cores/threaded apps. The Sandy Bridge is better in gaming, but I'm not too sure about the difference in threaded apps with HyperThreading off if you are going to overclock it.

------------------------------ Many a drug addict has killed a man for more, but have you ever heard of a gamer kill for more?
Reply to therandomuser

You're not going to OC a CPU when doing science. Bit errors will kill you.

Reply to kewlx25

Thanks for the replies guys.

Math1337:

I was thinking the same thing and I have read that SLI and CUDA coding can be considered perpendicular processes. Suggesting that SLI makes all the cards one addressable computing block where CUDA gives you the flexibility to discretely address each Stream Multiprocessor (SM) on the cards. With the two cards I was planning on running independent simulations that occasionally communicate through the CPU but I have also read that this sort of path can give rise to thread pile ups. I want to get as many SM's out of this rig as possible.


In regards to the AMD CPU I had a similar inclination but I have just read so many reviews about how good sandy bridge is that I was beginning to doubt my judgement. I have always leaned toward AMD and I think the X6 with definitely provide me with what I am looking for.

I you guys have anymore advice as to whether I should choose a single high end card or dual mid range cards I would really appreciate it.

Harrison

Reply to Harrison_013

The 460/560 are relatively poor CUDA cards. Their architecture is better for gaming than CUDA (they will work, but at 2/3 the power you'd expect). I'd suggest a 470 or 570 for a CUDA comp. Another thing to remember, NVidia artificially limited the double precision on the gaming cards to 1/8 SP (vs the hardware limit of 1/2 the Teslas see) so if you are doing DP, the gaming card will be a lot slower than your Teslas at the lab.

Multi cards just got a lot easier with the 4.0 Toolkit thanks to the addition of direct GPU to GPU copy and the memory addressing you mentioned. However it is still harder than doing a single card.

Sandybridge is the best processor choice, however if going AMD allows you to upgrade the GPU to 470/570, then the slight downgrade will be worth it. As the host processor is primarily driving the client CUDA GPUs, fast single thread performance is important (moreso than having more threads).

------------------------------ 4 x AMD 6172 > Tyan S8812 > 32GB > X-1050 | i7 970 > P6X58D-E > 7970 3GB > 24GB | i7 3770k > P8Z77-V Pro > Classified 580 3GB > 16GB

http://piro.pirocast.net/badge/none/fah02/800/0/0/0/0/0/0/0/0/0/0/0/0/255/255/255/557101.png
Reply to EXT64

What do you think about the 465?

Thanks for bringing up the FP64 point as I had completely forgotten that they didn't have the same double precision capability. What do you think about going with a single lower end quaddro to up the FP64 processing speed? Most of the work I am doing uses vector arrays where each variable is expressed as a double precision number. This, double precision speed is going to be a pretty big factor. The quaddro's are just so darn expensive.

Thanks Again

Harrison

Reply to Harrison_013

Might be better off setting up a remote desktop with the lab computers. Still waiting for a cloud based neural network to go up. Skynet anyone?


Message edited by upgrade_1977 on 05-10-2011 at 09:28:12 PM
------------------------------ Thermaltake armor+ VH6000BWS Case, Corsair AX1200 PSU, i7 920 @ 3.2Ghz, Gigabyte EX58 UD3R-SLI, 8GB (2X4GB) Corsair vengeance 1600,
2 X GTX 480 in SLI (H2O), X-FI sound card, Hitatchi 1TB 7200rpm hard drive, Windows7 64bit,
Asus VG236H 23" 3D monitor
Reply to upgrade_1977
Best answer

Well, I often use double precision (except when folding) and the 470 does pretty decently. The 465 is the same architecture as the 470/480, so it will scale down in speed by the number of cores you lose. And one thing I misspoke on earlier, the 460/560 is great in single precision, but poor in double (so what I said was true for your case, just wanted to clarify).

 

On Newegg at least, the 465 is about $175 after MIR, which is in the same ballpark as the 460. Unfortunately they are all the reference cooler, which gets a little hot for my tastes.

 

The cheapest Quadro that has good double precision is the Quadro 4000 (the rest are handicapped as bad as the gaming cards). http://www.nvidia.com/content/PDF/ [...] evised.pdf
And it is pretty expensive ($700+).

 

NVidia has kind of dropped the ball (purposefully) on us scientific programmers on a budget.

 

The 465 has very limited models left (all reference and hot), the 470 is gone, the 480 has a couple models left but is $280+, the 460/560 were not designed for double precision thus are relatively slow at it, and the 570/580 is too much $$.

 

With all that, I'd suggest looking into the 560Ti. I know I said it has slow DP, however it has a much higher core clock than my 470 which should make up for it some. If that is too high, it is a close race between taking a performance hit and getting the power efficient 460 or getting the hot and power hungry 465.


Message edited by EXT64 on 05-10-2011 at 05:37:31 AM
------------------------------ 4 x AMD 6172 > Tyan S8812 > 32GB > X-1050 | i7 970 > P6X58D-E > 7970 3GB > 24GB | i7 3770k > P8Z77-V Pro > Classified 580 3GB > 16GB

http://piro.pirocast.net/badge/none/fah02/800/0/0/0/0/0/0/0/0/0/0/0/0/255/255/255/557101.png
Reply to EXT64

This topic has been closed by Maziar

------------------------------ The end is near...
Reply to Maziar
Register or log in to remove.
Tom's Hardware > Forum > Graphics & Displays > Nvidia > [Solved] Simulating Neural Networks using GPGPU computing
Go to:

There are 1987 identified and unidentified users. To see the list of identified users, Click here.

Please mind

You are about to answer a thread that has been inactive for more than 6 months.
If you still wish to proceed, please ensure that your posting is original and does not duplicate or overlap any prior responses to this thread.

Add a reply Cancel
  • Ask the community now
  • Publish
Ad
Ads
Latest best answer
Graphics Card Help for a Casual
By geofelt, 3 hours ago:

As a standard, a pci-e x16 graphics card slot can deliver up to 75w to a graphics card...

Best offers
They won a badge
Join us in greeting them
Top experts