CUDA-Enabled Apps: Measuring Mainstream GPU Performance

Let’s Get Visual

Remember in Minority Report how Tom Cruise does his computing by gesturing on a transparent display? The UI concept is so captivating that it’s since been copied in a host of other venues, including TV commercials. Well, imagine a conventional video editor meeting a Minority Report UI then getting filtered through a Nintendo 64. That’s Super LoiLoScope, and “MARS” is the code-name for its new, CUDA-enabled version. No, MARS (as I’ll refer to it for brevity) isn’t Adobe Premiere. It’s not even Premiere Elements. Heck, it’s not even PowerDirector 7. But it is wickedly simple to use once you wrap your head around its hypergraphical, drag-and-drop-o-rama design. Better yet, it makes use of CUDA in H.264 encoding, decoding, and playback, making this one of the most thorough CUDA implementations seen yet.

Nvidia and LoiLo (the name of the company behing SuperLoiLoScope) set expectations very high with this application. The companies point out that while a quad-core CPU can compute at 100 GFLOPS, a 240-core GPU like the GTX 280 can nail 900 GFLOPS. As such, we should expect to see a significant encoding performance improvement with a GTX 280. Obviously, our results should be lower with the 9600 GT and 9800 GTX, but on a percentage improvement basis, we could reasonably expect to leave TMPGEnc in the dust thanks to the broader CUDA implementation.

Create a new thread in the US Reviews comments forum about this subject
This thread is closed for comments
56 comments
    Your comment
  • The 8800GS or with the new name 9600GSO goes for 60$ and delivers 96 stream processors. Would it be correct to assume that it would perform betwen the 9600 GT and 9800 GTX you reviewed?

    Other then that great article, been waiting for it since we got a sneak preview from Chris last week.
    1
  • And I'll never take Nvidia marketing seriously until they either stop singing about CUDA being the holy grail of computing, or this changes: "Aside from Folding@home and SETI@home, every single application on Nvidia’s consumer CUDA list involves video editing and/or transcoding."
    6
  • As more software will use CUDA, we will not only see a great boost in performance for e.g. video performance, but for parallel programing in general. This sky rocket this business into a new age!
    0
  • l0bd0nAs more software will use CUDA, we will not only see a great boost in performance for e.g. video performance, but for parallel programing in general. This sky rocket this business into a new age!

    Honestly, I dont think a proprietary language will do this. If anything, it's likely to be GPGPU's in general, run by Open Computing Language.(OpenCL)
    4
  • Are we both thinking about the same "Pirates 2"? Or am I missing something...
    4
  • Who knows it's just a clip he used he could be naming it anything for the hell of it.

    CUDA transcoding is very nice to someone that does H.264 transcoding at a high profile and lacks a 300+ dollar cpu who would spend hours transcoding a dvd on high profile settings.

    Else from that CUDA acceleration has just been more of a feature nothing like a main event. Although can easly be the main attraction to someone that does a good flow of H.264 trasncoding/encoding.

    Encoding/transcoding in h.264 high profile can easily make someone who is very content with their cpu and it's power become sad very quickly when they see the est time for their 30 min clip or something.
    2
  • I'm using CoreAVC since support was added for CUDA h264 decoding. I kinda feel stupid for buying a high end CPU (at the time) since playing all videos, no matter the resolution or bit-rate, leaves the CPU at near-idle usage.
    Vid card: 8600GTS
    CPU: E6700
    0
  • Well you lucked in considering not all of the geforce 8 series supports H.264 decoding etc.
    0
  • they should remove Adobe CS4 suite from there since Cuda transcoding is only posible with nvidia CX videocards not with normal gaming cards wich supports cuda.
    2
  • CUDA means Miracle in my language :-) I it will do those
    The sad thing is that ATI does not truly compete in CUDA department and there is not standard for it.
    -2
  • I was only really interested in the Badaboom benchmarks and I was fairly impressed but I seem to remember the last time you guys done an article based on GPU accelerated apps (Cuda vs Stream) Badaboom suffered from output quality issues something that hasn't been mentioned in this article. It's all very well a 9800GTX being able to encode HD video content in half the time if the final product is no good.
    0
  • Jean,

    Actually, I don't believe we've done a comparison between the two. However, I have read that comparison at other sites, and it's actually ATI's Stream app that has the quality issues. Version two of the software is on the way, and it purportedly fixes the quality issues (though it still isn't demonstrating much GPU scaling, from what I've seen thus far).
    1
  • cangeliniJean,Actually, I don't believe we've done a comparison between the two. However, I have read that comparison at other sites, and it's actually ATI's Stream app that has the quality issues. Version two of the software is on the way, and it purportedly fixes the quality issues (though it still isn't demonstrating much GPU scaling, from what I've seen thus far).
    yeah but chose your words carefouly since readers could be misslead on this one :) the quality of the transcoding is related to the aplication used not to the computing technology like cuda or stream.
    -1
  • Cangelini, Badaboom definitely has lower quality output compared to the newest x264 builds. I'd definitely like to take advantage of my 9600 GT, but not unless I can use it with Handbrake or some other app on my own terms (NOT BASELINE OR MAIN PROFILE.)
    0
  • I can haz chezberger?

    ATI

    CUDA

    CONA
    -8
  • SpadeMThe 8800GS or with the new name 9600GSO goes for 60$ and delivers 96 stream processors.

    The 9600GSO has 2 versions (ignoring VRAM variations), one with only 48 SPs (essentially a castrated G94, not G92).
    1
  • There is a plugin for people who do audio engineering/recording/mixing/mastering from this guy:

    http://www.nilsschneider.de

    It runs on CUDA, but TBH, it has not manifested itself as anything special just yet, it's more a "proof of concept". However, as someone who's been doing that kind of thing for years, any quad-core ever made is good enough for real-time audio work, so there's not much point in CUDA acceleration.
    0
  • Measuing? Do you not even have spellcheck now?
    -5
  • I enjoyed the article, and just like in the dual-core versus quad core debate, there remains few applications that can fully exploit CUDA.

    By the way, I have quick correction. The author writes, "...that can leverage parallelism in a way that jives with CUDA’s architecture." The correct word is "jibe" not "jive."
    0
  • CUDA is mostly about hype. Nothing really else.
    -2