Nvidia Debuts GK110-based 7.1 Billion Transistor Super GPU

We know all about Nvidia's GK104 chip, which has most recently been flying through our labs in a dual configuration in the sexy GeForce GTX 690. While that card is the king of gaming (for now), the big daddy of Nvidia Kepler-based GPUs isn't even here yet.

This week at the Nvidia GPU Technology Conference in San Jose, the graphics company took the wraps off of the Kepler-based GK110 GPU that will power the Tesla K20 – a professional-level graphics card for serious business.

Nvidia Tesla K20

The big reveal at this conference from a hardware standpoint definitely is the GK110, which packs an astonishing 7.1 billion transistors on a 28nm process. It also promises to have all the compute features that some were feeling missing from the GK104. Nvidia CEO Jen-Hsun Huang said at a post-keynote Q&A that the GK110 is "the most complex IC commercially available on planet."

7.1 billion transistors in the GK110

In comparison, next in complexity and transistor count is a chip from Xilinx called the Virtex-7 2000T FPGA, which integrates 2 million logic cells and 6.8 billion transistors. To help put that in better perspective, Intel's 10-core Xeon Westmere-EX has 2.6 billion transistors.

The GK110 features 15 SMX units with 192 CUDA cores per unit, which gives a grand total of 2,880 CUDA cores. Nvidia hasn't yet revealed full specifications on the Tesla K20 products yet, but indicated that not all boards will have all 15 SMX units running. Regardless, people can safely expect the use of around at least 2,496 CUDA cores from most Tesla K20 implementations.

The memory bus has been upgraded to 384-bit with six 64-bit controllers in parallel. As for memory capacity itself, Nvidia did not specify. When pushed for an answer, Huang said simply, "Not enough."

To clarify, he added, "As much fast memory as possible behind 384 bits," but no matter what, it will "likely not be enough, because the problems [the K20 is] trying to solve are so huge."

Unfortunately, the GK110 isn't quite finished yet, so we won't be seeing this one until Q4 2012. When it does become available the GK110 GPU is expected to be incorporated into the new Titan supercomputer at the Oak Ridge National Laboratory in Tennessee and the Blue Waters system at the National Center for Supercomputing Applications at the University of Illinois at Urbana-Champaign.

For those who want a Kepler-based Tesla product today, Nvidia also announced was the GK104-based Tesla K10, which is available immediately. This accelerator board features two GK104 Kepler GPUs that deliver an aggregate performance of 4.58 teraflops of peak single-precision floating point and 320 GB per second memory bandwidth.

The Tesla K10 has already found use in the oil and gas industries, as well as signal and image processing.

 

Nvidia Tesla K10

"Fermi was a major step forward in computing," said Bill Dally, chief scientist and senior vice president of research at Nvidia. "It established GPU-accelerated computing in the top tier of high performance computing and attracted hundreds of thousands of developers to the GPU computing platform. Kepler will be equally disruptive, establishing GPUs broadly into technical computing, due to their ease of use, broad applicability and efficiency."

As Nvidia CEO Jen-Hsun Huang detailed at his keynote, the Kepler-based Tesla cards feature three new innovations that help add to the edge over Fermi. They are:

  • SMX Streaming Multiprocessor -- The basic building block of every GPU, the SMX streaming multiprocessor was redesigned from the ground up for high performance and energy efficiency. It delivers up to three times more performance per watt than the Fermi streaming multiprocessor, making it possible to build a supercomputer that delivers one petaflop of computing performance in just 10 server racks. SMX's energy efficiency was achieved by increasing its number of CUDA architecture cores by four times, while reducing the clock speed of each core, power-gating parts of the GPU when idle and maximizing the GPU area devoted to parallel-processing cores instead of control logic.
  • Dynamic Parallelism -- This capability enables GPU threads to dynamically spawn new threads, allowing the GPU to adapt dynamically to the data. It greatly simplifies parallel programming, enabling GPU acceleration of a broader set of popular algorithms, such as adaptive mesh refinement, fast multipole methods and multigrid methods.
  • Hyper-Q -- This enables multiple CPU cores to simultaneously use the CUDA architecture cores on a single Kepler GPU. This dramatically increases GPU utilization, slashing CPU idle times and advancing programmability. Hyper-Q is ideal for cluster applications that use MPI.
     

Read more at our liveblog of the Nvidia GTC keynote, and find out what applications Nvidia has planned for gaming in the cloud with GeForce Grid.

Read more from @MarcusYam on Twitter.

Marcus Yam
Marcus Yam served as Tom's Hardware News Director during 2008-2014. He entered tech media in the late 90s and fondly remembers the days when an overclocked Celeron 300A and Voodoo2 SLI comprised a gaming rig with the ultimate street cred.
  • fb39ca4
    If it will help oil companies, its a bad thing.
    Reply
  • fb39ca4
    BTW where is the fan for that thing? Is the reference board water cooled?
    Reply
  • bardacuda
    2880 cores? Booya! Last rumored specs I saw were 2304 cores. Hope to see the gaming version of this card...even if it's just to drool over and never buy.
    Reply
  • tipoo
    fb39ca4BTW where is the fan for that thing? Is the reference board water cooled?
    These things would be sitting in clusters with powerful push/pull fans in the casing. It's not meant to be a standalone card in a PC.
    Reply
  • tipoo
    tipooThese things would be sitting in clusters with powerful push/pull fans in the casing. It's not meant to be a standalone card in a PC.
    Here's how they look
    http://www-sop.inria.fr/dream/pmwiki/uploads/Public/Public/nvidia-tesla.jpg
    Reply
  • Combat Wombat
    fb39ca4BTW where is the fan for that thing? Is the reference board water cooled?
    Bill Dally is a pretty smart cookie... I think they will put fans on it heh heh heh.

    This will make a prime Xmas Gift... being released in Q4.. Anyone feel free to buy me one hahahah.
    Reply
  • tipoo
    Combat WombatBill Dally is a pretty smart cookie... I think they will put fans on it heh heh heh.This will make a prime Xmas Gift... being released in Q4.. Anyone feel free to buy me one hahahah.

    You guys are aware this $2000+ card would not play any of your games at all, right? Nor would it even work in a PC. I'd buy you one if you gave me 4000 dollars though :P
    Reply
  • upgrade_1977
    And yet still no gtx680 - 690 boards anywhere to be seen.
    Reply
  • Combat Wombat
    tipooYou guys are aware this $2000+ card would not play any of your games at all, right? Nor would it even work in a PC. I'd buy you one if you gave me 4000 dollars thoughIt's more the bragging rights more than anything.
    Reply
  • hajila
    But can it play Crysis?
    Reply