Nvidia's Tegra 4 GPU: Doubling Down On Efficiency

Tegra 4 Features And Functionality

On paper, Tegra 4 is an OpenGL ES 2.0 architecture. It’s missing just a couple of capabilities that would have been required for ES 3.0 compatibility—mainly, FP32 precision (plus the formats that go along with it) and ETC2 RGB compression. Neither is significant, Nvidia argues. The precision would only be important for general-purpose computing, while Tegra 4 does support DXT compression to achieve many of the same things as ETC2.  

Enabling HDREnabling HDR

On the DirectX side, Tegra 4’s GPU supports the Direct3D 9_1 feature level. Higher feature levels want FP32 precision in the fragment shader as well, and Nvidia’s is limited to FP20. They also want R32F format support, along with a couple of other texture formats that Tegra 4 can’t do. The company says it’s in talks with Microsoft to expose the 9_3 surface formats though, which are supported in hardware and used for HDR, without the API’s precision requirement. Until something happens there, higher-level features like instancing can’t be used under Windows RT.

Even still, Tegra 4 includes a host of new features that Nvidia believes will improve the quality of optimized games. Take anti-aliasing support as an example. Previously, Tegra 3 offered coverage sampling anti-aliasing, which calculated coverage values within a pixel. Because the value of a coverage sample depends on the pixel’s composition, however, CSAA ranges from somewhat to not at all effective. Tegra 4 solves this with full 2x and 4x multi-sample anti-aliasing support, aided by the implementation of color and Z compression.

Tegra 4 also picks up a 24-bit Z- and 8-bit stencil buffer, compared to Tegra 3’s 16-bit Z-buffer. Array textures, cube maps, and texture border color likewise make Nvidia’s list of added features.

ShadowsShadows

The inclusion of depth textures and percentage closer filtering are for shadow maps, allowing high-quality shadows with filtering around the edges. Of course, it’s possible to achieve more realism with DirectX 11 and a TFLOP-class GPU. But this DirectX 9-era technique is still the way some console titles handle shadows. It’s not completely free in Tegra 4; it’s a hardware-based feature though, so you don’t have to burn shader cycles on the filtering.

Nvidia upgrades Tegra 4’s maximum texture resolution to 4K x 4K, up from Tegra 3’s 2K x 2K. In theory, that makes it possible for a game developer to use assets from a PC or console title, yielding nice high-res textures.

Beyond better-looking textures, Nvidia says there are a number of games in development with true HDR. Tegra 4 enables these with FP16 filter and blend support, MRTs, and the requisite surface formats. This was some of the most talked about functionality back in the GeForce 6800 days (remember the Far Cry 1.3 patch that added HDR support to an already gorgeous-looking game?). Now, eight years later, we’re getting a taste of it in the mobile space.

Create a new thread in the US Reviews comments forum about this subject
This thread is closed for comments
41 comments
    Your comment
    Top Comments
  • Memnarchon
    A Sunday article? :O
    23
  • blazorthon
    deedee2die4Nvidia staying on top, the best of the best!


    Uhh, no... T4 isn't supposed to be out for like six months, yet it's already not as fast as some of Qualcomm's latest. Nvidia is improving, but as usual, they're staying a little behind in technology.
    20
  • s3anister
    I'm always amazed with the progress made in strides in this ultra-competitive sector so it's nice to see nvidia finally hit 28mm with Tegra 4. I'm sure some of their performance gains can be attributed to this.
    18
  • Other Comments
  • s3anister
    I'm always amazed with the progress made in strides in this ultra-competitive sector so it's nice to see nvidia finally hit 28mm with Tegra 4. I'm sure some of their performance gains can be attributed to this.
    18
  • levin70
    Charlie at semiaccurate is correct. The Tegra 4 is DOA. Almost no one will be using it. Everyone else is already ahead of where the T4 is today, and it hasn't even launched. How many design wins were noted? 1?

    Yeah, says it all.
    -3
  • Memnarchon
    A Sunday article? :O
    23
  • deedee2die4
    Nvidia staying on top, the best of the best!
    -3
  • blazorthon
    deedee2die4Nvidia staying on top, the best of the best!


    Uhh, no... T4 isn't supposed to be out for like six months, yet it's already not as fast as some of Qualcomm's latest. Nvidia is improving, but as usual, they're staying a little behind in technology.
    20
  • aicom
    levin70Charlie at semiaccurate is correct. The Tegra 4 is DOA. Almost no one will be using it. Everyone else is already ahead of where the T4 is today, and it hasn't even launched. How many design wins were noted? 1?Yeah, says it all.

    Nobody is ahead of Tegra's four Cortex A15 cores. Krait is at less performance than A15 (until the refresh at least). Samsung's got Exynos 5 Octa, but that's not out yet either and T4 will probably still top it in the GPU performance department. Speaking of which, Tegra 4 has the most powerful GPU in floating-point of anyone (including the iPad 4) with 74.8 TFLOPS @ 672 MHz. It only takes a 825 MHz Cortex A15 to match a 1.6 GHz A9, and Tegra 4 is supposed to ship at 1.9 GHz. Unfortunately, TDP does go up in the process.

    You also have to look at where these parts are targeted. Krait is really gunning for phone design wins and they have many. It's a very power efficient chip that found its way into some very nice phones. Tegra 4 is not aimed at that market; Tegra 4i is. Tegra 4 will have a much higher TDP than 4i (and Krait) and will get substantially higher performance as a result.
    6
  • tjosborne
    Hey guys, I am considering getting a Asus transformer prime tablet with the tegra 3. Would it be best to wait till this processor ends up in a tablet to get one?
    0
  • Anonymous
    So at 1.3Gpix/s, Nvidia has just admitted to 10x overdraw...per second? So we're looking at 9~10 frames per second oh high res displays. Lag lives on.
    4
  • PreferLinux
    aicomNobody is ahead of Tegra's four Cortex A15 cores. Krait is at less performance than A15 (until the refresh at least). Samsung's got Exynos 5 Octa, but that's not out yet either and T4 will probably still top it in the GPU performance department. Speaking of which, Tegra 4 has the most powerful GPU in floating-point of anyone (including the iPad 4) with 74.8 TFLOPS @ 672 MHz. It only takes a 825 MHz Cortex A15 to match a 1.6 GHz A9, and Tegra 4 is supposed to ship at 1.9 GHz. Unfortunately, TDP does go up in the process.You also have to look at where these parts are targeted. Krait is really gunning for phone design wins and they have many. It's a very power efficient chip that found its way into some very nice phones. Tegra 4 is not aimed at that market; Tegra 4i is. Tegra 4 will have a much higher TDP than 4i (and Krait) and will get substantially higher performance as a result.

    You mean Gigaflops, not Teraflops.
    4
  • blazorthon
    aicomNobody is ahead of Tegra's four Cortex A15 cores. Krait is at less performance than A15 (until the refresh at least). Samsung's got Exynos 5 Octa, but that's not out yet either and T4 will probably still top it in the GPU performance department. Speaking of which, Tegra 4 has the most powerful GPU in floating-point of anyone (including the iPad 4) with 74.8 TFLOPS @ 672 MHz. It only takes a 825 MHz Cortex A15 to match a 1.6 GHz A9, and Tegra 4 is supposed to ship at 1.9 GHz. Unfortunately, TDP does go up in the process.You also have to look at where these parts are targeted. Krait is really gunning for phone design wins and they have many. It's a very power efficient chip that found its way into some very nice phones. Tegra 4 is not aimed at that market; Tegra 4i is. Tegra 4 will have a much higher TDP than 4i (and Krait) and will get substantially higher performance as a result.


    S4 Pro is a faster CPU IIRC. IDK about how the graphics compares and won't comment about it.

    Nvidia, like I said, is getting better, but they're still going to be a little behind. They're making up a lot of ground here, especially with how they're making Tegra 4 and Tegra 4i instead of a single SoC to take both places, but they seem like they'l still have a little room to make up, at least in CPU performance, to be the best. Like I said before (at least in other articles about it), they'll still be near the top either way.
    -1
  • blazorthon
    ankit0x1can this play crysis? ('console port' on arm)


    Just keep the texture settings low and the resolution at minimum and it might be almost playable if someone manages to get it to work on ARM ;)
    1
  • sanilmahambre
    Tegra 4 in Razer gaming tab is best example of NVIDIA's superiority.
    -8
  • mayankleoboy1
    blazorthonUhh, no... T4 isn't supposed to be out for like six months, yet it's already not as fast as some of Qualcomm's latest. Nvidia is improving, but as usual, they're staying a little behind in technology.


    +1
    In this hyper competitive market, every SoC company needs atleast one product that is best in one area over every other SoC. Middling in all the areas is what T3 did, which made it a failure.
    2
  • blazorthon
    mayankleoboy1+1In this hyper competitive market, every SoC company needs atleast one product that is best in one area over every other SoC. Middling in all the areas is what T3 did, which made it a failure.


    To be fair, although in many ways not an exceptional SoC, T3 was a very successful product, wasn't it? They managed to get Tegra 3 in one form or another around to a lot of well-selling devices.
    2
  • mayankleoboy1
    blazorthonTo be fair, although in many ways not an exceptional SoC, T3 was a very successful product, wasn't it? They managed to get Tegra 3 in one form or another around to a lot of well-selling devices.


    I would say that T3 came with a bang and lots of fanfair, but as soon as the Krait4 came, nobody wanted to be seen with a T3.
    After that T3 pretty quickly started appearing in the Tier2 phones. All T1 phones used either a KraitS4 or the Exynos4. I would even say that Ms's Surface was a semi-fail in performance because of the T3. If MS had went with a dual core KraitS4, i am pretty sure Surface would have received much better reviews.
    3
  • blazorthon
    mayankleoboy1I would say that T3 came with a bang and lots of fanfair, but as soon as the Krait4 came, nobody wanted to be seen with a T3.After that T3 pretty quickly started appearing in the Tier2 phones. All T1 phones used either a KraitS4 or the Exynos4. I would even say that Ms's Surface was a semi-fail in performance because of the T3. If MS had went with a dual core KraitS4, i am pretty sure Surface would have received much better reviews.


    +1 I can agree with that :)
    1
  • mayankleoboy1
    As Nvidia is not a market leader in SoC's, they cant do anything revolutionary in the T4 SoC . Like increasing the die area for adding more GPU muscle. Because they dont know if the OEM's will accept that or not. So whatever they do, it will always be conservative.
    Apple and Samsung have no such problem. THey can afford be revolutionary, without much consequences.
    -4
  • aicom
    Not sure where people are getting the idea that Krait is faster than A15. S4 Pro's max clock is 1.7 GHz, while Tegra 4's is 1.9 GHz, so A15 already wins the clock speed match. In addition, Krait is a touch slower than A15 (3.3 DMIPS/MHz vs 3.5 DMIPS/MHz).

    http://en.wikipedia.org/wiki/Krait_(CPU)
    http://en.wikipedia.org/wiki/Snapdragon_(system_on_chip)#Snapdragon_S4
    5
  • mayankleoboy1
    ^
    Krait S4 is a modified A9 architecture, that performs between A9 and A15 archs, but closer to A9.
    Apples Swift architecture is also a modified architecture that performs between A9 and A15, but much closer to the A15 arch.
    1
  • blazorthon
    Anonymous said:
    Not sure where people are getting the idea that Krait is faster than A15. S4 Pro's max clock is 1.7 GHz, while Tegra 4's is 1.9 GHz, so A15 already wins the clock speed match. In addition, Krait is a touch slower than A15 (3.3 DMIPS/MHz vs 3.5 DMIPS/MHz).

    http://en.wikipedia.org/wiki/Krait_(CPU)
    http://en.wikipedia.org/wiki/Snapdragon_(system_on_chip)#Snapdragon_S4


    Anonymous said:
    ^
    Krait S4 is a modified A9 architecture, that performs between A9 and A15 archs, but closer to A9.
    Apples Swift architecture is also a modified architecture that performs between A9 and A15, but much closer to the A15 arch.


    If we're going by the links in the above post, Krait is much closer to A15 than to A9. This is still ignoring the fact that those types of measurements are extremely general and oftentimes only loosely related to performance in the real world.

    My bad, it's the newer top Snapdragon 800 version that Tegra 4 doesn't beat in CPU performance, thanks for the clarification. Regardless, I seem to have given Nvidia too little credit with Tegra 4.
    1