Sign in with
Sign up | Sign in

The East-Side GPU

The AMD A8-3500M APU Review: Llano Is Unleashed
By

Llano’s CPU side borrows heavily from existing processor technology, so it shouldn’t be a surprise that the GPU portion of the die is also quite similar to Radeon graphics cards on the market today. The Sumo core is essentially an updated version of the Redwood GPU found in the Radeon HD 5500 and 5600 cards.

As you can see, there aren’t many differences between them until you look outside the hub and render back-ends. Llano’s GPU accesses memory through the integrated northbridge, but it still has a 128-bit interface that delivers bandwidth that’d be comparable to a discrete card with DDR3 memory. Outside the hub, the Fusion APU has two display controllers and UVD3 capabilities, where Redwood has four display controllers and UVD2.

When it comes to raw specifications, both of these GPUs are virtually identical: based on AMD's VLIW5 architecture, just like the rest of the Radeon HD 5000 series, each thread processor (previously referred to as a Stream processor) contains four Radeon cores plus one special function ALU, a branch unit, and special purpose registers. Sumo has five SIMD engines, each containing 16 thread processors and four texture units. Calculate it out and you have 400 Radeon cores and 20 texture units in total, with two render back-ends, each capable of four color ROPs (adding up to eight at the end of the day). These are the same specifications as the Radeon HD 5570 and 5670 cards.

Why not use hardware from the Radeon HD 6000 generation? According to AMD, the complexities of aligning work schedules between the graphics team and Llano team are responsible. The Radeon HD 5000 series is quite similar to the 6000 series anyway, so there’s really little negative impact of which to speak, especially considering that the UVD engine does get updated in Llano.

The A8-series APUs take advantage of the full 400-shader GPU, while the A6 has one of its SIMD engines disabled, yielding 320 Radeon cores and 16 texture units (similar to the Radeon HD 5550). The A4 has yet another SIMD engine stripped from its belly, resulting in a total of 240 Radeon cores and eight texture units. One of the render back-ends is also shut down, limiting this model to four ROPs. On a side note, as far as I can remember there was only ever one other Radeon card with 240 cores: the Radeon HD 2900 GT, which itself was a crippled version of the 320-core Radeon HD 2900 XT.

We’re not going to rehash the technical nuances of the VLIW5 architecture—we’ve been there and done that in the Radeon HD 5870 launch. What we are going to do is take a closer look at what Llano’s GPU does differently. And there are some significant differences.

For instance, the UVD engine is updated to version three with power gating capabilities from the Radeon HD 6000 series. This means that MPEG-4 Part 2 (which includes DivX and Xvid), MPEG-2, and the Multi-View Codec (MVC) that Blu-ray 3D uses receive decode acceleration. Yes, Llano is capable of 3D playback over HDMI. In addition, the power gating capabilities make it possible to play back media using the fixed-function UVD3 block instead of the GPU’s shaders, saving a great deal of power in the process. AMD claims that Llano has the ability to play back two Blu-ray disks on one battery charge as a result of this optimization.

The memory interface and host interface required (and received) radical changes, as the APU communicates to memory through the integrated northbridge. The GPU can now write directly to the same cache that the CPU traditionally had exclusive access to. Having said that, the GPU portion of the die has priority access to memory through a true dual-channel 128-bit interface, which is the same width as the Radeon HD 5570 and 5670. The bandwidth is limited by system memory, which is significantly slower than GDDR5. Note that Llano’s GPU memory interface is twice as wide as the 64-bit interface used on the lower-power E- and C-series Fusion processors.

The Fusion APU also boasts a unique ability that dedicated graphics cards can not possess: direct access to unified memory shared between the CPU and GPU, something that makes Zero Copy and Pin-in-Place possible. To understand the advantage, consider how a discrete graphics card works today; texture maps are created in system memory and then transferred to virtual memory in Windows. When the system needs to bind the texture, it first makes sure it’s in virtual memory, then the OS copies it to DRAM, and the DMA of the PCIe bus transfers it to the graphics memory for access. Simply put, there’s a lot of copying going on that can cause significant latency.

But an APU doesn’t need to copy memory contents because the GPU and APU blocks share access to the same memory. Zero Copy can access virtual memory directly. Just update the page tables and point to it; no copying is necessary. Application memory can be pinned in place without copying it through the operating system staging buffers. When very large data sets are involved, the APU can even outrun a dedicated GPU (Ed.: I covered this optimization, which AMD was calling Fast Copy previously, in ASRock's E350M1: AMD's Brazos Platform Hits The Desktop First. Brazos is also able to share that memory space, which was previously separate, and enjoy a latency reduction).

That’s a best-case scenario. And on the whole, Llano is about 5-7% slower than a dedicated card because of the extra latencies involved. CPUs and GPUs aren’t all that compatible when you get right down to it. The GPU has to give the CPU low-latency access to memory, reorganize its memory accesses, and deal with extra latency because of it. A lot of work went into memory handling, and while some efficiency is lost, the final performance is very close to a discrete part with the same specifications.

Aside from these differences, the GPU block is identical to any other Radeon HD 5000 card. It features the TeraScale 2 unified processing architecture, full DirectX 11 support (something AMD repeatedly pointed out that Sandy Bridge doesn’t offer), OpenGL 4.1, MSAA, SSAA, and MLAA anti-aliasing, angle-independent anisotropic filtering, and OpenCL 1.1 support. While it certainly isn’t as powerful as a high-end discrete card, the point to take away is that this isn’t a crippled or cut-down GPU. It’s capable of the exact same features as any other Radeon.

Dual Graphics

The A-series APUs have a unique capability that, at least in theory, complements the integrated GPU nicely. They’re able to work cooperatively with separate discrete graphics for a net performance boost. Even more surprising is Llano’s ability to cooperate with GPUs that are faster or slower than its own integrated engine. Dual Graphics does not require identical GPUs to work properly, nor does it cripple the faster GPU to the specifications of the lowest common denominator, as we’ve seen from CrossFire. It actually load balances the available graphics hardware for more performance. For instance, if the discrete GPU is twice as fast as the on-die graphics, the driver takes one frame from the APU for every two frames it takes from the dedicated card.

This asymmetrical CrossFire implementation sounds like a fantastic idea, but there are serious limitations. First, it doesn’t work at all unless it’s driving a DirectX 10 or 11 application. And if you run a DirectX 9 or earlier game engine, it actually degrades performance to the slower of the two graphics options installed.

Update: According to AMD, actual production models should revert to the faster of the two graphics options installed when running a game engine using DirectX versions lower than 10. The company claims that the early test hardware we were given suffered from this issue, so we'll have to wait for actual production units to verify. OpenGL performance is not supported by Dual Graphics and is always delivered by the GPU handling the primary display outputs.

Even when it does work, the feature is somewhat inconsistent, and we definitely noticed stuttering, despite benchmark results claiming faster raw frame rates. Finally, Dual Graphics won’t work unless the performance ratio is at least as close as two-to-one—for instance, if a graphics card is three times as fast as Llano’s GPU, Dual Graphics won’t work. We’ll cover the performance ramifications shortly.

Another limitation with the Sabine notebook platform is that OEMs will have to decide between Dual Graphics or Eyefinity support. Since the A-series notebook uses both dedicated display controllers for the APU and Dual Graphics configurations, if you want to use three displays in Eyefinity, you’d have to use the discrete card’s controllers. In other words, no Dual Graphics would be possible. The lack of Eyefinity support is probably unimportant in the laptop space, though.

Having pointed out these unfortunate side-effects, we think there’s a lot of potential here. Assuming that AMD puts more resources into driver development and fixes the issues we’ve encountered, Dual Graphics could be a serious consideration for the consumer. Having a Fusion–based system might mean that you could spend $50 for a graphics card and end up with the same performance as an $80 model. It becomes even more attractive in the notebook space, as the graphics subsystem can revert to the power-saving APU when battery life is a consideration, then switch to both GPUs when an outlet is available for much better performance.

Even if you match it up to a discrete card too powerful to allow for Dual Graphics operation, the APU is able to execute OpenCL calls while the graphics card handles 3D rendering. This is a forward-looking scenario, mind you, but if game developers embrace this application interaction for tasks like physics calculation, it introduces some interesting possibilities.


Ask a Category Expert

Create a new thread in the Reviews comments forum about this subject

Example: Notebook, Android, SSD hard drive

Display all 134 comments.
This thread is closed for comments
Top Comments
  • 24 Hide
    stingstang , June 14, 2011 4:23 AM
    Good job, AMD. You finally made a better cpu/gpu combo than intel in terms of graphics power.
    ....big win there...
  • 23 Hide
    gmarsack , June 14, 2011 5:09 AM
    This looks like a fantastic solution for notebooks. Can't wait to finally see more of these systems in the wild. :)  Good job AMD. I would think this will help boost the company along until Bulldozer arrives. Way to survive! :) 
  • 23 Hide
    billj214 , June 14, 2011 4:35 AM
    This APU being somewhat low power and good graphics almost deserves to be in a tablet PC since CPU processing is not critical in tablet PC's and graphics is something that can help with media and games.

    Ditto on the "Good Job AMD" definitely on the right track.
Other Comments
  • 3 Hide
    vz7 , June 14, 2011 4:11 AM
    Do you know when the desktop review for llano will be out?
  • 24 Hide
    stingstang , June 14, 2011 4:23 AM
    Good job, AMD. You finally made a better cpu/gpu combo than intel in terms of graphics power.
    ....big win there...
  • 23 Hide
    billj214 , June 14, 2011 4:35 AM
    This APU being somewhat low power and good graphics almost deserves to be in a tablet PC since CPU processing is not critical in tablet PC's and graphics is something that can help with media and games.

    Ditto on the "Good Job AMD" definitely on the right track.
  • 18 Hide
    cangelini , June 14, 2011 4:38 AM
    vz7Do you know when the desktop review for llano will be out?


    The NDA is up on the 30th.
  • -4 Hide
    niceview , June 14, 2011 4:43 AM
    two things:

    1) What happened to the Game Charts results for the Radeon HD 5570, when the games were benchmarked? I thought you made a point to say you were going to compare the APU's 6620G with a discrete card (that has the same number of SPs and same clock). So much for that, unless you thought only comparing the two with a synthetic test was enough. Oh well. Tom's can be such a tease!

    2) I'm just a little disappointed that the APU's graphics power was not able to double Intel's.... Under the best of circumstances, AMD's latest integrated graphics came close to being twice as fast, but i guess that is ok since we are not playing horseshoes. I just thought it would be nice if it had made a nice even doubling, or more. Now, i'm worried IVY BRIDGE will beat it....
  • 8 Hide
    niceview , June 14, 2011 4:56 AM
    sorry, i guess that should be:

    we ARE playing horseshoes...

    and i have to give credit where credit is due: props to AMD for almost doubling Intel's HD Graphics in the integrated space....
  • 22 Hide
    sinfulpotato , June 14, 2011 5:02 AM
    What I find most exciting is the battery life saved. THIS is what will make this chip a winner. When if Sandy bridge is faster your average consumer won't be able to notice... PERIOD. However battery life... is a HUGE win.

    I don't play favorites, AMD needs market share.... FOR OUR SAKE. If Intel and AMD where on the same terms we would see faster progression and SAVE MONEY.
  • 23 Hide
    gmarsack , June 14, 2011 5:09 AM
    This looks like a fantastic solution for notebooks. Can't wait to finally see more of these systems in the wild. :)  Good job AMD. I would think this will help boost the company along until Bulldozer arrives. Way to survive! :) 
  • 11 Hide
    ikyung , June 14, 2011 5:14 AM
    fstrthnuAMD is kind of in a fix here, the more enthusiast gamers won't even bother looking at the Llano computers while this is kind of overkill for casual gamers. MAYBE money-pressed college students or something, but most people will just skip this and either buy a regular gaming computer or build their own using one of the guides from this very site! Going for good graphics in cheap desktops is kind of a futile exercise, the people who will care will just get the more expensive stuff anyways. Notebooks are more understandable, but the prices on the decent gaming desktops are just too good for Llano to be very competitive (and also, the CPU portion will be a letdown for the average person. Noticeably slower than the comparable Intel Core i5.)

    Well, Llano's market is the mobile space. Not desktop. Yes, they are bringing out desktop Llanos, but just like the article said, Trinity is what the enthusists are waiting for. I honestly don't think Llano's aim in the desktop market is for hardcores. I could see Llano's popularity in HTPC, and casual gaming/workstations though. Lower power, GPU over CPU tradeoff, etc. IF AMD releases the right drivers for the APU+Discrete CPU to work together, I see the market being even bigger.
  • 17 Hide
    striker410 , June 14, 2011 5:23 AM
    I suppose we need to stop suggesting i3-2100 and H67 over in the forums then? Go AMD!
  • -4 Hide
    Anonymous , June 14, 2011 5:37 AM
    How much for the A8 3500M? and it is a 1.5 GHZ chip with turbo to 2.4. I think its price point is more consistent with the I3 2310M @ 2.1 GHZ. Would have like to see you compare it to that instead of a I5 @ 2.5 GHZ
  • 6 Hide
    cleeve , June 14, 2011 5:49 AM
    SlaughteremHow much for the A8 3500M? and it is a 1.5 GHZ chip with turbo to 2.4. I think its price point is more consistent with the I3 2310M @ 2.1 GHZ. Would have like to see you compare it to that instead of a I5 @ 2.5 GHZ


    AMD didn't supply pricing for the processors, they only supplied pricing for the laptops. Accorsing to the price they gave us, the i5-2520M is fair competition.

    Having said that, does it matter? No matter how you slice it, Llano's GPU will beat Intel HD graphics and Llano's CPU will be beaten by Intel. You'll see different degrees of advantage but this point won't change, not until Trinity at least.
  • 12 Hide
    cleeve , June 14, 2011 5:52 AM
    just another user...modern games that aren't worth playing and CADs that can make use of a GPU...

    ...There is no single reason to prefer Llano over Sandy bridge, and I really cannot understand your excitement about it.


    You just listed two. :) 

    Obviously everyone will have their own priorities, but I think it's safe to say that you will find the majority of people will be more concerned with graphics performance than the ability to encode or render media. That's really what it comes down to; you're not going to notice a difference while surfing the net.
  • 6 Hide
    sparkle_ftw , June 14, 2011 5:57 AM
    How AMD will still be important to gamers: Use Llano profits from mainstream sales to continue funding and improving production of AMD's discrete gpus. Those kick some serious butt.
  • 18 Hide
    AppleBlowsDonkeyBalls , June 14, 2011 6:01 AM
    Decent review, though a bit unfair. What's the point of comparing the performance of a CPU that will be in $600 laptops (A8-3500M) to one that will be in $700 laptops (i5 2520M)? The A8-3500M competes with the Core i3 2310M, while the A8-3830MX competes with the Core i5 2520M. Given that, CPU performance for Llano is better than what is painted in the review.

    With Sandy Bridge, I think you're giving up a lot of GPU performance for some additional CPU performance. I think the choice is clear for most users.
Display more comments