AMD FirePro W8000 And W9000 Review: GCN Goes Pro

Harnessing The Potential Of GCN

From V- To W-Series: In A New League

The new FirePro W family centers on AMD’s Graphics Core Next (GCN) architecture, which is the design used in the company's Radeon HD 7xxx-series desktop boards. The FirePro W boards succeed the V-series, which employed an older Very Long Instruction Word (VLIW) architecture. VLIW enabled decent 3D performance, but it struggled in compute-heavy applications. GCN was designed to alleviate that issue, and we've already seen it do wonders for AMD's consumer offerings in that regard.

A Detailed Look at the Compute Unit

GCN's Compute Unit (CU) replaces the Single Instruction Multiple Data (SIMD) engine that AMD has used since the days of its Radeon HD 2000. A CU consists of four vector units (VUs), which, in turn, consist of 16 ALUs and a register. Each VU unit can operate independently and execute one quarter of a command set (wavefront) per clock cycle. A CU with four VUs can execute four wavefronts every four clock cycles (or one wavefront per clock cycle). The VUs can also be scalar programmed and operate in a vector mode.

Additionally, the CUs have a scalar unit that’s responsible for things like flow control operations, which could be handled by the VUs if the VUs weren't better suited to other tasks. Each CU also has four texture units connected to a 16 KB read/write cache. The L1 cache isn’t just twice the size as the VLIW4 architecture's, but can be written to in addition to just read from.

FirePro W9000 Hits 1 TFLOP Of Double-Precision Math

The Tahiti GPU in AMD's flagship FirePro W9000 features 32 CUs. Each sports 64 ALUs, totaling 2048 ALUs. A GPU clock of 975 MHz gives us up to 4 TFLOPs of 32-bit compute performance and 1 TFLOP of double-precision floating-point math. Naturally, that's a good marketing figure, so it's a fair bet that AMD decided to use a 975 MHz GPU clock rate on the W9000 for this specific reason. Its second-fastest W8000 employs a more conservative 900 MHz frequency.

At that high-end sped, the card's L1 cache serves up 2 TB/s of bandwidth. The GPU is also equipped with 768 KB of L2 cache.

Better Tessellation And Order-Independent Transparency (OIT)

As with the desktop-oriented Tahiti-based cards, both high-end FirePro cards are armed with a GPU that sports two geometry engines with better tessellation performance than their predecessors. The ninth-generation fixed-function tessellation engines are able to handle about 2 billion triangles/s. However, AMD promises between 1.7x and 4x better performance, depending on the number of tessellation divisions.

The hardware-accelerated OIT mode is supposed to result in better output quality, minimizing artifacts and transparency render errors. Applications do have to be written to take advantage of this feature, though.

PowerTune and ZeroCore

We first dove into PowerTune in Radeon HD 6970 And 6950 Review: Is Cayman A Gator Or A Crock?The feature monitors power consumption and lowers the GPU's clock frequency as needed to keep it from exceeding its thermal design power (TDP).

AMD's Tahiti GPU has an additional power-saving feature called ZeroCore, which is composed of several components. A deep sleep mode and the DRAM’s stutter mode both serve to lower power consumption. Meanwhile, the contents of the frame buffer can now be compressed.

ZeroCore kicks in when the card is idle or the system goes to sleep. On the Windows desktop, these high-end cards only draw 15 W. Once Windows sits idle for long enough and switches off the display signal, power use drops even more. The card is nearly shut off, dissipating so little heat that its fan is able to stop spinning.

CrossFire users should be particularly excited about ZeroCore, which is able to turn off the second, third, and fourth graphics cards when they aren’t needed, lowering power consumption and thermal output. Granted, multi-card configurations are rare indeed in workstations.

  • mayankleoboy1
    Typical of AMD : releasing cards without proper drivers.
    I bet most professionals wont touch these cards until atleast 3-4 driver revisions. These cards are newer, and perform worse than competitions older.
    Reply
  • mayankleoboy1
    1.How does the CPU performance affect the benchmarks ? IOW, are these softwares enough offloaded on to the GPU, that changing the CPU to a much better Intel Xeons wont affect the performance much ?

    2. Also, how do the consumer cards perform on these pro softwares ?
    Reply
  • rdc85
    They are new architecture, it's kinda expected result. I can see there a room for improvement, but without the application that can take advantage of it, then it will useless..

    in the end I'm glad to see that AMD graphic section is trying to make an effort, not like the their proc section..
    Reply
  • My impression is that on average, Nvidia higher quality. IMHO of course
    Reply
  • bystander
    mayankleoboy1Typical of AMD : releasing cards without proper drivers.I bet most professionals wont touch these cards until atleast 3-4 driver revisions. These cards are newer, and perform worse than competitions older.Did you not read all the benchmarks? In many of the benchmarks it beat out Nvidia's offering by a lot, some were even, some were worse. And they are cheaper than the those Nvidia cards it would seem by the price offering of 4.2k for the Quadro 6000 right on the last page, compared to 4k for the W9000 and 1.6k for the W8000.

    So depending on what you use it for, it may very well be a great choice.
    Reply
  • Please note that dozens of software companies (all the most prevalent in DCC and CAD) have thoroughly tested and certified the drivers for the W8000 and W9000 cards. This means that users of these applications should not be concerned about driver stability or user experience.

    Yes, this is a brand new architecture and yes, performance improvements will continue to be made with subsequent driver optimizations.
    Reply
  • ohim
    Even though no one will prolly ever play games on a workstation, this are the first cards to have equal or superior gaming performance over the consumer cards also. Wonder if taking a HD 7970 and possibly mooding the bios for a FirePro one how will it impact the workstation benchmarks.
    Reply
  • rmpumper
    The review needs at least one gaming GPU as comparison.
    Reply
  • I always wondered how well these cards would do with games, anyone an idea? :)
    Reply
  • mayankleoboy1
    ohimEven though no one will prolly ever play games on a workstation, this are the first cards to have equal or superior gaming performance over the consumer cards also. Wonder if taking a HD 7970 and possibly mooding the bios for a FirePro one how will it impact the workstation benchmarks.
    AFAIK, its not possible now to BIOS mod a regular 7970 into a W9000. AMD and Nvidia have become smarter.
    Reply