SPECworkstation 3 Test Notes
We ran this series of tests with the new patched v.3.0.4 version of the benchmark (marked as 'new' in the charts) and provide direct comparisons to the previous v.3.0.3 version ('old') that don't utilize all 128 threads in some subtests. "PBO" denotes an overclocked configuration. As mentioned on the previous page, the older version of the benchmark wouldn't run some subtests on the 3990X, or the results were too far outside of the expected range to be usable. Now we can run those tests, but we will only have one 3990X entry in those charts.
Some of these applications also make an appearance in our standard test suite, but those test configurations and benchmarks are focused on a typical desktop-class environment. In contrast, these tests are configured to stress the systems with workstation-class workloads.
With the exception of the W-3175X system, we loaded down our test platforms with 64GB of DDR4 memory spread across four modules to accommodate the expanded memory capacity required for several of these workstation-focused tasks. Due to the W-3175X's six-channel memory controller and our limited stock of high-capacity DIMMs, we used six 8GB DIMMs for a total capacity of 48GB. All systems were tested at the vendor-specified supported memory data transfer rates for their respective stock configurations, and DDR4-3600 for the overclocked settings. Test conditions mirror those explained in our Threadripper 3990X review, but we've also included a breakdown of the test systems at the end of the page.
The full suite consists of more than 30 applications split among seven categories, but we've winnowed down the list to tests that largely focus specifically on CPU performance. We haven't submitted these benchmarks to the SPEC organization, so these are not official benchmarks.
NAMD is a parallel molecular dynamics code designed to scale well with additional compute resources and is one of the premier benchmarks used to quantify performance with simulation code. We couldn't run this benchmark with the previous SPECworkstation version, but now it ticks right along as the 3900X's 128 threads tear through the benchmark.
The 3990X beats Intel's competing 28-core Xeon W-3175X by a massive margin, even after we overclocked the latter to the limits. Workload scalability is important here: The 3990X is more than twice as fast as the 32-core Threadripper 3970X in some tests, so the 3900X's additional cache may come into play.
Media and Entertainment
With the new version of the benchmark, the LuxRender CPU test scales much better than we see with the widely-used version of the benchmark we include in our normal test suite. It's important that potential customers can see the benefits of optimized code, so this massive improvement is welcome. We slid the test results from our standard suite into the second spot in the album, but be aware that is from the stand-alone LuxRender benchmark that isn't a part of the SPEC suite. With SEPCworkstation 3, the deltas between the 3990X and competing chips is now significantly widened, especially relative to the externally-available LuxRender benchmark utility, so hopefully LuxRender releases a new version of its stand-alone benchmark to unleash this type of performance.
We run the new Blender Benchmark beta in our regular suite of tests, but different types of render jobs can stress processors in unique ways. Here we can see a breakout of several industry-standard benchmark renders that largely favor the Zen 2 architecture. You'll notice that the Threadripper 3990X tends to be more competitive in the longer-duration render workloads, which falls right in line with the company's guidance that workload intensity has a big impact on performance. We verified that this portion of the benchmark suite runs across both processor groups with the previous-generation of the benchmark, so most of these results show little variability. We do see a slight regression with the new version in the 3BMWs workload.
We noticed that our standard HandBrake tests, which aren't part of the SPEC suite, didn't scale well at all with the 3990X, with the x265 test in particular showing subpar performance scaling considering the 3990X's massive compute resources.
However, with the optimized SPEC code, we can now see the massive performance improvements we expect. For reference, the first two charts in the album outline SPEC performance, while the second two charts come from our own external test tool. The SPEC results are much more representative of the real-world performance benefits of optimized code, particularly to highlight the extreme deltas relative to competing processors.
Product Development and Energy
The earth’s subsurface structure can be determined via seismic processing. One of the four basic steps in this process is the Kirchhoff Migration, which is used to generate an image based on the available data using mathematical operations.
This test kicked off odd results with the older version of the benchmark, but now we see the expected improvements, and they're impressive.
SRMP algorithms are used for discrete energy minimization. This test didn't run correctly, either, but now yields usable and realistic results. We see nearly linear performance scaling in this test, which is impressive.
Calculix is based on the finite element method for three-dimensional structural computations. The new SPEC update did generate a few results that we wouldn't expect given the 3990X's helping of compute resources. Calculix certainly falls into that bucket. This could be a oddity from our test environment, and we're working to diagnose the issue. That starts with retesting the comparison processors, and we'll update as necessary. In either case, we shouldn't expect these massive improvements in this program, and we'll follow up with SPEC if we hit a brick wall in our examination.
Financial and General Workloads
The financial services simulations are used to project risk and uncertainty in financial forecasting models and run across SIMD lanes, meaning the vectorized code should unlock the ultimate in performance from processors with the requisite compute elements. We found that these SPECWorkstation 3 tests continue to not scale across processor groups, but this could be due to the underlying code, as opposed to the benchmark itself.
The Python benchmark conducts a series of math operations, including numpy and scipy math libraries, with Python 3.6. This test also includes multithreaded matrix tests that would obviously benefit from more cores, provided the software can utilize the host processing resources correctly. Naturally, the multithreaded matrix workload favors Threadripper 3000, but we don't see the expected performance improvements. This test does span the two processor groups, but doesn't scale well. That's probably attributable to the underlying code instead of the SPEC benchmark.
The Intel processors dominate the numpy and scipy tests, but that also represents another challenge in the benchmark ecosystem for AMD: These tests are compiled in Intel's MKL library, but alternative libraries would improve AMD's standing.
SPECworkstation 3's Rodinia LifeSciences benchmark steps through four tests that include medical imaging, particle movements in a 3D space, a thermal simulation, and image-enhancing programs. These workloads don't execute across both processor groups with the old version of the benchmark, but now they do. That removes the 3970X's clock speed as its big advantage and allows the 3990X to take an expected lead in the Heartwall thermal simulation.
We also ran into unexpectedly good results in Srad and Lavamd, but a bit of regression with Hotspot. We're taking a deeper look at these results to troubleshoot.
We come away even more impressed with the Threadripper 3990X after this series of tests. Aside from the Threadripper 3990X's brutal performance in threaded workloads that can exploit its resources, the rapid industry adoption of the important tools and software to support the chip and expose its raw performance potential is incredibly important.
The updated benchmark isn't a panacea that can solve all issues in all benchmarks, as some of the underlying applications still aren't tuned for the unique Zen 2 architecture. However, these results underline our previous conclusions about the Threadripper 3990X, which we bestowed with an Editor's Choice award:
"The Threadripper 3990X is pretty much exactly what AMD says it is: A highly specialized processor that provides incredible performance in a narrow cross-section of workloads, but at an extremely attractive price point given its capabilities."
The SPEC organization also announced that it has begun developing a completely new SPEC CPU suite that will replace the existing SPEC CPU 2017 benchmark. After acceptance, the organization will award up to $9,000 and free benchmark software to individuals that submit new benchmarks that take advantage of multi-core processors and parallel, multi-threaded computing.
That marks another new wave of development of industry-standard benchmarks that will accurately measure the performance of core-heavy chips, but that submission period will last a year, meaning those new tests will be a bit further out on the horizon than the reworked SPECworkstation 3 benchmarks.
Kudos to the SPEC organization for moving quickly to assure industry-standard benchmarks that reflect the current state of the market. There might be a few teething pains in the process, but we'll take that over industry stagnation any day of the week. If the rest of the industry moves as fast as the SPEC organization, these types of core-heavy processors could become a more common reality for enthusiasts sooner rather than later.
|AMD Socket sTRX4 (TRX40)||Threadripper 3990X, 3970X, 3960X|
|MSI Creator TRX40|
|4x 8GB G.Skill FlareX DDR4-3200 - Stock: DDR4-3200, OC: DDR4-3600 16-18-18-36|
|Intel Socket 2066 (X299)||Core i9-10980XE|
|MSI Creator X299|
|4x 8GB G.Skill FlareX DDR4-3200 - Stock: DDR4-2933, OC: DDR4-3600 16-18-18-36|
|AMD Socket AM4 (X570)||AMD Ryzen 9 3950X|
|MSI MEG X570 Godlike|
|2x 8GB G.Skill FlareX DDR4-3200 - Stock: DDR4-3200, OC: DDR4-3600 14-14-14-34|
|Intel LGA 3647 (C621)||Intel Xeon W-3175X|
|ROG Dominus Extreme|
|6x 8GB Corsair Vengeance RGB DDR4-2666 - Stock: DDR4-2666, OC: DDR4-3600 14-14-14-34|
|AMD Socket SP3 (TR4)||Threadripper 2990WX, 2970WX, 2950X|
|MSI MEG X399 Creation|
|2x 8GB G.Skill FlareX DDR4-3200 - Stock: DDR4-2933, OC: DDR4-3466 14-14-14-34|
|All Systems||Nvidia GeForce RTX 2080 Ti|
|2TB Intel DC4510 SSD|
|EVGA Supernova 1600 T2, 1600W|
|Windows 10 Pro (1903 - All Updates)|
|Cooling||Corsair H115i, Enermax Liqtech 360 TR4 II, Custom Loop|
MORE: Best CPUs
MORE: All CPUs Content