Basemark's Cross-Platform GPU Test: iPhone 14 Pro Beats Steam Deck
Intel's Arc A380 barely keeps up with Apple's M1 Pro
Basemark introduced its new GPUScore Sacred Path cross-platform benchmark that lets users compare completely different GPU architectures running on different operating systems. Our GPU benchmarks hierarchy is limited to Windows 11 testing, but what if you want to know how the best graphics cards stack up against your smartphone? Now you can.
To give a few examples, Sacred Path shows how Apple's A16 GPU stacks up against Steam Deck's GPU and Samsung's Galaxy S22+ GPU as well as how performance of Nvidia's GeForce RTX 3090 relates to that of Intel's Arc A770 and Apple's M1 Max. Of course, with any benchmark that has to work across a wide spread of hardware capabilities, there are caveats. But let's see what the numbers say.
Cross-Platform, Cross-Device Benchmark
There are quite a few major hardware graphics platforms for PCs and mobile devices. These include AMD's integrated and discrete GPUs, Apple's PowerVR-derived integrated graphics processors, Arm's Mali graphics solutions, Imagination's PowerVR-based graphics solutions, Intel's standalone Arc Alchemist and integrated Xe-LP GPUs, Nvidia's discrete graphics processors, Qualcomm's Adreno graphics for mobile applications, Samsung RDNA 2-based built-in GPU cores, and a number of other platforms.
Formally, all of these GPUs are designed to enable high-performance gaming, though high-end mobile and high-end PC are obviously quite different targets. All the architectures have their own pros and cons, and in many cases they run under different operating systems using different APIs (application programming interfaces). That makes it very difficult to determine how one GPU stacks up against another, since it is almost impossible to do an apples-to-apples — or Apples-to-desktops — comparison.
Basemark's cross-platform Sacred Path is designed to clear away some of the confusion, as it performs exactly the same workload on all the various GPUs (assuming the test is run using the official native resolution of 2560x1440). To make the benchmark more relevant for different devices, it supports various performance optimization techniques, including variable rate shading, advanced custom shaders, Ground Truth Ambient Occlusion (GTAO), and temporal anti-aliasing.
Repeatable, but With Fluctuations
After playing with the Sacred Path benchmark for a couple of hours on several Apple devices, we found that while in general the program produces consistent and repeatable results that correlate with the GPU capabilities of the device, at times its results fluctuate significantly — we're talking ~50% or more here. Perhaps Apple's mobile platforms adapt to workloads to provide better performance, but since smartphones and tablets are not in our wheelhouse, we'll leave appropriate investigation to someone else.
Basemark's PowerBoard 4.0 database already has enough submissions at the official render resolution of 2560x2440 to allow comparisons among many different platforms. To provide a better perspective, we compared the highest results each graphics processor achieved and did not take the CPU or API into account.
Unfortunately, since these results can fluctuate quite a bit, we'll have to apply appropriate levels of salt to the PowerBoard 4.0 submissions. Also keep in mind that the results from smartphones, notebooks, and tablets may vary greatly due to different cooling and ambient temperatures — these are not all run under test lab conditions by any stretch of the imagination.
High-End Gaming GPUs
Basemark's Sacred Path benchmark doesn't look particularly amazing compared to modern games, but it still needs a lot of resources to run at decent speeds. Sacred Path's score represents the average FPS multiplied by 100 (e.g, 1840 points means 18.4 FPS). Fortunately, even existing GPUs from AMD and Nvidia can crack this benchmark pretty easily, delivering 200–280 FPS. That also means they're potentially not getting anywhere close to 100% GPU utilization, which again is the problem with trying to create a "universal" GPU benchmark.
GPU | Score | Device | API | OS |
GeForce RTX 3090 | 28,270 | Desktop, AMD Ryzen 9 5900X | DirectX 12 | Windows 11 Pro |
GeForce RTX 3080 Ti | 25,050 | Desktop, AMD Ryzen 7950X | Vulkan 1.3 | Windows 11 Pro |
Radeon RX 6900 XT | 24,244 | Desktop, Intel Core i5-12600KF | Vulkan 1.3 | Windows 10 Pro |
GeForce RTX 3080 10GB | 22,489 | Desktop, AMD Ryzen 9 5900X | Vulkan 1.3 | Windows 10 Pro |
Radeon RX 6800 XT | 20,569 | Desktop, AMD Ryzen 5800X | Vulkan 1.3 | Windows 10 Pro |
Judging by results demonstrated by high-end gaming graphics cards, we can conclude that Basemark's Sacred Path loves raw compute performance. The more FP32 teraflops you have, the better your results will be. AMD's Radeon RX 6900 XT is very close to the GeForce RTX 3080 Ti, but the GeForce RTX 3090 is unrivalled.
Sadly, there are no RTX 4090 submissions just yet. Jarred ran it but it failed to submit (perhaps the system currently has a whitelist of known GPUs and the 4090 isn't on it). Manual (custom) testing using the official settings gave a result of 48,709 over the ~4 second run, which at least seems to be in the right ballpark. At the same time, a test that completes in seconds means that the GPU isn't even remotely warmed up before the test is over. (Where's my salt shaker?)
Stay On the Cutting Edge: Get the Tom's Hardware Newsletter
Get Tom's Hardware's best news and in-depth reviews, straight to your inbox.
Mid-Range and High-End Mobile Gaming GPUs
GPU | Score | Device | API | OS |
GeForce RTX 3070 Ti | 17980 | Desktop PC, Core i7-12700F | Vulkan 1.3 | Windows 11 Pro |
GeForce RTX 3070 | 15967 | Desktop, Core i7-11700K | Vulkan 1.3 | Windows 11 Pro |
Radeon RX 6700 XT | 15514 | Desktop, Ryzen 7 3700X | Vulkan 1.3 | Windows 10 Enterprise |
GeForce RTX 3060 Ti | 14010 | Desktop, Core i3-10105F | Vulkan 1.3 | Windows 11 Pro |
GeForce RTX 3080 Laptop | 12277 | Acer Nitro AN515-45, Ryzen 9 5900HX | Vulkan 1.3 | Windows 11 Home |
GeForce RTX 3070 Laptop | 12524 | ASUS ROG Strix G533QR, Ryzen 9 5900HX | DirectX 12 | Windows 11 Home |
Midrange and high-end mobile gaming graphics processors don't have as much oomph as high-end desktop cards. Nonetheless, they all — AMD's Navi 22, Intel's ACM-G10, and Nvidia's GA104 — do quite well. Intel's Arc A770 is 11% behind Nvidia's GeForce RTX 3060 Ti, as another point of reference.
Meanwhile, assuming that Apple's M1 Ultra scales its graphics performance really well (the M1 Ultra consists of two M1 Max SoCs connected together), we would expect its performance to be on par with that of Nvidia's GeForce RTX 3070, which is not bad for an integrated graphics processor.
Entry-Level GPUs
GPU | Score | Device | API | OS |
Apple M1 Max | 8939 | MacBook Pro 18.4 | Metal | MacOS 13.0 |
GeForce RTX 3050 | 8117 | Desktop PC, Ryzen 5 5600X | Vulkan 1.3 | Windows 11 Pro |
GeForce RTX 3050 Laptop | 6110 | MechRevo Z3 Air Series GM5TGEO, Intel Core i7-11800H | Vulkan 1.3 | Windows 10 Home |
Intel Arc A380 | 4801 | Desktop PC, Core i5-10400 | DirectX 12 | Windows 11 Pro |
Apple M1 Pro | 4698 | MacBook Pro 14 2021 | Metal | MacOS 12.6 |
Owners of entry-level GPUs have not submitted many results so far, or at least they're not showing up. Fundamentally, the more teraflops a GPU has, the better it performs in Sacred Path.
Apple's M1 Max (an iGPU) and Nvidia's GA106 in the form of desktop GeForce RTX 3050 lead the pack here. Meanwhile, Intel's standalone Arc A380 can barely keep up with Apple's M1 Pro integrated GPU. Even Nvidia's GeForce RTX 3050 for laptops is significantly faster than the Intel Arc A380.
Handhelds, Ultra-Portables, Integrated
GPU | Score | Device | API | OS |
Apple M2 | 3124 | MacBook Air 2022 | Metal | MacOS 13 |
Apple M1 | 2477 | Mac Mini 2020 | Metal | MacOS 12.5.1 |
Apple M1 | 2423 | iPad Pro 11 2021 | Metal | iPadOS 15.6.1 |
Apple A16 | 1837 | iPhone 14 Pro | Metal | iOS 16.0.3 |
AMD Van Gogh | 1828 | Steam Deck | Vulkan 1.2 | SteamOS |
AMD Ryzen 9 5900HX | 1744 | Acer Nitro AN515-45 | Vulkan 1.3 | Windows 11 |
Apple A15 | 1595 | iPhone 13 Pro Max | Metal | iOS 16.0.2 |
Qualcomm Adreno 730 (Snapdragon 8 Gen 1 — SM8450) | 1365 | Oppo Find X5 Pro | Vulkan 1.1 | Android 12 |
Samsung Xclipse 920 | 1265 | Galaxy S22+ | Vulkan 1.1 | Android 12 |
Arm Mali-G710 MC10 (MediaTek Dimensity 9000) | 899 | Xiaomi Redmi K50 Pro | Vulkan 1.1 | Android 12 |
When it comes to cross-platform benchmarking, handhelds, ultra-portables, and iGPUs are arguably the most interesting categories as they includes multiple hardware and software architectures as well as different form-factors.
Since Apple uses its M1 and M2 system-on-chips for its tablets, it's reasonable to include them into this comparison even though they can power full-fledged PCs like MacBook Air/MacBook Pro. After all, AMD's Ryzen SoCs with higher-end GPUs are also designed for this very purpose. Meanwhile, Apple's M1 and M2 iGPUs are faster than AMD's Cezanne iGPU by 40%–80%.
As for SoCs for handheld devices, Apple's A16 can beat even the RDNA 2-based AMD Van Gogh iGPU used in the Steam Deck game console, but the iPhone 14 Pro needs to run the test for a couple of times to hit something like 2200 points. Qualcomm's Adreno 730 used in the Snapdragon 8 Gen 1 (SM8450) is a good performer, but it's still behind Apple's previous-generation A15 SoC. Meanwhile, RDNA 2-based Samsung Xclipse 920 looks somewhat disappointing when compared to rivals.
Summary
Basemark's Sacred Path benchmark can indeed be used to test all kinds of graphics processors, from humble GPUs aimed at smartphones and all the way to monstrous graphics cards that can be used not just for games but also as space heaters or AI supercomputers.
The software produces consistent and repeatable results most of the time, and it heavily depends on the compute performance of modern GPUs. While that might be useful on some level as a performance metric, do keep in mind that a 1.15GB benchmark that can complete in seconds on the fastest GPUs will never adequately show the differences between GPUs that might have anywhere from 1GB to 24GB (or more) VRAM. In this test, raw teraflops wins the game, even though actual game titles also depend on graphics memory capacity, memory bandwidth, and other factors.
The benchmark also doesn't use ray tracing in a bid to work on all graphics processors from the past decade or more, so it cannot demonstrate all the benefits that leading-edge GPUs for PCs support. There are also some fluctuation-related hiccups with some platforms, but that's enough for this look at the vagaries of cross-platform "universal" GPU testing.
Anton Shilov is a contributing writer at Tom’s Hardware. Over the past couple of decades, he has covered everything from CPUs and GPUs to supercomputers and from modern process technologies and latest fab tools to high-tech industry trends.
-
ezst036 I mean Apple hardware blows a Geforce 4090 out of the water, why would a lowly Steam Deck stand a chance? :)Reply -
PlaneInTheSky Steam should have gone with ARM.Reply
I realise that would cause compatibility issues and it would require getting developers on board to make x86-ARM translations....and it would require a JIT x86-ARM translation layer like Apple's Rosetta for games that aren't ported to ARM....
...BUT...it would have made the Steam Deck so much more energy efficient.
The reason I am not getting a Steam Deck is just the bulk, the power consumption, even the noise from my friend's Steam Deck next to me started to actually bother me after a while. -
gg83 I have a hard time believing the iPhone 14 gpu is better all around though. Is it just in specific benchmarks? And how do you compare scores from 2 different systems?Reply -
Bamda PlaneInTheSky said:Steam should have gone with ARM.
I realise that would cause compatibility issues and it would require getting developers on board to make x86-ARM translations....and it would require a JIT x86-ARM translation layer like Apple's Rosetta for games that aren't ported to ARM....
...BUT...it would have made the Steam Deck so much more energy efficient.
The reason I am not getting a Steam Deck is just the bulk, the power consumption, even the noise from my friend's Steam Deck next to me started to actually bother me after a while.
LOL!! -
Bamda
Take it with a grain of salt. Benchmarks can be made to favor one CPU/GPU. Just like games, some run faster on different hardware. So the only thing you can do is look at multiple benchmarks and since there are none then this is pointless to even consider these results.gg83 said:I have a hard time believing the iPhone 14 gpu is better all around though. Is it just in specific benchmarks? And how do you compare scores from 2 different systems? -
gg83
Exactly my thoughtBamda said:Take it with a grain of salt. Benchmarks can be made to favor one CPU/GPU. Just like games, some run faster on different hardware. So the only thing you can do is look at multiple benchmarks and since there are none then this is pointless to even consider these results. -
anticeon Isn't obvious ?Reply
ARM will win the race.
i bet Nvidia and AMD already developed ARM based GPU for the next decade