Benchmark Suite
The mixture of synthetic and real-world tests we run are meant to give a comprehensive overview of a device’s performance. Synthetic tests—which are generally composed of several small, specialized blocks of code for performing operations such as cryptography, file compression, matrix operations, and alpha blending—are good at isolating the performance of different parts of a component’s architecture, including integer and floating point math, specialized instructions, pixel shaders, and rasterization. With this information, we can make comparisons between individual hardware components like SoCs (CPU A is faster than CPU B) or NAND flash. Because of their highly focused nature, however, it can be difficult to relate these results to the overall user experience, limiting us to generic statements about a device being faster because it has a faster CPU in certain benchmarks. Furthermore, synthetic tests are generally designed to push hardware to its limits—useful for determining maximum performance and for spotting weak points in a design—but do not represent real-world workloads.
For this reason, we also try to include benchmarks that test macro-level activities you do every day such as web browsing, composing text, editing photos, or watching a video. While these benchmarks are a better indicator of overall user experience, they are much more difficult to develop, leaving testers few options for mobile platforms.
To truly understand the performance of a device, we need to test it at the component level and at the system level, we need to know its maximum performance and its performance in real-world scenarios, and we also need to spot deficiencies (thermal throttling) and anomalies (unsupported features). No single benchmark can do all of these things. There’s not even a single benchmark that can adequately test any one of these things (creating a good benchmark is extremely difficult and there are always compromises). This is why we run a whole suite of benchmarks, many of which have overlapping functionality.
By now it should be apparent that the benchmarks we use are not randomly selected. In addition to fulfilling the requirements above, our benchmark suite comes from experienced developers who are willing to openly discuss how their benchmarks work. We work closely with most of these developers so that we may gain a better understanding of the tests themselves and to provide them with feedback for improving their tests. The table below lists the benchmarks we currently use to test mobile devices.
Google Android
Category | Benchmark | Version | Developer |
---|---|---|---|
CPU And System Performance | AndEBench Pro 2015 | 2.1.2472 | EEMBC |
Basemark OS II Full | 2.0 | Basemark Ltd | |
Geekbench 3 | 3.3.1 | Primate Labs | |
MobileXPRT 2013 | 1.0.92.1 | Principled Technologies | |
PCMark | 1.1 | Futuremark | |
TabletMark 2014 | 3.0.0.63 | BAPCo | |
Browsermark | 2.1 | Basemark Ltd | |
JSBench | 2013.1 | Purdue University | |
Google Octane | 2.0 | ||
Peacekeeper | - | Futuremark | |
GPU And Gaming Performance | 3DMark: Ice Storm Unlimited | 1.2 | Futuremark |
Basemark X | 1.1 | Basemark Ltd | |
GFXBench 3 Corporate | 3.0.28 | Kishonti | |
GFXBench 3.1 Corporate | 3.1.0 | Kishonti | |
Basemark ES 3.1 | 1.0.2 | Basemark Ltd | |
Battery Life And Thermal Throttling | Basemark OS II Full | 2.0 | Basemark Ltd |
GFXBench 3 Corporate | 3.0.28 | Kishonti | |
PCMark | 1.1 | Futuremark | |
TabletMark 2014 | 3.0.0.63 | BAPCo |
Apple iOS
Category | Benchmark | Version | Developer |
---|---|---|---|
CPU And System Performance | Basemark OS II Full | 2.0 | Basemark Ltd |
Geekbench 3 | 3.3.4 | Primate Labs | |
TabletMark 2014 | 3.0.0.63 | BAPCo | |
Browsermark | 2.1 | Basemark Ltd | |
JSBench | 2013.1 | Purdue University | |
Google Octane | 2.0 | ||
Peacekeeper | - | Futuremark | |
GPU And Gaming Performance | 3DMark: Ice Storm Unlimited | 1.2 | Futuremark |
Basemark X | 1.1 | Basemark Ltd | |
GFXBench 3 Corporate | 3.0.32 | Kishonti | |
GFXBench 3.1 Corporate | 3.1.0 | Kishonti | |
Basemark ES 3.1 | 1.0.2 | Basemark Ltd | |
Battery Life And Thermal Throttling | Basemark OS II Full | 2.0 | Basemark Ltd |
GFXBench 3 Corporate | 3.0.32 | Kishonti | |
TabletMark 2014 | 3.0.0.63 | BAPCo |
Microsoft Windows Phone
Category | Benchmark | Version | Developer |
---|---|---|---|
CPU And System Performance | Basemark OS II Full | 2.0 | Basemark Ltd |
Browsermark | 2.1 | Basemark Ltd | |
JSBench | 2013.1 | Purdue University | |
Google Octane | 2.0 | ||
Peacekeeper | - | Futuremark | |
GPU And Gaming Performance | Basemark X | 1.1 | Basemark Ltd |
GFXBench 3 DirectX | 3.0.4 | Kishonti | |
Battery Life And Thermal Throttling | Basemark OS II Full | 2.0 | Basemark Ltd |
GFXBench 3 DirectX | 3.0.4 | Kishonti |