Startup claims its Zeus GPU is 10X faster than Nvidia's RTX 5090: Bolt's first GPU coming in 2026
280 RTX 5090 GPUs = 28 Zeus GPUs according to Bolt Graphics

Bolt Graphics, a GPU startup from Sunnyvale, California, this week introduced its Zeus GPU platform that is designed for gaming, rendering, and supercomputer simulations. The company says that its Zeus GPU not only supports features like upgradeable memory and built-in Ethernet interfaces, but it can also beat Nvidia's GeForce RTX 5090 by around 10 times in path tracing workloads, according to slide published by ServeTheHome.
There is one catch: Zeus can only beat the RTX 5090 GPU in path tracing and FP64 compute workloads. It's not clear how well it will handle traditional rendering techniques, as that was less of a focus. In speaking with Bolt Graphics, the card does support rasterization, but there was less emphasis on that aspect of the GPU, and it may struggle to compete with the best graphics cards when it comes to gaming. And when it comes to data center options like Nvidia's Blackwell B200, it's an entirely different matter.
RISC-V multi-chiplet GPU
Unlike GPUs from AMD, Intel, and Nvidia that rely on proprietary instruction set architectures, Bolt's Zeus relies on the open-source RISC-V ISA, according to the published slides. The Zeus core relies on an open-source out-of-order general-purpose RVA23 scalar core mated with FP64 ALUs and the RVV 1.0 (RISC-V Vector Extension Version 1.0) that can handle 8-bit, 16-bit, 32-bit, and 64-bit data types as well as Bolt's additional proprietary extensions designed for acceleration of scientific workloads.
As Zeus is aimed at path tracing rendering technique as well as compute workloads, we weren't sure whether it has traditional fixed-function GPU hardware like texture units (TMUs) and raster operation units (ROPs). The company reached out to us and said that Zeus does have TMU and ROP engines, but it's not disclosing any of the specifications just yet and is currently working on further optimizations. Each Zeus GPU has one DisplayPort 2.1a and one HDMI 2.1b output.
Like many processors these days, Zeus relies on a multi-chiplet design. The entry-level Zeus 1c26-032 features a single processing unit that is equipped with 32GB of LPDDR5X memory with 273 GB/s of bandwidth, and can be mated with up to 128GB of DDR5 memory using two SO-DIMMs (at 80 GB/s). The Zeus GPU is also paired with an I/O chiplet that packs a QSFP-DD port for 400GbE/800GbE, two PCIe Gen5 x16 slots with CXL 3.0 on top (enabling efficient memory sharing across multiple cards), and a GbE port for BMC. The GPU chiplet connects to its I/O chiplet at 256 GB/s.
The more advanced Zeus 2c26-064/128 uses two Zeus processing units, an I/O chiplet, and supports either 64GB or 128GB of LPDDR5X memory. The most powerful variant — Zeus 4c26-256 — integrates four processing units, four I/O chiplets, 256GB LPDDR5X and up to 2TB of DDR5 memory. The quad-chiplet Zeus implementation is not a card but rather functions as a server.
Unlike high-end GPUs that prioritize bandwidth, Bolt is evidently focusing on greater memory size to handle larger datasets for rendering and simulations. Also, built-in 400GbE and 800GbE ports to enable faster data transfer across networked GPUs indicates the data center focus of Zeus.
Performance
Row 0 - Cell 0 | Bolt Zeus 1c26-032 | Bolt Zeus 2c26-064 | Bolt Zeus 2c26-128 | Bolt Zeus 4c26-256 | Nvidia GeForce RTX 5090 |
Board Power (W) | 120 | 250 | 250 | 500 | 575 |
FP64 / FP32 / FP16 vector TFLOPS | 5 / 10 / 20 | 10 / 20 / 40 | 10 / 20 / 40 | 20 / 40 / 80 | 1.6 / 105 / 105 |
INT16 / INT8 matrix TFLOPS | 307.2 / 614.4 | 614.4 / 1,228.8 | 614.4 / 1,228.8 | 1,228.8 / 2,457.6 | 818.4 / 1,636.8 |
Path Tracing (gigarays) | 77 | 154 | 154 | 307 | 32 |
On-chip cache (MB) | 128 | 256 | 256 | 512 | 120 |
LPDDR5X memory | 32 GB LPDDR5X | 64 GB LPDDR5X | 128 GB LPDDR5X | 256 GB LPDDR5X | - |
DDR5 memory | 2x DDR5 SO-DIMMs | 4x DDR5 SO-DIMMs | 4x DDR5 SO-DIMMs | 8x DDR5 DIMMs | - |
Max memory and bandwidth | Up to 160 GB @ 363 GB/s | Up to 320 GB @ 725 GB/s | Up to 384 GB @ 725 GB/s | Up to 2,304 GB @ 1.45 TB/s | 32 GB @ 1.8 TB/s |
Video Encoding (AV1, H.264/265) | 2x 8K60 AV1 streams | 4x 8K60 AV1 streams | 4x 8K60 AV1 streams | 8x 8K60 AV1 streams | 3x 8K60 AV1 streams |
Display outputs | DisplayPort 2.1a, HDMI 2.1b | DisplayPort 2.1a, HDMI 2.1b | DisplayPort 2.1a, HDMI 2.1b | - | 3x DisplayPort 2.1a, HDMI 2.1b |
PCIe | 2x PCIe 5.0 x16 | 2x PCIe 5.0 x16 | 2x PCIe 5.0 x16 | 8x PCIe 5.0 x16 | PCIe 5.0 x16 |
GbE | 400 GbE (QSFP-DD) | 400 GbE (QSFP-DD) | 400 GbE (QSFP-DD) | 6x 800 GbE (OSFP) | - |
High-quality rendering, real-time path tracing, and compute are key focus areas for Zeus. As a result, even the entry-level Zeus 1c26-32 offers significantly higher FP64 compute performance than Nvidia's GeForce RTX 5090 — up to 5 TFLOPS vs. 1.6 TFLOPS — and considerably higher path tracing performance: 77 Gigarays vs. 32 Gigarays. Zeus also features a larger on-chip cache than Nvidia's flagship — up to 128MB vs. 96MB — and lower power consumption of 120W vs. 575W, making it more efficient for simulations, path tracing, and offline rendering.
Stay On the Cutting Edge: Get the Tom's Hardware Newsletter
Get Tom's Hardware's best news and in-depth reviews, straight to your inbox.
However, the RTX 5090 dominates in AI workloads with its 105 FP16 TFLOPS and 1,637 INT8 TFLOPS compared to the 10 FP16 TFLOPS and 614 INT8 TFLOPS offered by a single-chiplet Zeus. The 1c26-32 with with its 10 FP32 TFLOPS performance would have seemingly have no chance against the RTX 5090's 105 TFLOPS.
But we also don't know what sort of traditional graphics performance Zeus might offer. The stated matrix throughput looks relatively tame compared to modern consumer graphics cards, never mind the data center alternatives. A single Blackwell B200 with a 700W power limit offers up 60 TFLOPS of shader FP32 compute, 30 TFLOPS of FP64 dense matrix compute, and 1.8 PetaFLOPS of sparse TF32 compute.
Simply put, there's a lot we don't know yet, and Zeus is only running in simulation right now, with physical hardware slated to arrive later this year. How will it handle traditional rendering, along with ray tracing and AI? We'll need to wait and see, and drivers will be another potential factor — even a large company like Intel tool a long time to work out a lot of the driver kinks.
The quad-chiplet 4c26-256 — which still consumes less power than the GeForce RTX 5090, but is likely to be more expensive — on paper can beat Nvidia's gaming flagship in all workloads except FP32 and FP16, which emphasizes that traditional gaming was not the priority. The company indicates that the quad-chiplet variant is optimized for electromagnetic field modeling, photonics research, and FFT calculations. With its larger memory pools and reduced reliance on external storage, Zeus promises to improve speed of large-scale simulations. Assuming, of course, that its hybrid memory subsystem works efficiently across all workloads.
Software
Software support will be a critical factor in the success of Bolt's Zeus GPUs, as hardware capabilities alone are not enough to compete with established players like AMD and Nvidia. Unlike CUDA for Nvidia and ROCm for AMD, Bolt's Zeus lacks a mature, widely adopted software ecosystem. Since it is based on RISC-V, Zeus can potentially leverage existing open-source tools and libraries, but without strong developer support, adoption will be limited.
One thing that Bolt introduced is its Glowstick path tracing rendering engine, which could be a promising in-house solution for real-time rendering. Traditional rendering workflows require long processing times to visualize changes, whereas Zeus significantly reduces this delay, which makes it useful for professional visualization applications. Compared to existing solutions, Bolt claims up to 2.5x faster performance on single-chip variants, scaling higher with multiple GPUs.
However, it is unclear whether Zeus can support industry-standard frameworks such as OpenCL, Vulkan, and CUDA-translation layers, which are essential for gaining traction in HPC workloads. If Bolt Graphics provides strong developer tools, robust compiler support, and compatibility with Linux-based HPC environments, Zeus could establish itself as a competitor in scientific computing and rendering, but it faces an uphill battle against Nvidia's well-established ecosystem.
Availability
Bolt Graphics says that the first developer kits will be available in late 2025, with full production set for late 2026, which will give time for software developers to play with the hardware. Right now, Zeus is only running in simulation, so none of the performance claims have been verified with actual hardware.
The number of discrete GPU developers from the U.S. and Western Europe shrank to three companies in 2025 from around 10 in 2000. Nvidia dominates the market and is followed by AMD and Intel, which can compete in select segments only. No company in the recent years — at least outside of China — was bold enough to engage into competition against these three contenders, so the very emergence of Bolt Graphics seems like a breakthrough. However, the major focuses of Bolt's Zeus are high-quality rendering for movie and scientific industries as well as high-performance supercomputer simulations.
If Zeus delivers on its promises, it could establish itself as a serious alternative for scientific computing, path tracing, and offline rendering. But without strong software support, it risks struggling against dominant market leaders.

Anton Shilov is a contributing writer at Tom’s Hardware. Over the past couple of decades, he has covered everything from CPUs and GPUs to supercomputers and from modern process technologies and latest fab tools to high-tech industry trends.
-
bit_user
Wow, I'm getting major deja vu from Xeon Phi, here. Larrabee actually did have TMUs, but not much else. Intel demo'd pure software ray tracing on it, IIRC. And someone eventually found a later model Xeon Phi that still had display interfaces on it, in a dumpster outside of Intel's labs.The article said:As Zeus is aimed at path tracing rendering technique as well as compute workloads, it does not seem to have traditional fixed-function GPU hardware like texture units (TMUs) and raster operation units (ROPs), so it has to rely on compute shaders (or similar methods) for texture sampling and graphics outputs. This saves precious silicon real estate for compute elements. Nonetheless, each Zeus GPU has one DisplayPort 2.1a and one HDMI 2.1b output.
BTW, CXL 3.0 uses the same phy as PCIe 6.0, which makes it a little puzzling they kept PCIe 5.0 for the "slots". It's not like there are any PCIe 6.0 platforms yet, to my knowledge, but probably the next gen of server boards will be.The article said:two PCIe Gen5 x16 slots with CXL 3.0 on top
Edit: In the doc I found (see post #23), there's no mention of CXL. So, I don't know where that idea came from. They specifically discuss using a 400 GbE switch for multi-GPU communication. The diagram of their next gen architecture shows the PCIe 5.0 being able to bifurcate into as many as 8 PCIe 5.0 x4 links (but that's still 32 lanes in total) and supporting 2x 800 Gb Ethernet links per I/O chiplet, though it's not clear if those capabilities can be used concurrently.
Yeah, let's please just call this what it is, which is a server CPU with a built-in display controller.The article said:The quad-chiplet Zeus implementation is not a card, but rather is a server.
Unlike high-end GPUs that prioritize bandwidth, Bolt is evidently focusing on greater memory size to handle larger datasets for rendering or simulations.
That's a bad comparison, because client GPUs are intentionally designed to have minimal fp64 compute, since it's not generally useful for gaming or AI inference and would therefore be a waste of silicon. If you want to compare it to a client GPU, focus on the fp32 and AI inference horsepower, which shows this Zeus processor is truly no GPU.The article said:even the entry-level Zeus 1c26-32, offers significantly higher FP64 compute performance than Nvidia's GeForce RTX 5090 — up to 5 TFLOPS vs. 1.6 TFLOPS
Also, their fp64 TFLOPS numbers don't compare well with proper datacenter GPUs. They claim up to 20 TFLOPS of fp64, which indeed is more than double of a 192-core Zen 5 EPYC Turin, by my math. However, Nvidia's B200 claims 67 TFLOPS and AMD's MI300X claims 81.7 (163.4 matrix).
So, it's indeed powerful for a server CPU, but pretty weak compared with GPUs. Also, you cannot get around the memory bandwidth problem, when you start slinging such amounts of compute. Each fp64 is 8 bytes, which means that if your processor can only manage 1.45 TB/s, which works out to a mere 0.18T loads or stores of a fp64. It's funny they mention FFTs as an application, because it does seem like it'd be bandwidth-starved, there. Where this really becomes a problem is for AI, which is why the big training "GPUs" use HBM and have up to like 4x that bandwidth.
This part is probably the most intriguing. I'd like to see how real-world performance compares. I wonder if the Zeus processor is claiming theoretical peak, but once you hit it with a real dataset, it quickly becomes bandwidth-limited. Perhaps that's the reason Nvidia didn't put more RT cores in the RTX 5090.The article said:... and considerably higher path tracing performance: 77 Gigarays vs. 32 Gigarays.
I think their approach is basically that it's just like a server CPU, so you can use existing threading libraries and techniques. That's not ideal, from a scalability perspective, but it does open them up to the vast majority of HPC/scientific software out there.The article said:Unlike CUDA for Nvidia and ROCm for AMD, Bolt's Zeus lacks a mature, widely adopted software ecosystem. Since it is based on RISC-V, Zeus can potentially leverage existing open-source tools and libraries, but without strong developer support, adoption will be limited.
Edit: I'm now less sure about this, since they don't mention what OS or runtime the device is using. I had assumed it's just a massively-parallel RISC-V CPU, but it seems not. Check the doc, for details.
Ah, and here's the rub. What we usually see with upstart HPC makers is that, by the time they can get something to market, mainstream is already passing them by. This should be concurrent with Zen 6 and Diamond Rapids, which will have even more cores and even more memory & IO bandwidth.The article said:Bolt Graphics says that the first developer kits will be available in late 2025, with full production set for late 2026, which will give time for software developers to play with the hardware.
Well, good luck to them. It's cool to hear people still doing software ray tracing, in this day and age. I'd love to see that benchmarked. -
Elusive Ruse I would love to have a new player in this field, but I’m not sure how much stake I can put into these claims. I certainly hope this startup is up to sth.Reply -
bit_user
It shouldn't be compared to a client GPU, like the RTX 5090. That makes about as much sense as comparing EPYC or Xeon to one.Gururu said:This could literally destroy nVidia GPU dominance, a true disrupter finally.
The main difference between this and regular server CPUs is just that this thing has a built-in display controller. To me, that display engine seems like it's just there for the sake of integration, like putting a BMC right in your I/O die.
Edit: okay, it's more than just a server CPU with an iGPU. Check post #23 for more detailed analysis. -
usertests There is one major catch: Zeus can only beat the RTX 5090 GPU in path tracing and FP64 compute workloads because it does not support traditional rendering techniques.
If it could be used alongside a traditional GPU, like that 5090 + 3050 combo used for PhysX, then it could at least be interesting as a curiosity for YouTube clicks. -
Notton What an interesting card design.Reply
Is that two PCIe connectors on it?
Is that an SFP and RJ-45 connector on the back?
The GPU has 4 memory chips nearby, but then an extra two SODIMMs further away. -
Heiro78
It certainly looks like two PCIe connectors and a RJ-45.Notton said:What an interesting card design.
Is that two PCIe connectors on it?
Is that an SFP and RJ-45 connector on the back?
The GPU has 4 memory chips nearby, but then an extra two SODIMMs further away.
I wish for these claims to be even a 10th true. More competition would be good. -
Gururu
It even has DDR5 slots! Pretty badass, the website is very promising, at least in certain specifications blows away current GPUs.Heiro78 said:It certainly looks like two PCIe connectors and a RJ-45.
I wish for these claims to be even a 10th true. More competition would be good.