Startup claims its Zeus GPU is 10X faster than Nvidia's RTX 5090: Bolt's first GPU coming in 2026

Bolt Graphics
(Image credit: Bolt Graphics)

Bolt Graphics, a GPU startup from Sunnyvale, California, this week introduced its Zeus GPU platform that is designed for gaming, rendering, and supercomputer simulations. The company says that its Zeus GPU not only supports features like upgradeable memory and built-in Ethernet interfaces, but it can also beat Nvidia's GeForce RTX 5090 by around 10 times in path tracing workloads, according to slide published by ServeTheHome.

There is one catch: Zeus can only beat the RTX 5090 GPU in path tracing and FP64 compute workloads. It's not clear how well it will handle traditional rendering techniques, as that was less of a focus. In speaking with Bolt Graphics, the card does support rasterization, but there was less emphasis on that aspect of the GPU, and it may struggle to compete with the best graphics cards when it comes to gaming. And when it comes to data center options like Nvidia's Blackwell B200, it's an entirely different matter.

RISC-V multi-chiplet GPU

Unlike GPUs from AMD, Intel, and Nvidia that rely on proprietary instruction set architectures, Bolt's Zeus relies on the open-source RISC-V ISA, according to the published slides. The Zeus core relies on an open-source out-of-order general-purpose RVA23 scalar core mated with FP64 ALUs and the RVV 1.0 (RISC-V Vector Extension Version 1.0) that can handle 8-bit, 16-bit, 32-bit, and 64-bit data types as well as Bolt's additional proprietary extensions designed for acceleration of scientific workloads.

(Image credit: Bolt Graphics)

As Zeus is aimed at path tracing rendering technique as well as compute workloads, we weren't sure whether it has traditional fixed-function GPU hardware like texture units (TMUs) and raster operation units (ROPs). The company reached out to us and said that Zeus does have TMU and ROP engines, but it's not disclosing any of the specifications just yet and is currently working on further optimizations. Each Zeus GPU has one DisplayPort 2.1a and one HDMI 2.1b output.

Like many processors these days, Zeus relies on a multi-chiplet design. The entry-level Zeus 1c26-032 features a single processing unit that is equipped with 32GB of LPDDR5X memory with 273 GB/s of bandwidth, and can be mated with up to 128GB of DDR5 memory using two SO-DIMMs (at 80 GB/s). The Zeus GPU is also paired with an I/O chiplet that packs a QSFP-DD port for 400GbE/800GbE, two PCIe Gen5 x16 slots with CXL 3.0 on top (enabling efficient memory sharing across multiple cards), and a GbE port for BMC. The GPU chiplet connects to its I/O chiplet at 256 GB/s.

The more advanced Zeus 2c26-064/128 uses two Zeus processing units, an I/O chiplet, and supports either 64GB or 128GB of LPDDR5X memory. The most powerful variant — Zeus 4c26-256 — integrates four processing units, four I/O chiplets, 256GB LPDDR5X and up to 2TB of DDR5 memory. The quad-chiplet Zeus implementation is not a card but rather functions as a server.

Unlike high-end GPUs that prioritize bandwidth, Bolt is evidently focusing on greater memory size to handle larger datasets for rendering and simulations. Also, built-in 400GbE and 800GbE ports to enable faster data transfer across networked GPUs indicates the data center focus of Zeus.

Performance

Swipe to scroll horizontally
Row 0 - Cell 0

Bolt Zeus 1c26-032

Bolt Zeus 2c26-064

Bolt Zeus 2c26-128

Bolt Zeus 4c26-256

Nvidia GeForce RTX 5090

Board Power (W)

120

250

250

500

575

FP64 / FP32 / FP16 vector TFLOPS

5 / 10 / 20

10 / 20 / 40

10 / 20 / 40

20 / 40 / 80

1.6 / 105 / 105

INT16 / INT8 matrix TFLOPS

307.2 / 614.4

614.4 / 1,228.8

614.4 / 1,228.8

1,228.8 / 2,457.6

818.4 / 1,636.8

Path Tracing (gigarays)

77

154

154

307

32

On-chip cache (MB)

128

256

256

512

120

LPDDR5X memory

32 GB LPDDR5X

64 GB LPDDR5X

128 GB LPDDR5X

256 GB LPDDR5X

-

DDR5 memory

2x DDR5 SO-DIMMs

4x DDR5 SO-DIMMs

4x DDR5 SO-DIMMs

8x DDR5 DIMMs

-

Max memory and bandwidth

Up to 160 GB @ 363 GB/s

Up to 320 GB @ 725 GB/s

Up to 384 GB @ 725 GB/s

Up to 2,304 GB @ 1.45 TB/s

32 GB @ 1.8 TB/s

Video Encoding (AV1, H.264/265)

2x 8K60 AV1 streams

4x 8K60 AV1 streams

4x 8K60 AV1 streams

8x 8K60 AV1 streams

3x 8K60 AV1 streams

Display outputs

DisplayPort 2.1a, HDMI 2.1b

DisplayPort 2.1a, HDMI 2.1b

DisplayPort 2.1a, HDMI 2.1b

-

3x DisplayPort 2.1a, HDMI 2.1b

PCIe

2x PCIe 5.0 x16

2x PCIe 5.0 x16

2x PCIe 5.0 x16

8x PCIe 5.0 x16

PCIe 5.0 x16

GbE

400 GbE (QSFP-DD)

400 GbE (QSFP-DD)

400 GbE (QSFP-DD)

6x 800 GbE (OSFP)

-

High-quality rendering, real-time path tracing, and compute are key focus areas for Zeus. As a result, even the entry-level Zeus 1c26-32 offers significantly higher FP64 compute performance than Nvidia's GeForce RTX 5090 — up to 5 TFLOPS vs. 1.6 TFLOPS — and considerably higher path tracing performance: 77 Gigarays vs. 32 Gigarays. Zeus also features a larger on-chip cache than Nvidia's flagship — up to 128MB vs. 96MB — and lower power consumption of 120W vs. 575W, making it more efficient for simulations, path tracing, and offline rendering.

However, the RTX 5090 dominates in AI workloads with its 105 FP16 TFLOPS and 1,637 INT8 TFLOPS compared to the 10 FP16 TFLOPS and 614 INT8 TFLOPS offered by a single-chiplet Zeus. The 1c26-32 with with its 10 FP32 TFLOPS performance would have seemingly have no chance against the RTX 5090's 105 TFLOPS.

But we also don't know what sort of traditional graphics performance Zeus might offer. The stated matrix throughput looks relatively tame compared to modern consumer graphics cards, never mind the data center alternatives. A single Blackwell B200 with a 700W power limit offers up 60 TFLOPS of shader FP32 compute, 30 TFLOPS of FP64 dense matrix compute, and 1.8 PetaFLOPS of sparse TF32 compute.

Simply put, there's a lot we don't know yet, and Zeus is only running in simulation right now, with physical hardware slated to arrive later this year. How will it handle traditional rendering, along with ray tracing and AI? We'll need to wait and see, and drivers will be another potential factor — even a large company like Intel tool a long time to work out a lot of the driver kinks.

The quad-chiplet 4c26-256 — which still consumes less power than the GeForce RTX 5090, but is likely to be more expensive — on paper can beat Nvidia's gaming flagship in all workloads except FP32 and FP16, which emphasizes that traditional gaming was not the priority. The company indicates that the quad-chiplet variant is optimized for electromagnetic field modeling, photonics research, and FFT calculations. With its larger memory pools and reduced reliance on external storage, Zeus promises to improve speed of large-scale simulations. Assuming, of course, that its hybrid memory subsystem works efficiently across all workloads.

Software

Software support will be a critical factor in the success of Bolt's Zeus GPUs, as hardware capabilities alone are not enough to compete with established players like AMD and Nvidia. Unlike CUDA for Nvidia and ROCm for AMD, Bolt's Zeus lacks a mature, widely adopted software ecosystem. Since it is based on RISC-V, Zeus can potentially leverage existing open-source tools and libraries, but without strong developer support, adoption will be limited.

One thing that Bolt introduced is its Glowstick path tracing rendering engine, which could be a promising in-house solution for real-time rendering. Traditional rendering workflows require long processing times to visualize changes, whereas Zeus significantly reduces this delay, which makes it useful for professional visualization applications. Compared to existing solutions, Bolt claims up to 2.5x faster performance on single-chip variants, scaling higher with multiple GPUs.

However, it is unclear whether Zeus can support industry-standard frameworks such as OpenCL, Vulkan, and CUDA-translation layers, which are essential for gaining traction in HPC workloads. If Bolt Graphics provides strong developer tools, robust compiler support, and compatibility with Linux-based HPC environments, Zeus could establish itself as a competitor in scientific computing and rendering, but it faces an uphill battle against Nvidia's well-established ecosystem.

Availability

Bolt Graphics says that the first developer kits will be available in late 2025, with full production set for late 2026, which will give time for software developers to play with the hardware. Right now, Zeus is only running in simulation, so none of the performance claims have been verified with actual hardware.

The number of discrete GPU developers from the U.S. and Western Europe shrank to three companies in 2025 from around 10 in 2000. Nvidia dominates the market and is followed by AMD and Intel, which can compete in select segments only. No company in the recent years — at least outside of China — was bold enough to engage into competition against these three contenders, so the very emergence of Bolt Graphics seems like a breakthrough. However, the major focuses of Bolt's Zeus are high-quality rendering for movie and scientific industries as well as high-performance supercomputer simulations.

If Zeus delivers on its promises, it could establish itself as a serious alternative for scientific computing, path tracing, and offline rendering. But without strong software support, it risks struggling against dominant market leaders.

Anton Shilov
Contributing Writer

Anton Shilov is a contributing writer at Tom’s Hardware. Over the past couple of decades, he has covered everything from CPUs and GPUs to supercomputers and from modern process technologies and latest fab tools to high-tech industry trends.

  • jp7189
    So strange to position this against a 5090 or any consumer GPU for that matter.
    Reply
  • bit_user
    The article said:
    As Zeus is aimed at path tracing rendering technique as well as compute workloads, it does not seem to have traditional fixed-function GPU hardware like texture units (TMUs) and raster operation units (ROPs), so it has to rely on compute shaders (or similar methods) for texture sampling and graphics outputs. This saves precious silicon real estate for compute elements. Nonetheless, each Zeus GPU has one DisplayPort 2.1a and one HDMI 2.1b output.
    Wow, I'm getting major deja vu from Xeon Phi, here. Larrabee actually did have TMUs, but not much else. Intel demo'd pure software ray tracing on it, IIRC. And someone eventually found a later model Xeon Phi that still had display interfaces on it, in a dumpster outside of Intel's labs.

    The article said:
    two PCIe Gen5 x16 slots with CXL 3.0 on top
    BTW, CXL 3.0 uses the same phy as PCIe 6.0, which makes it a little puzzling they kept PCIe 5.0 for the "slots". It's not like there are any PCIe 6.0 platforms yet, to my knowledge, but probably the next gen of server boards will be.

    Edit: In the doc I found (see post #23), there's no mention of CXL. So, I don't know where that idea came from. They specifically discuss using a 400 GbE switch for multi-GPU communication. The diagram of their next gen architecture shows the PCIe 5.0 being able to bifurcate into as many as 8 PCIe 5.0 x4 links (but that's still 32 lanes in total) and supporting 2x 800 Gb Ethernet links per I/O chiplet, though it's not clear if those capabilities can be used concurrently.

    The article said:
    The quad-chiplet Zeus implementation is not a card, but rather is a server.
    Unlike high-end GPUs that prioritize bandwidth, Bolt is evidently focusing on greater memory size to handle larger datasets for rendering or simulations.
    Yeah, let's please just call this what it is, which is a server CPU with a built-in display controller.

    The article said:
    even the entry-level Zeus 1c26-32, offers significantly higher FP64 compute performance than Nvidia's GeForce RTX 5090 — up to 5 TFLOPS vs. 1.6 TFLOPS
    That's a bad comparison, because client GPUs are intentionally designed to have minimal fp64 compute, since it's not generally useful for gaming or AI inference and would therefore be a waste of silicon. If you want to compare it to a client GPU, focus on the fp32 and AI inference horsepower, which shows this Zeus processor is truly no GPU.

    Also, their fp64 TFLOPS numbers don't compare well with proper datacenter GPUs. They claim up to 20 TFLOPS of fp64, which indeed is more than double of a 192-core Zen 5 EPYC Turin, by my math. However, Nvidia's B200 claims 67 TFLOPS and AMD's MI300X claims 81.7 (163.4 matrix).

    So, it's indeed powerful for a server CPU, but pretty weak compared with GPUs. Also, you cannot get around the memory bandwidth problem, when you start slinging such amounts of compute. Each fp64 is 8 bytes, which means that if your processor can only manage 1.45 TB/s, which works out to a mere 0.18T loads or stores of a fp64. It's funny they mention FFTs as an application, because it does seem like it'd be bandwidth-starved, there. Where this really becomes a problem is for AI, which is why the big training "GPUs" use HBM and have up to like 4x that bandwidth.

    The article said:
    ... and considerably higher path tracing performance: 77 Gigarays vs. 32 Gigarays.
    This part is probably the most intriguing. I'd like to see how real-world performance compares. I wonder if the Zeus processor is claiming theoretical peak, but once you hit it with a real dataset, it quickly becomes bandwidth-limited. Perhaps that's the reason Nvidia didn't put more RT cores in the RTX 5090.

    The article said:
    Unlike CUDA for Nvidia and ROCm for AMD, Bolt's Zeus lacks a mature, widely adopted software ecosystem. Since it is based on RISC-V, Zeus can potentially leverage existing open-source tools and libraries, but without strong developer support, adoption will be limited.
    I think their approach is basically that it's just like a server CPU, so you can use existing threading libraries and techniques. That's not ideal, from a scalability perspective, but it does open them up to the vast majority of HPC/scientific software out there.

    Edit: I'm now less sure about this, since they don't mention what OS or runtime the device is using. I had assumed it's just a massively-parallel RISC-V CPU, but it seems not. Check the doc, for details.

    The article said:
    Bolt Graphics says that the first developer kits will be available in late 2025, with full production set for late 2026, which will give time for software developers to play with the hardware.
    Ah, and here's the rub. What we usually see with upstart HPC makers is that, by the time they can get something to market, mainstream is already passing them by. This should be concurrent with Zen 6 and Diamond Rapids, which will have even more cores and even more memory & IO bandwidth.

    Well, good luck to them. It's cool to hear people still doing software ray tracing, in this day and age. I'd love to see that benchmarked.
    Reply
  • Gururu
    This could literally destroy nVidia GPU dominance, a true disrupter finally.
    Reply
  • Elusive Ruse
    I would love to have a new player in this field, but I’m not sure how much stake I can put into these claims. I certainly hope this startup is up to sth.
    Reply
  • bit_user
    Gururu said:
    This could literally destroy nVidia GPU dominance, a true disrupter finally.
    It shouldn't be compared to a client GPU, like the RTX 5090. That makes about as much sense as comparing EPYC or Xeon to one.

    The main difference between this and regular server CPUs is just that this thing has a built-in display controller. To me, that display engine seems like it's just there for the sake of integration, like putting a BMC right in your I/O die.

    Edit: okay, it's more than just a server CPU with an iGPU. Check post #23 for more detailed analysis.
    Reply
  • A Stoner
    Maybe it can come in at MSRP?
    Reply
  • usertests
    There is one major catch: Zeus can only beat the RTX 5090 GPU in path tracing and FP64 compute workloads because it does not support traditional rendering techniques.
    If it could be used alongside a traditional GPU, like that 5090 + 3050 combo used for PhysX, then it could at least be interesting as a curiosity for YouTube clicks.
    Reply
  • Notton
    What an interesting card design.

    Is that two PCIe connectors on it?
    Is that an SFP and RJ-45 connector on the back?
    The GPU has 4 memory chips nearby, but then an extra two SODIMMs further away.
    Reply
  • Heiro78
    Notton said:
    What an interesting card design.

    Is that two PCIe connectors on it?
    Is that an SFP and RJ-45 connector on the back?
    The GPU has 4 memory chips nearby, but then an extra two SODIMMs further away.
    It certainly looks like two PCIe connectors and a RJ-45.

    I wish for these claims to be even a 10th true. More competition would be good.
    Reply
  • Gururu
    Heiro78 said:
    It certainly looks like two PCIe connectors and a RJ-45.

    I wish for these claims to be even a 10th true. More competition would be good.
    It even has DDR5 slots! Pretty badass, the website is very promising, at least in certain specifications blows away current GPUs.
    Reply