AMD posts first Instinct MI300X MLPerf benchmark results — roughly in line with Nvidia H100 performance

AMD
(Image credit: AMD)

AMD has finally published its first official MLPerf results for the Instinct MI300X accelerator for AI and HPC. The MI300X processor apparently performs in line with Nvidia's previous-generation H100 GPU in the Llama 2 70B model for generative AI, but it falls well behind the revamped H200 version — never mind the upcoming Nvidia B200, which also got its first MLPerf results yesterday.

It's noteworthy that AMD only shared MI300X performance numbers in the MLPerf 4.1 generative AI benchmark on the Llama 2 70B model. Based on the data that AMD shared, a system with eight MI300X processors was only slightly slower (23,512 tokens/second offline) than a system with eight Nvidia H100 SXM3 processors (24,323 tokens/second offline), which can probably be called 'competitive' given how well is Nvidia's software stack is optimized for popular large language models like Llama 2 70B. The AMD MI300X system is also slightly faster than the Nvidia H100 machine in more or less real-world server benchmark: 21,028 tokens/second vs 20,605 tokens/second.

As with Nvidia's B200 results, we need to dig a little deeper to fully dissect these results.

Swipe to scroll horizontally
MLPerf 4.1 generative AI benchmark on Llama 2 70B model (tokens/second)
Row 0 - Cell 0 # of GPUsOfflineServerper GPU Offlineper GPU Server
AMD MI300X 192GB HBM313,0622,520--
AMD MI300X 192GB HBM3823,51421,0282,9392,629
Nvidia H100 80GB HBM3410,6999,5222,6752,381
Nvidia H100 80GB HBM3824,32320,6053,0402,576
Nvidia H200 141GB HBM3E832,12429,7394,0163,717
Nvidia B200 180GB HBM3E111,26410,755--
TOPICS
Anton Shilov
Contributing Writer

Anton Shilov is a contributing writer at Tom’s Hardware. Over the past couple of decades, he has covered everything from CPUs and GPUs to supercomputers and from modern process technologies and latest fab tools to high-tech industry trends.

  • Bikki
    AMD clearly has a lot of work to be done for their software stacks if they want to compete. Good article.
    Reply
  • Pierce2623
    Bikki said:
    AMD clearly has a lot of work to be done for their software stacks if they want to compete. Good article.
    I mean, realistically, anybody that’s ordered AMD parts already knows they’re significantly behind in the software stack. I’d say the main thing people are looking at if they realistically considering AMD is that the Mi300x is going to be available at like 1/3 of the price of the H100 while including a CPU in the deal, too.
    Reply