AI disruptor DeepSeek's next-gen model delayed by Nvidia GPU export restrictions to China — short supply of AI GPUs hinders development

DeepSeek attracted a lot of attention with its R1 AI model earlier this year, but development of the next-generation R2 model appears to have stalled due to a shortage of Nvidia's H20 processors in China, The Information reports. DeepSeek itself has not commented on when R2 will be available.

DeepSeek trained its R1 model on a cluster of 50,000 Hopper GPUs (30,000 H20s, 10,000 H800s, and 10,000 H100s) obtained by its investor, High-Flyer Capital Management. It is unclear whether R2 has already been fully pre-trained. The Information reports, citing two individuals familiar with the project, that the DeepSeek team has been working intensively on the model, but CEO Liang Wenfeng is not yet satisfied with its capabilities. Work continues internally to improve performance before the model is cleared for deployment.

Anton Shilov
Contributing Writer

Anton Shilov is a contributing writer at Tom’s Hardware. Over the past couple of decades, he has covered everything from CPUs and GPUs to supercomputers, and from modern process technologies and the latest fab tools to high-tech industry trends.

  • ThisIsMe
    Thought they claimed they didn’t need NVidia hardware. Weird…
  • rm12
    Well, hope they don't have incentives to develop/copy nvidia's hardware.
    So rare metals and magnets will become very rare outside of china?
  • mj-88
    This is a loss for the regular joe.
    The US being petty and can't stand competition which is ironic because they were happy to share when they were in the lead.