Elon Musk reveals photos of Dojo D1 Supercomputer cluster — roughly equivalent to 8,000 Nvidia H100 GPUs for AI training
The Dojo D1 uses a system-on-wafer design to deliver impressive processing power for AI video training.
Fresh off firing up the Memphis Supercluster, claimed to be "the most powerful AI cluster in the world," Elon Musk has now shared pictures of a supercomputer cluster that uses his own homegrown Dojo AI accelerators. He also announced on the Tesla earnings call that he would double down on Dojo development and deployment due to the high pricing of Nvidia's GPUs.
Aside from the opening of the xAI facility in Tennessee, which aims to have 100,000 Nvidia H100 GPUs on a single fabric, Musk said that he will have Dojo D1 up and running by the end of the year. As Musk said, it would have the processing power of 8,000 of Nvidia's H100 chips, which is “Not massive, but not trivial either.”
Dojo pics pic.twitter.com/Lu8YiZXo8cJuly 23, 2024
Musk first unveiled the Dojo D1 chip in 2021 with a performance target of 322 TeraFLOPs of power. Then, in August last year, Tesla was spotted hiring a Senior Engineering Program Manager for Data Centers, which is usually one of the first steps any organization would take when planning its own data centers. Tesla also doubled its orders for the Dojo D1 the following month, which shows its confidence in its performance.
By May 2024, it was reported that the Dojo processor was already in mass production. Now, it seems that the Dojo chips have already made their way to the States and into Elon’s hands, and yesterday he shared pictures of the Dojo Supercomputer at their home in the data center.
And Dojo 1 will have roughly 8k H100-equivalent of training online by end of year.Not massive, but not trivial either.July 23, 2024
The Dojo chips are system-on-wafer processors with a 5-by-5 array. This means its 25 ultra-high-performance dies are interconnected using TSMC’s integrated fan-out (InFO) technology) so they can act as a single processor and perform more efficiently than similar multi-processor machines.
TSMC manufactures Dojo chips for Tesla, and Musk will run them alongside his Nvidia-powered Memphis Supercluster. However, while the Tennessee facility is owned by xAI and is primarily used for training Grok, the Dojo chips are more tuned for AI machine learning and video training, especially as they will be used to train Tesla’s Full Self-Driving technology based on the video data gathered from Tesla cars.
When Musk combined all the chips he has on hand, he said that he’d have 90,000 Nvidia H100 chips, 40,000 Nvidia AI4, and the Dojo D1 wafers running by the end of 2024. This substantial computing power shows how much effort and resources the billionaire is pouring into artificial intelligence.
Stay On the Cutting Edge: Get the Tom's Hardware Newsletter
Get Tom's Hardware's best news and in-depth reviews, straight to your inbox.
Jowi Morales is a tech enthusiast with years of experience working in the industry. He’s been writing with several tech publications since 2021, where he’s been interested in tech hardware and consumer electronics.
-
mygt1 Please correct the article, the last paragraph mentions " 40,000 Nvidia AI4", the AI4 chips (Formelly known as hardware 4) are TESLA chips not NVIDIA.Reply -
DopaTestone
What exactly are you asking? Is Tesla still developing their AI that they're... still developing? Yes, they're still developing it, thus still training it on video data.JRStern said:"learning" from "video data gathered from Tesla cars", is that still a thing?
Companies are starting to realize video is the only real solution to a universal self driving AI, not LIDAR, as Musk stated 5 years ago. XPeng is abandoning their LIDAR system, GM is haulting their Cruise LIDAR system, multiple other Chinese companies are following suit. -
Notton
I looked this up, and all I see are the opposite results. GM is expanding LIDAR usage, but is not solely relying on it. They are using LIDAR+Radar+cameras.DopaTestone said:GM is haulting their Cruise LIDAR system
https://www.theverge.com/2023/3/7/23627656/gm-ultra-cruise-sensor-radar-lidar-hands-freehttps://news.gm.com/newsroom.detail.html/Pages/news/us/en/2024/feb/0215-supercruise.html
This looks to be true, but is not the full story. They are using Camera + Radar, so it's not solely relying on cameras.DopaTestone said:XPeng is abandoning their LIDAR system
It makes sense to keep the radar. It's a mature, reliable, and low complexity device for collusion avoidance. -
ThePizza Slight correction to this article. In the last paragraph it said they're Nvidia AI4. The HW4 chip that is now renamed to AI4 is a Tesla computer, not Nvidia.Reply