AMD Advancing AI Event Live Blog: Instinct MI300 Launch, Ryzen 8000 "Hawk Point" Expected

Refresh

2023-12-06T16:35:27.580Z

2023-12-06T17:59:59.643Z

AMD has begun displaying its cautionary statements on the screen, so the show is about to start.

2023-12-06T18:02:50.599Z

AMD CEO Lisa Su has come out on the stage. She opened the presentation reminiscing on the launch of ChatGPT just one year ago, and the explosive impact it has had on the world.

AMD Advancing AI event — (Image credit: AMD)

2023-12-06T18:04:37.678Z

Generative AI will require significant investments to meet the needs for training and inference workloads. One year ago, AMD predicted a $150 billion TAM for AI workloads by 2027. Now AMD has revised that estimate up to $400 billion in 2027.

2023-12-06T18:06:11.580Z

AMD is currently focusing on tearing down the barriers to AI adoption and cooperating with its partners to develop new solutions.

2023-12-06T18:07:56.095Z

Lisa Su said that the availability of GPU hardware is the biggest barrier, and now the company is helping address that with the launch of its Instinct MI300 accelerators. The new CDNA 3 architecture delivers huge performance gains in multiple facets.

2023-12-06T18:09:31.823Z

The MI300 has 150 billion transistors. 128-channels of HBM3, fourth-gen Infinity Fabric, and eight CDNA 3 GPU chiplets.

2023-12-06T18:09:44.092Z

The Instinct MI300 is a game-changing design - the data center APU blends a total of 13 chiplets, many of them 3D-stacked, to create a chip with twenty-four Zen 4 CPU cores fused with a CDNA 3 graphics engine and 8 stacks of HBM3. Overall, the chip weighs in with 146 billion transistors, making it the largest chip AMD has pressed into production.

2023-12-06T18:10:49.570Z

AMD claims up to a 1.3X more performance than Nvidia's H100 GPUs in certain workloads. The slide above outlines the claimed performance advantages.

2023-12-06T18:11:12.038Z

2023-12-06T18:12:29.505Z

Scalability is incredibly important -- performance needs to increase linearly as more GPUs are employed. Here AMD shows they match Nvidia's eight-GPU H100 HGX system with an eight-GPU AMD platform.

2023-12-06T18:14:05.889Z

The MI300 delivers performance parity in training with Nvidia, but exhibits the strongest advantages in inference. AMD highlights a 1.6X advantage in inferencing.

2023-12-06T18:15:36.797Z

Microsoft CTO Kevin Scott has come to the stage to talk with Lisa Su about the challenges of building out AI infrastructure.

2023-12-06T18:16:41.080Z

While they discuss the details, here are some details about MI300.

2023-12-06T18:17:17.561Z

Microsoft will have MI300X coud instances available in preview today.

2023-12-06T18:19:30.617Z

Lisa Su displayed the AMD Instinct MI300X platform.

2023-12-06T18:20:14.460Z

2023-12-06T18:25:14.348Z

AMD CTO Victor Peng has come to stage to talk about the latest advances in ROCM, AMD's open source competitor to Nvidia's CUDA.

2023-12-06T18:26:02.486Z

2023-12-06T18:26:49.594Z

Peng talked about the advantages of the open ROCm ecosystem, as opposed to Nvidia's proprietary approach.

2023-12-06T18:27:56.519Z

AMD's next-gen ROCm 6 is launching later this month. Support for Radeon GPUs continues, but it also has new optimizations for MI300.

2023-12-06T18:29:19.621Z

ROCm provides up to a 2.6X improvement in vLLM, among other optimizations that total an 8X improvement on MI300X compared to ROCm 5 on MI250X (this isn't a great comparison).

2023-12-06T18:31:05.087Z

AMD continues to work with industry stalwarts like Hugging Face and PyTorch to expand the open source ecosystem.

2023-12-06T18:31:47.781Z

AMD GPUs, including the MI300, will be supported in the standard Triton distribution starting with version 3.0.

2023-12-06T18:34:02.781Z

Peng is now talking with leaders from Databricks, essential AI, and Lamini.

2023-12-06T18:43:30.129Z

The talk has turned to different forms of AI, and possible evolutionary updates in the future.

2023-12-06T18:45:51.125Z

Here are some of the specifications of AMD's new Instinct MI300X platform. The system consists of eight MI300X accelerators in one system. It supports 400 GbE networking and has a monstrous 1.5TB of total HBM3 capacity.

2023-12-06T18:48:53.588Z

62,000 AI models run on the Instinct lineup today, and many more will run on the MI300X. Peng says the arrival of ROCm 6 heralds the inflection point for the broader adoption of AMD's software.

2023-12-06T18:50:40.713Z

Lisa Su has returned to stage, inviting Ajit Mathews, the Senior Director of Engineering at Meta, to the stage.

2023-12-06T18:51:55.008Z

Meta feels that an open source approach to AI is the best path forward for the industry.

2023-12-06T18:54:14.182Z

Meta has been benchmarking ROCm and working to build its support in PyTorch for several years. Meta will deploy Instinct MI300X GPUs in its data centers.

2023-12-06T18:57:16.549Z

AMD is working to bring integrated AI solutions to market for enterprises, a lucrative portion of the market.

Arthur Lewis, the President of Dell's Core Business Operations, Global Infrastructure Solutions Group, to talk about the company's partnership with AMD.

2023-12-06T18:59:06.847Z

Dell has added AMD's MI300X to its portfolio, offering Poweredge servers with eight of the GPUs inside.

2023-12-06T19:03:54.841Z

Supermicro founder and CEO Charles Liang has come to the stage to talk about how the company is embracing the generative AI wave with new systems.

2023-12-06T19:06:04.929Z

Supermicro has MI300X systems in both air and watercooled versions, thus allowing customers to build rack scale solutions.

2023-12-06T19:11:20.322Z

Kirk Skaugen, the EVP and President of the Lenovo Infrastructure Solutions Group, has come to the stage. Lenovo is focusing heavily on developing new AI ThinkEdge systems.

2023-12-06T19:12:13.420Z

Lenovo has added the MI300X to the ThinkSystem platform.

2023-12-06T19:13:20.006Z

AMD has engaged with an incredible number of OEM and ODM system vendors, and is now working with new cloud service providers, too.

2023-12-06T19:14:47.969Z

Forrest Norrod, AMD's EVP and GM of the data center group, has come to the stage.

AI performance needs are driving the growth of clusters, thus requiring high-performance networking.

2023-12-06T19:15:35.734Z

AMD uses its Infinity Fabric technology to provide near linear performance scaling, while Nvidia uses its NVLink.

2023-12-06T19:16:36.118Z

AMD is now opening up its Infinity Fabric technology to outside firms, a huge announcement that will expand the number of companies that use its networking protocol. Meanwhile, Nvidia's CUDA remains proprietary.

2023-12-06T19:18:36.841Z

AMD thinks Ethernet is a better solution than Fibre Channel for data center networking. Ethernet does have a host of advantages, including scalability and an open design. AMD is part of the new Ultra Ethernet standard to further performance for AI and HPC workloads.

2023-12-06T19:21:53.583Z

Norrod invited representatives from Arista, Broadcom, and Cisco to stage to talk about the importance of continued adoption of the Ethernet standard for data centers.

If you're wondering why this is important -- Nvidia has acquired Mellanox and uses its Fibre Channel networking gear heavily in its systems. Notably, Nvidia is not a member of the Ultra Ethernet consortium.

2023-12-06T19:22:16.756Z

2023-12-06T19:29:32.659Z

Here comes another hardware announcement! Norrod is talking about AMD's traditional approach to CPUs.

2023-12-06T19:31:19.252Z

AMD has announced the first data center CPU, the MI300A, has entered into volume production. This is the chip that powers El Capitan. The MI300A uses the same fundamental design and methodology as the MI300X but substitutes in three 5nm core compute die (CCD) with eight Zen 4 CPU cores apiece, the same as found on the EPYC and Ryzen processors, thus displacing two of the XCD GPU chiplets.

2023-12-06T19:32:40.003Z

Here are the mind-bending stats behind the MI300A.

2023-12-06T19:34:08.741Z

AMD claims that MI300A provides up to 4X the performance in the OpenFOAM motorbike test, but this comparison isn’t ideal: The H100 is a GPU, while the blended CPU and GPU compute in the MI300A provides an inherent advantage in this memory-intensive workload through its shared memory addressing space. Comparisons to the Nvidia Grace Hopper GH200 Superchip, which also brings a CPU and GPU together in a tightly coupled implementation, would be better here, but AMD says that it couldn’t find any publicly listed OpenFOAM results for Nvidia’s chip.

2023-12-06T19:35:21.291Z

AMD says MI300A is twice as power efficient as the Nvidia Grace Hopper Superchip.

2023-12-06T19:35:42.089Z

2023-12-06T19:37:10.654Z

Here's a nice shot of the MI300A.

2023-12-06T19:38:40.831Z

AMD broke the exascale barrier with the Frontier supercomputer, which uses its MI250X accelerators. Now the MI300A will be deployed into El Capitan, which is expected to pass two exaflops of performance.

2023-12-06T19:39:55.242Z

HPE and AMD have developed the supercomputers for the Department of Energy.

2023-12-06T19:44:12.369Z

The MI300A will be available soon from partners around the world.

2023-12-06T19:45:45.996Z

Lisa Su has come back to the stage and has moved into talking about consumer AI enablement. AMD integrated the XDNA architecture into its Ryzen 7040 chips, bring the first dedicated on-die AI processor to market in PCs.

2023-12-06T19:46:44.954Z

AMD has worked diligently to enable the AI-accelerated software ecosystem in Windows.

2023-12-06T19:47:35.430Z

AMD has released its Ryzen AI 1.0 software today. This software will allow its customers to easily deploy AI models on NPU-equipped laptops.

2023-12-06T19:48:30.410Z

Lisa Su announced the launch of the Ryzen 8040 series, codenamed Hawk Point. These chips are shipping to partners now. AMD claims up to 60% more performance in AI workloads.

2023-12-06T19:53:17.576Z

AMD is working with Microsoft to broaden the AI ecosystem with AI processing power.

2023-12-06T19:54:27.462Z

Lisa teased the next-gen "Strix Point" processors that will arrive next year. AMD also provided performance claims for XDNA 1, saying the NPU alone delivers 10 TOPS (teraops INT8) of performance in the Phoenix 7040 series, and that increases to 16 TOPS in the Hawk Point 8040 series

2023-12-06T19:57:01.249Z

Here's the wrap-up slide. MI300X and MI300A are already shipping and are in production with a wide range of OEM partners. ROCm 6 is coming, and AMD launched the Ryzen 8040 series.

And with that, Lisa closed the show.

Live

AMD Advancing AI Event Live Blog: Instinct MI300 Launch, Ryzen 8000 "Hawk Point" Expected

AMD takes the Nvidia bull by the horns.

LIVE: Latest Updates