AMD AI Chip Unveiling - Assembly - Salesforce Research

AMD Data Center and AI Technology Premiere Live Blog (Starts at 10am PT/17:00 UTC)

anandtech.com - None - Read On Original Website

12:55PM EDT - AMD this morning is hosting their first data center and server-focused event in quite some time. Dubbed the "AMD Data Center and AI Technology Premiere," we're expecting a sizable number of announcements from AMD around their server and AI product portfolio

12:55PM EDT - We're also expecting some additional details on AMD's remaining server CPU products for the year. This includes the density-focused Bergamo CPU, which will offer up to 128 CPU cores based on AMD's Zen 4c architecture. Genoa-X, which is AMD's V-cache equipped version of the EPYC 9004 series, offering up to 1.1GB of L3 cache per chip. And Siena, a 64 core low-cost EPYC chip

12:55PM EDT - And not to be left out, AMD also has their FPGA and networking divisions, courtesy of recent acquisition like Xilinx and Pensando. Those teams have also been hard at work at their own products, which are due for announcements as well

12:56PM EDT - This is AMD's first live event focused on the data center market in a while. The low frequency of these events means that AMD's most recent slate of announcements were during the tail-end of the pandemic, before live events had resumed

12:57PM EDT - So while not AMD's first live event overall, it's certainly their most important data center event in quite some time

12:59PM EDT - I am here in person in cloudy San Francisco, where AMD is holding their event. Backing me up, as always, is Gavin Bonshor, who is in decidedly warmer England

12:59PM EDT - AMD has asked everyone to silence their devices; the show is about to begin

01:00PM EDT - One thing to note that, for as important as this show is for AMD's customers, there's also a distinct element of pleasing AMD's shareholders

01:00PM EDT - Which, to be sure, as a company AMD is always looking to do that. But with the explosion in demand for AI products, there's a lot of pressure on AMD to make sure they're going to capture a piece of that pie

01:01PM EDT - "We have a lot of products and exciting news to share with you today"

01:01PM EDT - So with no further ado, we're getting started

More Context

keyboard_arrow_down keyboard_arrow_right Who are the potential competitors for AMD's chip?

finance.yahoo.com Nvidia

anandtech.com EPYC and Intel

crn.com Amazon Web Services, Microsoft Azure, Google Cloud and others

benzinga.com NVIDIA Corp. NVDA, Meta Platforms Inc. META, and Tesla Inc. TSLA

wccftech.com NVIDIA

seekingalpha.com Nvidia Corporation

servethehome.com Cerebras

fool.com Nvidia's H100

01:01PM EDT - "AMD technology is truly everywhere"

01:02PM EDT - EPYC processors, Instinct accelerators, and AMD's product ecosystem

01:02PM EDT - "Today we lead the industry" with their EPYC processors

01:02PM EDT - Lisa is going to show how AMD brings their current products together, and how they'll be expanding their portfolio

01:03PM EDT - EPYC adoption is also growing in the Enterprise market

01:04PM EDT - AMD is still ramping their 4th generation "Genoa" EPYC processors

01:04PM EDT - AMD thinks Genoa is still by far the highest performance and most efficient processor in the industry

More Context

keyboard_arrow_down keyboard_arrow_right How does the chip compare in terms of power and efficiency with its competitors?

crn.com industry leading performance

anandtech.com None

01:04PM EDT - Performance comparisons between EPYC and Intel's 4th gen Xeon platform (Sapphire Rapids)

01:05PM EDT - "We want leadership performance. But we must have best-in-class energy efficiency"

01:06PM EDT - The vast majority of AI workloads today are still being run on CPUs

01:06PM EDT - So AMD sees themselves as having a big stake - and big advantage - in that market

01:06PM EDT - Expect no shortage of guests today. Starting with AWS VP Dave Brown

01:08PM EDT - AWS has introed over 100 different AMD-based instances at this point

01:09PM EDT - AWS has a broad range of customers, who have benefitted from the cost savings of using AMD instances

01:09PM EDT - Brown is talking about the various things AWS's customers have been up to - and how much money they've saved

01:11PM EDT - AWS is building new EC2 instances using EPYC 9004 processors and AWS's Nitro system

01:12PM EDT - Up to 50% more perf than M6a instances

01:13PM EDT - AMD is using AWS today for their data analytics workloads

01:13PM EDT - AMD will be expanding their use of AWS to use the service for more technical workloads like EDA

01:14PM EDT - "We're really pleased with the response we're getting on Genoa"

01:14PM EDT - Oracle is also announcing new Genoa instances that will be available in July

01:14PM EDT - "Genoa is ramping nicely"

01:15PM EDT - More customres coming online in the coming weeks

01:15PM EDT - Now talking about the breadth of AMD's data center product stack

01:16PM EDT - Cloud computing clients have different needs than AMD's standard EPYC customers

01:16PM EDT - AMD's density-optimized CPU design for higher core counts

01:16PM EDT - 128 cores per socket "for leadership performance and energy efficiency in the cloud"

01:17PM EDT - 8 CCDs, each with 16 Zen 4c cores

01:17PM EDT - Zen 4c core is 2.48mm2 on TSMC 5nm, versus 3.84mm2 for Zen 4

01:18PM EDT - AMD starts from the same RTL as Zen 4, and then optimize the physical implementation for reduced area

01:18PM EDT - The only real difference between Genoa and Bergamo is the CCDs

01:19PM EDT - Genoa and Bergamo use the same SP5 socket, and can be swapped

01:19PM EDT - Now for performance comparison benchmarks versus Intel's 4th gen Xeon

01:20PM EDT - Bergamo is shipping in volume now to AMD's hyperscale customers

01:20PM EDT - And now for another guest: Meta VP Infrastructure, Alexis Bjorlin

01:21PM EDT - Meta and AMD have been collabing on EPYC server design since 2019

01:22PM EDT - Meta is a big supporter and provider for the Open Compute Project (OCP)

01:22PM EDT - So Meta's server designs are in significant use in the world

01:23PM EDT - AMD has proven to be able to meet their commitments to Meta

01:24PM EDT - Some of the insights from Meta have helped to shape Bergamo

01:24PM EDT - Meta will be deploying Bergamo for their next-gen high density server platform

01:25PM EDT - AMD is looking forward to the coming years with their Meta partnership. And that's Meta.

01:26PM EDT - Dan McNamara is now taking the stage. SVP and GM of AMD's server business unit

01:27PM EDT - He's starting with a look at how AMD has optimized its designed for the "technical computing" market

More Context

keyboard_arrow_down keyboard_arrow_right What are the businesses that AMD is targeting with the new chip?

crn.com AI, Cloud

anandtech.com AI/HPC

techradar.com data center & AI

finance.yahoo.com the latest stock market news and

videocardz.com cloud native and technical computing

latestly.com data centre

wepc.com high-performance computing (HPC) and AI workloads

wccftech.com CPU / GPU workloads

servethehome.com PyTorch, Hugging Face

neowin.net and for cloud data centers

cnbc.com developers and server makers

benzinga.com health care to 5G networks and data centers

zdnet.com artificial intelligence computing

beststocks.com high-performance computing, graphics, and visualization technologies

channelnewsasia.com cloud computing providers and other large chip buyers

seekingalpha.com supercomputers and traditional high-performance computing

prnewswire.com growing cloud native environments

01:27PM EDT - Over 1GB of L3 cache on a 96 core EPYC CPU

More Context

keyboard_arrow_down keyboard_arrow_right What are the technical specifications of the chip?

anandtech.com from 16 cores to 96 cores

neowin.net 128-core

wccftech.com 6 XCDs (Up To 228 CUs), 3 CCDs (Up To 24 Zen 4 Cores), 8 HBM3 Stacks

servethehome.com 128GB of HBM3, 5.2TB/s of memory bandwidth 896GB/s of Infinity Fabric bandwidth

crn.com higher core design and greater energy efficiency

zdnet.com multiple GPU "chiplets" plus 192 gigabytes of HBM3 DRAM memory, and 5.2 terabytes per second of memory bandwidth

finance.yahoo.com it can use up to 192GB of memory

videocardz.com up to 128 cores, deliver up to 3.7x throughput performance for key cloud native workloads compared to Ampere1

prnewswire.com unmatched core count in 1U and 2U density

01:28PM EDT - 4 new SKUs, from 16 cores to 96 cores

01:28PM EDT - Genoa-X is aimed at technical computing. Workloads that can benefit from substantially larger L3 cache sizes

01:29PM EDT - Now for some performance slides with some EDA workloads

01:29PM EDT - Platforms featuring Genoa-X will be available next quarter

01:30PM EDT - Another guest: Microsot's GM for Azure (apologies, didn't get the name)

01:31PM EDT - Talking about the history of Azure's HB series instances

01:34PM EDT - Azure is also offering the HX series for even higher performance (and lower latency)

01:35PM EDT - And now talking a bit about Azure's customer adoption, and what they've been doing with their instances

01:36PM EDT - Azure is going to be 100% renewable energy by 2025

01:36PM EDT - Which is helpful for their customers who are wanting to get to net-zero carbon emissions

01:38PM EDT - Meanwhile, ST Micro has been able to reduce their simulation time by 30%

01:39PM EDT - Final piece of the Zen 4 portfolio: Siena

More Context

keyboard_arrow_down keyboard_arrow_right How could this chip change the landscape of video conference calls?

anandtech.com More accurate and better models

neowin.net It offers immersive collaboration experiences

finance.yahoo.com increase the ability for even the framing of your

01:39PM EDT - AMD's low-cost EPYC processor for telco and other markets

More Context

keyboard_arrow_down keyboard_arrow_right What is AMD's latest chip that has been unveiled?

phoronix.com Genoa-X

fool.com MI300

cnbc.com A.I

zdnet.com MI300x

videocardz.com Instinct MI300X GPU

servethehome.com Instinct MI300

wepc.com MI300X

crn.com Instinct MI300X, EPYC 97X4

finance.yahoo.com AI superchip

neowin.net 128-core EPYC 97X4 Series

insidehpc.com 4th Generation EPYC

latestly.com EPYC 97X4

anandtech.com Instinct Mi300X

techradar.com 144-Core EPYC Bergamo

benzinga.com Genoa

wccftech.com Instinct MI300 APUs

pcmag.com Instinct MI300X

seekingalpha.com MI300 series

prnewswire.com EPYC(tm)

01:39PM EDT - More on that in the second half of the year

01:39PM EDT - Now on to Forrest Norrad, EVP and GM of AMD's data center solutions business group

01:40PM EDT - Who is bringing on another guest: Jeff Maurona, Managing Director and COO of Citadel Securities

01:40PM EDT - "World's most profitable hedge fund"

01:40PM EDT - As well as the world's largest market-making firm

01:43PM EDT - EPYC's memory bandwidth in particular has unlocked a lot of performance for Citadel

01:44PM EDT - Citadel finds Xilinx FPGAs to be absolutely essential as well

01:45PM EDT - Citadel is using over a million CPU cores in the cloud

01:46PM EDT - Now focusing on AMD's network portfolio. One of their recent expansions via the Pensando acquisition

01:46PM EDT - Networking is an increasingly important part of the data center market - and thus AMD's own offerings

01:47PM EDT - Forrest is talking about the challenges of offering a hybrid cloud environment

01:48PM EDT - Focusing in part on the CPU overhead involved in offering those services while maintaining the necessary isolation

01:48PM EDT - A purpose-built architecture to provide important services at line rate

01:49PM EDT - DPUs offload a good chunk of the CPU overhead

01:49PM EDT - Reducing the need for a set of external appliances

01:50PM EDT - And as part of Pensando SmartNICs, offer multiple new use cases

01:51PM EDT - Deployed into an existing infrastructure, or designed into a new one

01:52PM EDT - AMD is working with HP Aruba to develop a smart switch. An industry-standard switch enhanced with P4 DPUs

01:53PM EDT - And that's how AMD is helping customers evolve their data center environments and make them more efficient

01:53PM EDT - And now back to Lisa Su for a look at AI

01:54PM EDT - (An aurora background? That has to be intentional...)

01:54PM EDT - 3 key areas for AI: broad portfolio of CPUs and GPUs, open and proven software platform, and a deep ecosystem of partners

01:55PM EDT - AMD is uniquely positioned with a broad collection of AI platforms across everything from client to server

01:56PM EDT - Lisa is talking about some of the customers using AMD hardware today for AI tasks, not the least of which being NASA

More Context

keyboard_arrow_down keyboard_arrow_right What are the other applications that the chip could be used for?

crn.com AI, Cloud Expansion

neowin.net business and cloud

videocardz.com cloud native and technical computing

servethehome.com NICs, storage, and even memory

wepc.com high-performance computing (HPC) and AI workloads

finance.yahoo.com large language models and generative AI

latestly.com Software Enablement for Generative AI (Artificial Intelligence)

benzinga.com health care to 5G networks and data centers

zdnet.com large language models

wccftech.com various core IPs, memory interfaces, interconnects

beststocks.com high-performance computing, graphics, and visualization technologies

anandtech.com AI and HPC

seekingalpha.com AI segment

prnewswire.com technical computing

01:56PM EDT - Meanwhile AMD is expecting more than 70 laptop designs to launch through later this year featuring Ryzen AI

01:57PM EDT - "We are very, very early in the lifecycle of the AI market"

01:57PM EDT - 50% compound annual growth rate, from $30B today

More Context

keyboard_arrow_down keyboard_arrow_right What is the current market share of Nvidia and AMD in the AI industry?

cnbc.com over 80%

finance.yahoo.com 80% to 95%

01:58PM EDT - Talking about some of AMD's supercomputer wins, including, of course, Frontier, the industry's first exascale supercomputer

01:59PM EDT - And now rolling a video on the Lumi supercomputer (#3 on the current Top500 list)

02:00PM EDT - More accurate and better models as a result of Lumi

More Context

keyboard_arrow_down keyboard_arrow_right What is the impact of AMD's new chip on the PC market?

finance.yahoo.com 'incredibly powerful' technology

marketwatch.com 'closest competitor' to Nvidia in AI hardware

videocardz.com offer leadership performance in cloud native and technical computing

latestly.com drive leadership performance and energy efficiency

crn.com help it compete against the likes of Nvidia, Intel and others

wepc.com the most advanced accelerator for generative AI

zdnet.com enormous memory and data throughput

neowin.net revolutionizes business laptops by enabling premium AI experiences

02:00PM EDT - Generative AI requires hardware as well as good software

02:00PM EDT - Now on stage: AMD's President, Victor Peng, to talk about the software side of matters

02:01PM EDT - Peng also heads up AMD's newly formed AI group

02:02PM EDT - Talking about some of AMD's accomplishments to date

02:03PM EDT - As well as sampling new Vitis AI products

02:04PM EDT - A significant portion of which is open source

02:04PM EDT - ROCm in its fifth generation, with a comprehensive suite of AI optimizations

02:05PM EDT - Another guest on stage: Soumith Chintala, the founder of PyTorch and VP at Meta

02:06PM EDT - Recapping PyTorch and what it's used for. One of the most popular AI frameworks on the market

02:08PM EDT - How does AMD's collab benefit the developer community?

02:09PM EDT - Removed a lot of the work required/friction in moving platforms

02:10PM EDT - PyTorch 2.0 offers day-0 support for ROCm 5

02:11PM EDT - Another guest on stage, Clement Delangue, CEO of Hugging Face

02:11PM EDT - Sharing his thoughts on why open source matters in AI

02:12PM EDT - Giving companies the tools to build AI themselves, rather than just relying on provided tools

02:13PM EDT - AMD and Hugging Face recently formalized their partnership, which is being announced today

02:14PM EDT - Hugging Face is the most used open platform for AI

02:14PM EDT - Over 5000 new models added to their service just last week

02:15PM EDT - And they will be optimizing these models for AMD's platforms

02:17PM EDT - AMD, of course, shares in this vision, which is why they're working with Hugging Face

02:17PM EDT - The rate of innovation for AI is unprecidented

02:18PM EDT - "We've made a tremendous amount of progress over the past year with ROCm"

02:19PM EDT - Generative AI and LLMs have changed the landscape

02:19PM EDT - New compute engine, latest data formats, 5/6nm processes

02:20PM EDT - Recapping the MI300, now known as the MI300A

02:20PM EDT - 24 Zen 4 CPU cores, 128GB HBM3 memory

02:20PM EDT - All in a single package with unified memory across the CPU and GPU

02:20PM EDT - MI300A is slated for use in the El Capitan supercomputer in LLNL

02:21PM EDT - AMD is replacing the Zen CPU chiplets to create a GPU-only version: MI300X

02:22PM EDT - 129GB HBM3, 5.2 TB/second of memory bandwidth, 896GB/ec memory bandwith, 153B transistors

02:22PM EDT - "Leadership generative AI accelerator"

02:22PM EDT - It looks very similar to MI300A. Removed 3 CPU chiplets, added 2 GPU chiplets

02:23PM EDT - So AMD has done an XPU 3+ years before Intel

02:23PM EDT - Comparing MI300X to NVIDIA's H100 accelerator in terms of HBM3 density and bandwidth

More Context

keyboard_arrow_down keyboard_arrow_right How does the chip compare to Nvidia's chips?

anandtech.com ado

marketwatch.com closest competitor

zdnet.com 2.4 times the memory density

wccftech.com GB

crn.com fewer GPUs

wepc.com the MI300X has up to 2.4X the HBM density and 1.6X the HBM bandwidth

finance.yahoo.com a purported rival

servethehome.com similar

02:24PM EDT - AMD supports 8 HBM3 stacks, versus 6 on H100, which gives them a capacity and bandwidtbh advantage

02:24PM EDT - Doing a live demo with the Falcon-40B model running on one MI300X

02:26PM EDT - More memory and more memory bandwidth allows larger models, and also running LLMs on fewer GPUs overall

02:26PM EDT - Which AMD believes will offer a TCO advantage

02:28PM EDT - Making it easier to deploy MI300 into their existing server and AI infrastructure

02:28PM EDT - MI300A began sampling earlier this quarter. MI300X and the 8-way platform sampling in Q3 of this year

02:29PM EDT - Expecting both products to ramp into production later this year

02:29PM EDT - Recapping all of AMD's announcements, and the scope of AMD's overall data center product lineup

02:30PM EDT - And that's a wrap on AMD's data center event

02:30PM EDT - Thanks for joining us. Now off to find out more about MI300X