AMD AI Chip Unveiling - Assembly - Salesforce Research
AMD Data Center and AI Technology Premiere Live Blog (Starts at 10am PT/17:00 UTC)
anandtech.com - None - Read On Original Website
12:55PM EDT - AMD this morning is hosting their first data center and server-focused event in quite some time. Dubbed the "AMD Data Center and AI Technology Premiere," we're expecting a sizable number of announcements from AMD around their server and AI product portfolio
12:55PM EDT - We're also expecting some additional details on AMD's remaining server CPU products for the year. This includes the density-focused Bergamo CPU, which will offer up to 128 CPU cores based on AMD's Zen 4c architecture. Genoa-X, which is AMD's V-cache equipped version of the EPYC 9004 series, offering up to 1.1GB of L3 cache per chip. And Siena, a 64 core low-cost EPYC chip
12:55PM EDT - And not to be left out, AMD also has their FPGA and networking divisions, courtesy of recent acquisition like Xilinx and Pensando. Those teams have also been hard at work at their own products, which are due for announcements as well
12:56PM EDT - This is AMD's first live event focused on the data center market in a while. The low frequency of these events means that AMD's most recent slate of announcements were during the tail-end of the pandemic, before live events had resumed
12:57PM EDT - So while not AMD's first live event overall, it's certainly their most important data center event in quite some time
12:59PM EDT - I am here in person in cloudy San Francisco, where AMD is holding their event. Backing me up, as always, is Gavin Bonshor, who is in decidedly warmer England
12:59PM EDT - AMD has asked everyone to silence their devices; the show is about to begin
01:00PM EDT - One thing to note that, for as important as this show is for AMD's customers, there's also a distinct element of pleasing AMD's shareholders
01:00PM EDT - Which, to be sure, as a company AMD is always looking to do that. But with the explosion in demand for AI products, there's a lot of pressure on AMD to make sure they're going to capture a piece of that pie
01:01PM EDT - "We have a lot of products and exciting news to share with you today"
01:01PM EDT - So with no further ado, we're getting started
More Context
keyboard_arrow_down keyboard_arrow_right Who are the potential competitors for AMD's chip?
anandtech.com EPYC and Intel
crn.com Amazon Web Services, Microsoft Azure, Google Cloud and others
benzinga.com NVIDIA Corp. NVDA, Meta Platforms Inc. META, and Tesla Inc. TSLA
seekingalpha.com Nvidia Corporation
fool.com Nvidia's H100
01:01PM EDT - "AMD technology is truly everywhere"
01:02PM EDT - EPYC processors, Instinct accelerators, and AMD's product ecosystem
01:02PM EDT - "Today we lead the industry" with their EPYC processors
01:02PM EDT - Lisa is going to show how AMD brings their current products together, and how they'll be expanding their portfolio
01:03PM EDT - EPYC adoption is also growing in the Enterprise market
01:04PM EDT - AMD is still ramping their 4th generation "Genoa" EPYC processors
01:04PM EDT - AMD thinks Genoa is still by far the highest performance and most efficient processor in the industry
More Context
keyboard_arrow_down keyboard_arrow_right How does the chip compare in terms of power and efficiency with its competitors?
crn.com industry leading performance
01:04PM EDT - Performance comparisons between EPYC and Intel's 4th gen Xeon platform (Sapphire Rapids)
01:05PM EDT - "We want leadership performance. But we must have best-in-class energy efficiency"
01:06PM EDT - The vast majority of AI workloads today are still being run on CPUs
01:06PM EDT - So AMD sees themselves as having a big stake - and big advantage - in that market
01:06PM EDT - Expect no shortage of guests today. Starting with AWS VP Dave Brown
01:08PM EDT - AWS has introed over 100 different AMD-based instances at this point
01:09PM EDT - AWS has a broad range of customers, who have benefitted from the cost savings of using AMD instances
01:09PM EDT - Brown is talking about the various things AWS's customers have been up to - and how much money they've saved
01:11PM EDT - AWS is building new EC2 instances using EPYC 9004 processors and AWS's Nitro system
01:12PM EDT - Up to 50% more perf than M6a instances
01:13PM EDT - AMD is using AWS today for their data analytics workloads
01:13PM EDT - AMD will be expanding their use of AWS to use the service for more technical workloads like EDA
01:14PM EDT - "We're really pleased with the response we're getting on Genoa"
01:14PM EDT - Oracle is also announcing new Genoa instances that will be available in July
01:14PM EDT - "Genoa is ramping nicely"
01:15PM EDT - More customres coming online in the coming weeks
01:15PM EDT - Now talking about the breadth of AMD's data center product stack
01:16PM EDT - Cloud computing clients have different needs than AMD's standard EPYC customers
01:16PM EDT - AMD's density-optimized CPU design for higher core counts
01:16PM EDT - 128 cores per socket "for leadership performance and energy efficiency in the cloud"
01:17PM EDT - 8 CCDs, each with 16 Zen 4c cores
01:17PM EDT - Zen 4c core is 2.48mm2 on TSMC 5nm, versus 3.84mm2 for Zen 4
01:18PM EDT - AMD starts from the same RTL as Zen 4, and then optimize the physical implementation for reduced area
01:18PM EDT - The only real difference between Genoa and Bergamo is the CCDs
01:19PM EDT - Genoa and Bergamo use the same SP5 socket, and can be swapped
01:19PM EDT - Now for performance comparison benchmarks versus Intel's 4th gen Xeon
01:20PM EDT - Bergamo is shipping in volume now to AMD's hyperscale customers
01:20PM EDT - And now for another guest: Meta VP Infrastructure, Alexis Bjorlin
01:21PM EDT - Meta and AMD have been collabing on EPYC server design since 2019
01:22PM EDT - Meta is a big supporter and provider for the Open Compute Project (OCP)
01:22PM EDT - So Meta's server designs are in significant use in the world
01:23PM EDT - AMD has proven to be able to meet their commitments to Meta
01:24PM EDT - Some of the insights from Meta have helped to shape Bergamo
01:24PM EDT - Meta will be deploying Bergamo for their next-gen high density server platform
01:25PM EDT - AMD is looking forward to the coming years with their Meta partnership. And that's Meta.
01:26PM EDT - Dan McNamara is now taking the stage. SVP and GM of AMD's server business unit
01:27PM EDT - He's starting with a look at how AMD has optimized its designed for the "technical computing" market
More Context
keyboard_arrow_down keyboard_arrow_right What are the businesses that AMD is targeting with the new chip?
crn.com AI, Cloud
techradar.com data center & AI
finance.yahoo.com the latest stock market news and
videocardz.com cloud native and technical computing
latestly.com data centre
wepc.com high-performance computing (HPC) and AI workloads
wccftech.com CPU / GPU workloads
servethehome.com PyTorch, Hugging Face
neowin.net and for cloud data centers
cnbc.com developers and server makers
benzinga.com health care to 5G networks and data centers
zdnet.com artificial intelligence computing
beststocks.com high-performance computing, graphics, and visualization technologies
channelnewsasia.com cloud computing providers and other large chip buyers
seekingalpha.com supercomputers and traditional high-performance computing
prnewswire.com growing cloud native environments
01:27PM EDT - Over 1GB of L3 cache on a 96 core EPYC CPU
More Context
keyboard_arrow_down keyboard_arrow_right What are the technical specifications of the chip?
anandtech.com from 16 cores to 96 cores
neowin.net 128-core
wccftech.com 6 XCDs (Up To 228 CUs), 3 CCDs (Up To 24 Zen 4 Cores), 8 HBM3 Stacks
servethehome.com 128GB of HBM3, 5.2TB/s of memory bandwidth 896GB/s of Infinity Fabric bandwidth
crn.com higher core design and greater energy efficiency
zdnet.com multiple GPU "chiplets" plus 192 gigabytes of HBM3 DRAM memory, and 5.2 terabytes per second of memory bandwidth
finance.yahoo.com it can use up to 192GB of memory
videocardz.com up to 128 cores, deliver up to 3.7x throughput performance for key cloud native workloads compared to Ampere1
prnewswire.com unmatched core count in 1U and 2U density
01:28PM EDT - 4 new SKUs, from 16 cores to 96 cores
01:28PM EDT - Genoa-X is aimed at technical computing. Workloads that can benefit from substantially larger L3 cache sizes
01:29PM EDT - Now for some performance slides with some EDA workloads
01:29PM EDT - Platforms featuring Genoa-X will be available next quarter
01:30PM EDT - Another guest: Microsot's GM for Azure (apologies, didn't get the name)
01:31PM EDT - Talking about the history of Azure's HB series instances
01:34PM EDT - Azure is also offering the HX series for even higher performance (and lower latency)
01:35PM EDT - And now talking a bit about Azure's customer adoption, and what they've been doing with their instances
01:36PM EDT - Azure is going to be 100% renewable energy by 2025
01:36PM EDT - Which is helpful for their customers who are wanting to get to net-zero carbon emissions
01:38PM EDT - Meanwhile, ST Micro has been able to reduce their simulation time by 30%
01:39PM EDT - Final piece of the Zen 4 portfolio: Siena
More Context
keyboard_arrow_down keyboard_arrow_right How could this chip change the landscape of video conference calls?
anandtech.com More accurate and better models
neowin.net It offers immersive collaboration experiences
finance.yahoo.com increase the ability for even the framing of your
01:39PM EDT - AMD's low-cost EPYC processor for telco and other markets
More Context
keyboard_arrow_down keyboard_arrow_right What is AMD's latest chip that has been unveiled?
videocardz.com Instinct MI300X GPU
servethehome.com Instinct MI300
wepc.com MI300X
crn.com Instinct MI300X, EPYC 97X4
finance.yahoo.com AI superchip
neowin.net 128-core EPYC 97X4 Series
insidehpc.com 4th Generation EPYC
latestly.com EPYC 97X4
anandtech.com Instinct Mi300X
techradar.com 144-Core EPYC Bergamo
wccftech.com Instinct MI300 APUs
pcmag.com Instinct MI300X
seekingalpha.com MI300 series
01:39PM EDT - More on that in the second half of the year
01:39PM EDT - Now on to Forrest Norrad, EVP and GM of AMD's data center solutions business group
01:40PM EDT - Who is bringing on another guest: Jeff Maurona, Managing Director and COO of Citadel Securities
01:40PM EDT - "World's most profitable hedge fund"
01:40PM EDT - As well as the world's largest market-making firm
01:43PM EDT - EPYC's memory bandwidth in particular has unlocked a lot of performance for Citadel
01:44PM EDT - Citadel finds Xilinx FPGAs to be absolutely essential as well
01:45PM EDT - Citadel is using over a million CPU cores in the cloud
01:46PM EDT - Now focusing on AMD's network portfolio. One of their recent expansions via the Pensando acquisition
01:46PM EDT - Networking is an increasingly important part of the data center market - and thus AMD's own offerings
01:47PM EDT - Forrest is talking about the challenges of offering a hybrid cloud environment
01:48PM EDT - Focusing in part on the CPU overhead involved in offering those services while maintaining the necessary isolation
01:48PM EDT - A purpose-built architecture to provide important services at line rate
01:49PM EDT - DPUs offload a good chunk of the CPU overhead
01:49PM EDT - Reducing the need for a set of external appliances
01:50PM EDT - And as part of Pensando SmartNICs, offer multiple new use cases
01:51PM EDT - Deployed into an existing infrastructure, or designed into a new one
01:52PM EDT - AMD is working with HP Aruba to develop a smart switch. An industry-standard switch enhanced with P4 DPUs
01:53PM EDT - And that's how AMD is helping customers evolve their data center environments and make them more efficient
01:53PM EDT - And now back to Lisa Su for a look at AI
01:54PM EDT - (An aurora background? That has to be intentional...)
01:54PM EDT - 3 key areas for AI: broad portfolio of CPUs and GPUs, open and proven software platform, and a deep ecosystem of partners
01:55PM EDT - AMD is uniquely positioned with a broad collection of AI platforms across everything from client to server
01:56PM EDT - Lisa is talking about some of the customers using AMD hardware today for AI tasks, not the least of which being NASA
More Context
keyboard_arrow_down keyboard_arrow_right What are the other applications that the chip could be used for?
crn.com AI, Cloud Expansion
neowin.net business and cloud
videocardz.com cloud native and technical computing
servethehome.com NICs, storage, and even memory
wepc.com high-performance computing (HPC) and AI workloads
finance.yahoo.com large language models and generative AI
latestly.com Software Enablement for Generative AI (Artificial Intelligence)
benzinga.com health care to 5G networks and data centers
zdnet.com large language models
wccftech.com various core IPs, memory interfaces, interconnects
beststocks.com high-performance computing, graphics, and visualization technologies
anandtech.com AI and HPC
prnewswire.com technical computing
01:56PM EDT - Meanwhile AMD is expecting more than 70 laptop designs to launch through later this year featuring Ryzen AI
01:57PM EDT - "We are very, very early in the lifecycle of the AI market"
01:57PM EDT - 50% compound annual growth rate, from $30B today
More Context
keyboard_arrow_down keyboard_arrow_right What is the current market share of Nvidia and AMD in the AI industry?
01:58PM EDT - Talking about some of AMD's supercomputer wins, including, of course, Frontier, the industry's first exascale supercomputer
01:59PM EDT - And now rolling a video on the Lumi supercomputer (#3 on the current Top500 list)
02:00PM EDT - More accurate and better models as a result of Lumi
More Context
keyboard_arrow_down keyboard_arrow_right What is the impact of AMD's new chip on the PC market?
finance.yahoo.com 'incredibly powerful' technology
marketwatch.com 'closest competitor' to Nvidia in AI hardware
videocardz.com offer leadership performance in cloud native and technical computing
latestly.com drive leadership performance and energy efficiency
crn.com help it compete against the likes of Nvidia, Intel and others
wepc.com the most advanced accelerator for generative AI
zdnet.com enormous memory and data throughput
neowin.net revolutionizes business laptops by enabling premium AI experiences
02:00PM EDT - Generative AI requires hardware as well as good software
02:00PM EDT - Now on stage: AMD's President, Victor Peng, to talk about the software side of matters
02:01PM EDT - Peng also heads up AMD's newly formed AI group
02:02PM EDT - Talking about some of AMD's accomplishments to date
02:03PM EDT - As well as sampling new Vitis AI products
02:04PM EDT - A significant portion of which is open source
02:04PM EDT - ROCm in its fifth generation, with a comprehensive suite of AI optimizations
02:05PM EDT - Another guest on stage: Soumith Chintala, the founder of PyTorch and VP at Meta
02:06PM EDT - Recapping PyTorch and what it's used for. One of the most popular AI frameworks on the market
02:08PM EDT - How does AMD's collab benefit the developer community?
02:09PM EDT - Removed a lot of the work required/friction in moving platforms
02:10PM EDT - PyTorch 2.0 offers day-0 support for ROCm 5
02:11PM EDT - Another guest on stage, Clement Delangue, CEO of Hugging Face
02:11PM EDT - Sharing his thoughts on why open source matters in AI
02:12PM EDT - Giving companies the tools to build AI themselves, rather than just relying on provided tools
02:13PM EDT - AMD and Hugging Face recently formalized their partnership, which is being announced today
02:14PM EDT - Hugging Face is the most used open platform for AI
02:14PM EDT - Over 5000 new models added to their service just last week
02:15PM EDT - And they will be optimizing these models for AMD's platforms
02:17PM EDT - AMD, of course, shares in this vision, which is why they're working with Hugging Face
02:17PM EDT - The rate of innovation for AI is unprecidented
02:18PM EDT - "We've made a tremendous amount of progress over the past year with ROCm"
02:19PM EDT - Generative AI and LLMs have changed the landscape
02:19PM EDT - New compute engine, latest data formats, 5/6nm processes
02:20PM EDT - Recapping the MI300, now known as the MI300A
02:20PM EDT - 24 Zen 4 CPU cores, 128GB HBM3 memory
02:20PM EDT - All in a single package with unified memory across the CPU and GPU
02:20PM EDT - MI300A is slated for use in the El Capitan supercomputer in LLNL
02:21PM EDT - AMD is replacing the Zen CPU chiplets to create a GPU-only version: MI300X
02:22PM EDT - 129GB HBM3, 5.2 TB/second of memory bandwidth, 896GB/ec memory bandwith, 153B transistors
02:22PM EDT - "Leadership generative AI accelerator"
02:22PM EDT - It looks very similar to MI300A. Removed 3 CPU chiplets, added 2 GPU chiplets
02:23PM EDT - So AMD has done an XPU 3+ years before Intel
02:23PM EDT - Comparing MI300X to NVIDIA's H100 accelerator in terms of HBM3 density and bandwidth
More Context
keyboard_arrow_down keyboard_arrow_right How does the chip compare to Nvidia's chips?
marketwatch.com closest competitor
zdnet.com 2.4 times the memory density
crn.com fewer GPUs
wepc.com the MI300X has up to 2.4X the HBM density and 1.6X the HBM bandwidth
finance.yahoo.com a purported rival
02:24PM EDT - AMD supports 8 HBM3 stacks, versus 6 on H100, which gives them a capacity and bandwidtbh advantage
02:24PM EDT - Doing a live demo with the Falcon-40B model running on one MI300X
02:26PM EDT - More memory and more memory bandwidth allows larger models, and also running LLMs on fewer GPUs overall
02:26PM EDT - Which AMD believes will offer a TCO advantage
02:28PM EDT - Making it easier to deploy MI300 into their existing server and AI infrastructure
02:28PM EDT - MI300A began sampling earlier this quarter. MI300X and the 8-way platform sampling in Q3 of this year
02:29PM EDT - Expecting both products to ramp into production later this year
02:29PM EDT - Recapping all of AMD's announcements, and the scope of AMD's overall data center product lineup
02:30PM EDT - And that's a wrap on AMD's data center event
02:30PM EDT - Thanks for joining us. Now off to find out more about MI300X