The Circuit cover image

EP 163: Breaking the Memory Wall: Micron’s Strategy for the AI Era

The Circuit

00:00

The inference era and 'memory wall'

Jeremy distinguishes training versus inference and introduces the KV cache problem causing the memory wall.

Play episode from 10:16
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app