Training's scale: memory and networking

Ben Thompson explains why training massive models requires high-bandwidth memory (HBM) and chip-to-chip networking across many GPUs.
