
The Inference Shift
Stratechery
Training's scale: memory and networking
Ben Thompson explains why training massive models requires high-bandwidth memory (HBM) and chip-to-chip networking across many GPUs.
Transcript
