Reiner Pope – The math behind how LLMs are trained and served

Dwarkesh Podcast

MoE Layers Fit Naturally on Racks

Reiner Pope maps experts across GPUs, explains all-to-all routing, and shows why one rack is a natural boundary for expert parallelism.

Play episode from 31:58