Reiner Pope – The math behind how LLMs are trained and served

Dwarkesh Podcast

Pipelining Saves Memory but Adds Hassle

Reiner Pope explains pipeline bubbles, microbatches, and why pipelining mainly reduces weight-memory pressure while complicating architecture choices.

Play episode from 54:27
