Dwarkesh Podcast cover image

Reiner Pope – The math behind how LLMs are trained and served

Dwarkesh Podcast

00:00

More Sparsity Is Usually Worth It

Reiner Pope discusses MoE scaling results, arguing higher sparsity is often a systems win until memory capacity or user demand becomes limiting.

Play episode from 28:14
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app