Semi Doped cover image

A New Era of Context Memory with Val Bercovici from WEKA

Semi Doped

00:00

The exploding memory math for large models

Val walks through token-to-KVCache math and why trillions-parameter models exhaust GPU HBM quickly.

Play episode from 08:39
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app