Semi Doped cover image

An Interview with Microsoft's Saurabh Dighe About Maia 200

Semi Doped

00:00

KV-Cache Strategies and Long Contexts

Saurabh explains handling hot, warm, and cold KV cache via HBM, head-node DDR/SSD, and Azure Boost remote storage.

Play episode from 48:08
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app