Latent Space: The AI Engineer Podcast cover image

NVIDIA's AI Engineers: Agent Inference at Planetary Scale and "Speed of Light" — Nader Khalil (Brev), Kyle Kranen (Dynamo)

Latent Space: The AI Engineer Podcast

00:00

Grove and Kubernetes scaling for Dynamo

Kyle describes Grove, Dynamo's Kubernetes component, and dynamic scaling of prefill and decode worker ratios.

Play episode from 41:09
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app