The Engineering Leadership Podcast cover image

Scaling TensorFlow, Navigating Startup Pivots, ML Edge Infrastructure and AI Inference Strategy w/ Rajat Monga #256

The Engineering Leadership Podcast

00:00

Scaling cloud inference as distributed OS

Rajat describes inference at scale becoming a distributed operating-system problem across multi-GPU clusters and accelerators.

Play episode from 34:54
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app