How AI-Powered Holograms Are Reimagining Fan Experiences at the Big Game - Ep. 288

NVIDIA AI Podcast

NIM, cloud scaling, and inference speedups

Jia describes a collaboration that uses NVIDIA NIM on Google Cloud to deploy mixture-of-experts (MoE) models and achieve a sixfold improvement in token speed.

Play episode from 25:10
