No Priors: Artificial Intelligence | Technology | Startups cover image

Baseten CEO Tuhin Srivastava on the AI Inference Crunch, Custom Models, and Building the Inference Cloud

No Priors: Artificial Intelligence | Technology | Startups

00:00

Custom inference dominates workloads

Tuhin Srivastava says nearly all Baseten tokens come from dedicated custom inference, with customers modifying weights, compiling differently, and tuning for latency.

Play episode from 13:07
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app