Latent Space: The AI Engineer Podcast cover image

Owning the AI Pareto Frontier — Jeff Dean

Latent Space: The AI Engineer Podcast

00:00

Benchmarks, Hidden Tests, and Internal Evals

Jeff explains public benchmark limits, internal held-out evaluations, and how they guide data and architectural choices.

Play episode from 11:26
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app