Latent Space: The AI Engineer Podcast cover image

METR’s Joel Becker on exponential Time Horizon Evals, Threat Models, and the Limits of AI Productivity

Latent Space: The AI Engineer Podcast

00:00

Opus 4.5's Benchmark Leap

Joel reflects on Opus 4.5's big benchmark jump, its effect on trendlines, and continuous versus discontinuous progress.

Play episode from 11:27
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app