METR’s Joel Becker on exponential Time Horizon Evals, Threat Models, and the Limits of AI Productivity

Latent Space: The AI Engineer Podcast

RE-Bench, SWAA, and Task Tiers

Joel outlines METR's task suites, from atomic SWAA tasks to HCAST and RE-Bench research engineering challenges.

Play episode from 07:40