The Leverage Podcast cover image

The Real Reason Claude Mythos Should Alarm You

The Leverage Podcast

00:00

Limits of Benchmarks and Growing Uncertainty

Evan argues benchmarks are failing and we increasingly rely on vibes and surveys to judge capabilities.

Play episode from 02:39
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app