Mystery AI Hype Theater 3000 cover image

A Bad Case of Hype-itis, 2026.02.02

Mystery AI Hype Theater 3000

00:00

Evaluating AI: benchmarks and shifting models

They explain poor evaluation practices, proprietary model drift, and the challenge of apples-to-apples comparisons.

Play episode from 16:56
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app