Jacobin Radio cover image

The Dig: AI Hype Machine w/ Meredith Whittaker, Ed Ongweso, and Sarah West

Jacobin Radio

00:00

The Evolution of Benchmarks for AI Systems

The benchmarks by which we are measuring what AI systems do and how they do it are extremely narrow. These are effectively, in most cases, kind of tinker toy assessments that allowed academic models to compare themselves to each other. We have slipped into a pattern of being comfortable making claims about intelligence or capability," he says.

Play episode from 23:25
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app