CrowdScience cover image

Could AI present CrowdScience?

CrowdScience

00:00

Measuring AI task performance over time

Alex Hern explains benchmarks for AI tasks, 50% versus 80% accuracy metrics, and rapid improvements in capabilities.

Play episode from 06:36
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app