Epoch After Hours cover image

AI math capabilities could be jagged for a long time – Daniel Litt

Epoch After Hours

00:00

Pitfalls when curating hard benchmark problems

Litt warns problems authored by busy researchers may be easy for experts and quickly saturated by models.

Play episode from 01:30:10
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app