Scaling Laws cover image

Why AI Needs Independent Auditors, with Miles Brundage

Scaling Laws

00:00

Role and Limits of Benchmarks (BenchRisk)

Miles explains benchmarks' usefulness, BenchRisk findings, Goodhart problems, and limits in mapping to real-world misuse.

Play episode from 42:40
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app