The Serious Problems with AI & Why Humans Drink Alcohol

Something You Should Know

Evaluating Reliability Across Domains

Emily explains the challenges of evaluation and why accuracy claims only hold for narrow, well-defined tasks, which chat interfaces are not.

Play episode from 25:07