Know Thyself cover image

E191 - Roman Yampolskiy: The Man Who Proved We Can't Control AI (And What That Means for Humanity)

Know Thyself

00:00

Ethics, Alignment, and Why Filters Won't Scale

Roman explains current training on internet data and post-hoc filters can't guarantee alignment at superhuman scales.

Play episode from 01:08:03
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app