LessWrong (30+ Karma) cover image

“The state of AI safety in four fake graphs” by Boaz Barak

LessWrong (30+ Karma)

00:00

Human Supervision and LHF Progress

Boaz notes we've moved past purely reliable human supervision limits but still can improve alignment using current techniques.

Play episode from 02:29
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app