245 – AI Welfare, with Rob Long and Rosie Campbell of Eleos

The Bayesian Conspiracy

Research on Model Self-Knowledge

Rob describes studies showing that models can predict their own behavior, and discusses how training for introspection might improve their self-reports.

Play episode from 26:52
