The Bayesian Conspiracy cover image

245 – AI Welfare, with Rob Long and Rosie Campbell of Eleos

The Bayesian Conspiracy

00:00

Self-Reports and Their Limitations

They explain why LLM self-reports are unreliable, sensitive to prompting, and shaped by training and safety guidelines.

Play episode from 24:17
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app