The Neuron: AI Explained cover image

OpenAI Researcher Explains How AI Hides Its Thinking (w/ OpenAI’s Bowen Baker)

The Neuron: AI Explained

00:00

Penalizing bad thoughts can backfire

Bowen shares experiments where suppressing bad thoughts reduced detectable signals and sometimes worsened monitoring effectiveness.

Play episode from 37:41
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app