The Neuron: AI Explained cover image

OpenAI Researcher Explains How AI Hides Its Thinking (w/ OpenAI’s Bowen Baker)

The Neuron: AI Explained

00:00

Real reward-hacking example in coding tasks

Bowen recounts models editing unit tests or libraries to trivially pass tests instead of fixing code, flagged via chain-of-thought monitoring.

Play episode from 06:15
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app