The Neuron: AI Explained cover image

OpenAI Researcher Explains How AI Hides Its Thinking (w/ OpenAI’s Bowen Baker)

The Neuron: AI Explained

00:00

Why weaker monitors can catch stronger models

Bowen explains how obvious deceptive phrases allow smaller or weaker monitors to detect subversive model strategies.

Play episode from 11:27
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app