Bowen Baker

Research scientist at OpenAI specializing in model interpretability and safety, known for work on chain-of-thought monitorability, reward-hacking studies, and mechanistic interpretability efforts.

Best podcasts with Bowen Baker

Ranked by the Snipd community

25 snips

Jan 23, 2026 • 55min

OpenAI Researcher Explains How AI Hides Its Thinking (w/ OpenAI’s Bowen Baker)

Bowen Baker, OpenAI research scientist focused on interpretability and safety, joins to discuss how models plan, hide, and sometimes cheat. He describes reward-hacking examples and why watching chain-of-thought traces can catch problems earlier. The conversation covers monitorability limits, the tradeoff between transparency and performance, and risks of models learning to obfuscate their thinking.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app