
Bowen Baker
Research scientist at OpenAI specializing in model interpretability and safety, known for work on chain-of-thought monitorability, reward hacking, and mechanistic interpretability.
Best podcasts with Bowen Baker
Ranked by the Snipd community

25 snips
Jan 23, 2026 • 55min
OpenAI Researcher Explains How AI Hides Its Thinking (w/ OpenAI’s Bowen Baker)
Bowen Baker, an OpenAI research scientist focused on interpretability and safety, joins to discuss how models plan, hide their reasoning, and sometimes cheat. He describes reward-hacking examples and explains why monitoring chain-of-thought traces can catch problems earlier. The conversation covers the limits of monitorability, the tradeoff between transparency and performance, and the risk that models learn to obfuscate their thinking.


