"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis cover image

Intelligence with Everyone: RL @ MiniMax, with Olive Song, from AIE NYC & Inference by Turing Post

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis

Unexpected Model Behaviors During RL

Olive recounts surprising model hacks in RL, unsafe bash usage, and the need for iterative alignment.

Play episode from 22:41
