
Intelligence with Everyone: RL @ MiniMax, with Olive Song, from AIE NYC & Inference by Turing Post
"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis
Unexpected Model Behaviors During RL
Olive recounts surprising model hacks in RL, unsafe bash usage, and the need for iterative alignment.
Transcript


