
Intelligence with Everyone: RL @ MiniMax, with Olive Song, from AIE NYC & Inference by Turing Post
"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis
Scaling Environments and Expert Rewards
Olive describes scaling diverse training environments and using in-house expert developers as reward models.
Transcript


