Captaining IMO Gold, Deep Think, On-Policy RL, Feeling the AGI in Singapore — Yi Tay

Latent Space: The AI Engineer Podcast

On-policy vs. off-policy RL philosophy

Yi contrasts on-policy training with imitation learning and argues models must learn from their own mistakes.

