The Information Bottleneck cover image

EP25: Personalization, Data, and the Chaos of Fine-Tuning with Fred Sala (UW-Madison / Snorkel AI)

The Information Bottleneck

00:00

Personalization via RL and efficiency concerns

Fred argues RL with human feedback is likely needed for frontier personalization but must become more efficient.

Play episode from 38:30
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app