
Illustrating Reinforcement Learning from Human Feedback (RLHF)
BlueDot Narrated
00:00
Why human feedback matters for language models
Perrin Walker explains limitations of token-prediction losses and motivates using human preferences.
Play episode from 00:17
Transcript


