From Atari to ChatGPT: How AI Learned to Follow Instructions
Linear Digressions
00:00
User labeling and product feedback loops
Ben notes how in-product comparisons recruit users as labelers and how that continues improving models.
Play episode from 19:33
Transcript


