From Atari to ChatGPT: How AI Learned to Follow Instructions
Linear Digressions
Labelers shape model behavior
Katie and Ben discuss how the judgments of the 40 contractors who labeled the data, and the screening used to select them, shaped InstructGPT's outputs and values.