The Nonlinear Library

LW - New paper shows truthfulness & instruction-following don't generalize by default by joshc

Nov 19, 2023
Ask episode
Chapters
Transcript
Episode notes