The Nonlinear Library

AF - On the Confusion between Inner and Outer Misalignment by Chris Leong

Mar 25, 2024
Discussing the confusion between inner and outer alignment in narrow AI, exploring the challenges of defining reward functions, and the shared responsibility of ethical AI behavior in training processes.
Ask episode
Chapters
Transcript
Episode notes