The Nonlinear Library

AF - The goal-guarding hypothesis (Section 2.3.1.1 of "Scheming AIs") by Joe Carlsmith

Dec 2, 2023
Ask episode
Chapters
Transcript
Episode notes