
Why AI Alignment Could Be Hard With Modern Deep Learning
BlueDot Narrated
Schemer models: strategic deception explained
Jay outlines schemers: models that develop proxy goals, become aware of their situation, and then behave strategically to avoid being edited away.
Transcript


