
Constitutional AI Harmlessness from AI Feedback
BlueDot Narrated
00:00
Simplicity, transparency, and chain-of-thought
Why encoding goals as principles and using chain-of-reasoning improves legibility and control.
Play episode from 10:32
Transcript


