
Constitutional AI: Harmlessness from AI Feedback
BlueDot Narrated
Key contributions and results
Summary of main findings: model-generated feedback, chain-of-thought benefits, and gains from the supervised learning (SL) and reinforcement learning (RL) stages.
Transcript
