
Constitutional AI Harmlessness from AI Feedback
BlueDot Narrated
00:00
Tension between helpfulness and harmlessness
Discussion of how helpfulness can increase harm and the aim to produce non-evasive harmless assistants.
Play episode from 05:36
Transcript


