
Constitutional AI: Harmlessness from AI Feedback
BlueDot Narrated
Overtraining and tone adjustments (from 48:14)
Discussion of Goodharting, boilerplate responses, and tuning principles to avoid overreaction.
Transcript
