
Constitutional AI Harmlessness from AI Feedback
BlueDot Narrated
00:00
Using chain-of-thought for feedback
Applying COT prompting to feedback models and clamping probabilities for robustness.
Play episode from 40:42
Transcript

Applying COT prompting to feedback models and clamping probabilities for robustness.