
Constitutional AI: Harmlessness from AI Feedback
BlueDot Narrated
Related work and context
Connections to RLHF, Sparrow, self-critique literature, and scaling supervision proposals.