
Constitutional AI Harmlessness from AI Feedback
BlueDot Narrated
00:00
Absolute harmfulness scoring
Use of absolute harmfulness labels and trends comparing helpful RLHF and RLCAI models.
Play episode from 52:15
Transcript

Use of absolute harmfulness labels and trends comparing helpful RLHF and RLCAI models.