
Constitutional AI Harmlessness from AI Feedback
BlueDot Narrated
00:00
Models, data and baseline training
Explanation of pretraining, RLHF helpful models, and collection of helpfulness and harmlessness data.
Play episode from 20:41
Transcript

Explanation of pretraining, RLHF helpful models, and collection of helpfulness and harmlessness data.