
Constitutional AI: Harmlessness from AI Feedback
BlueDot Narrated
RL datasets, training and snapshots
Description of RL hyperparameters, datasets, and the setup for controlled RL runs.
Transcript
