
Controlling AI Models from the Inside
Practical AI
00:00
Where to start on model safety
Alizishaan advises identifying general undesirables and context-specific risks, then planning mitigations and detection.
Play episode from 08:46
Transcript

Alizishaan advises identifying general undesirables and context-specific risks, then planning mitigations and detection.