
Controlling AI Models from the Inside
Practical AI
00:00
Interpretability and mechanistic approaches
Alizishaan introduces interpretability and mechanistic interpretability to observe and control internal model behavior.
Play episode from 11:48
Transcript


