The Nonlinear Library

AF - AtP*: An efficient and scalable method for localizing LLM behaviour to components by Neel Nanda

Mar 18, 2024