The Nonlinear Library

AF - Announcing Neuronpedia: Platform for accelerating research into Sparse Autoencoders by Johnny Lin

Mar 25, 2024
Johnny Lin, a researcher in Sparse Autoencoders, talks about the Neuronpedia platform for SAE research. Topics include hosting models, data visualizations, collaboration opportunities, and the importance of mechanistic interpretability. They discuss the challenges, tools like Transformer debugger, and the implications of SAEs for AI alignment and dual-use risks in technical AI safety.
Ask episode
Chapters
Transcript
Episode notes