AF - Announcing Neuronpedia: Platform for accelerating research into Sparse Autoencoders by Johnny Lin

Mar 25, 2024

Johnny Lin, a researcher working on sparse autoencoders (SAEs), talks about Neuronpedia, a platform for SAE research. Topics include hosting models, data visualizations, collaboration opportunities, and the importance of mechanistic interpretability. The episode also covers open challenges, tools like Transformer Debugger, and the implications of SAEs for AI alignment and dual-use risks in technical AI safety.

Chapters

Introduction

00:00 • 5min

Advancements in Neuronpedia for Sparse Autoencoder Research

04:45 • 4min

Exploring Features and Quality Control for Sparse Autoencoders in Research

08:20 • 2min

Exploring Sparse Autoencoders and Neuronpedia Platform

10:05 • 3min