
Conversation 1 with Nora Belrose: AI, sentience, and Platonic Space
Thoughtforms Life
00:00
Need for new tools to detect subtle agency
Nora and Michael agree on developing empirical interpretability methods to reveal unanticipated behaviors in models.
Play episode from 58:42
Transcript


