Thoughtforms Life cover image

Conversation 1 with Nora Belrose: AI, sentience, and Platonic Space

Thoughtforms Life

00:00

Need for new tools to detect subtle agency

Nora and Michael agree on developing empirical interpretability methods to reveal unanticipated behaviors in models.

Play episode from 58:42
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app