#235 - Sonnet 4.6, Deep-thinking tokens, Anthropic vs Pentagon

Last Week in AI

Mechanistic Interpretability: Geometry of Counting

Andrey and Jeremie summarize Anthropic's interpretability work on counting tasks and manifolds in models.

Play episode from 01:04:39
