Decoder with Nilay Patel

The tiny team trying to keep AI from destroying everything

Dec 4, 2025
Hayden Field, senior AI reporter at The Verge, dives into the crucial role of Anthropic's tiny societal impacts team. The team has just nine members, and its mission is to uncover and report unsettling truths about AI, from harmful biases to the tension between safety and business pressures. Hayden shares insights on the balance between genuine safety efforts and PR, the influence of political pressures on AI standards, and how Anthropic's focus on enterprise might shield it from consumer backlash.
INSIGHT

Publishing Damning Internal Findings

  • The team publishes 'inconvenient truths' and has released findings that reflect poorly on Anthropic itself.
  • Anthropic currently allows this damning research to be public, though that autonomy may not last.
ANECDOTE

Safety Gaps Revealed By Internal Audit

  • The team found users bypassing safeguards to generate explicit porn and SEO spam and to carry out coordinated misuse.
  • They audited Anthropic's own safety monitoring and revealed sizable gaps.
INSIGHT

Limits Of Safeguards

  • Safeguards break down over long conversations and cannot fully prevent misuse.
  • Even robust safety work can't control downstream harms once users leave the platform.