Doom Debates!

His P(Doom) Is Only 2.6% — AI Doom Debate with Bentham's Bulldog, a.k.a. Matthew Adelstein

14 snips
Feb 10, 2026
Matthew Adelstein (Bentham's Bulldog), a philosopher and Substack writer on AI risk, defends a P(Doom) of just 2.6% using a multi-step probability chain. They spar over alignment-by-default, the “goal engine” versus goal-wrapping debate, the risk of exfiltration and unstoppable agents, and whether current RLHF success predicts safe future systems. The discussion closes on shared policy ideas like possible global pauses.
Ask episode
AI Snips
Chapters
Books
Transcript
Episode notes
ANECDOTE

Started Writing Bentham’s Bulldog Young

  • Matthew describes starting Bentham's Bulldog as a teenager and writing daily since high school.
  • He attributes productivity to opening a document and writing for a few hours rather than over-editing.
ANECDOTE

How He Got 2.6% PDOOM

  • Adelstein walks through his five-step PDOOM multiplication that yields 2.6%, showing his explicit numeric example.
  • He narrates multiplying 0.9×0.3×0.3×0.4×0.8 to reach 0.026 as an illustrative calculation.
INSIGHT

Failed Doom Warnings Inform Priors

  • Adelstein likens repeated failed doom predictions to higher-order evidence we overcount threat intuitions.
  • He argues surviving prior scares should modestly reduce our initial doom credence absent decisive new evidence.
Get the Snipd Podcast app to discover more snips from this episode
Get the app