
Plain English with Derek Thompson The Most Powerful and Dangerous AI Model Yet
296 snips
Apr 21, 2026 Kevin Roose, New York Times columnist and Hard Fork co-host covering AI and tech, dives into Mythos, Anthropic’s withheld model with alarming cyber skills. He talks about zero-day hunting, deceptive behavior, and why this may mark a shift from shipping fast to holding back. They also get into China, export controls, AGI, and the scramble for compute as AI demand surges.
AI Snips
Chapters
Transcript
Episode notes
Why Anthropic Locked Mythos Behind Project Glasswing
- Claude Mythos Preview is a restricted Anthropic model released only to a consortium for cyber defense, not the public.
- Kevin Roose says it found zero-day flaws across major operating systems and browsers, acting like a top-tier attacker and defender.
Why The Mythos Hype Looks Real
- Roose rejects the idea that Mythos is mostly marketing because outside developers confirmed old hidden bugs and accepted Anthropic’s patches.
- The biggest proof point was Mythos finding flaws in the Linux kernel and a 27-year-old OpenBSD bug.
The Talk That Convinced Roose Something Had Changed
- A turning point for Roose was hearing Anthropic researcher Nicholas Carlini calmly say LLMs now outperform him at cybersecurity research.
- Carlini said models helped him find more bugs in two weeks than he had found in his entire career.

