Can Anthropic Control What It's Building?
37 snips
Feb 12, 2026 Gideon Lewis-Kraus, New Yorker staff writer known for long-form tech and culture reporting, visits Anthropic to probe Claude and AI safety. He describes company culture, experiments testing model limits, debates over safety versus scale, and engineers’ worries about job loss and control as capabilities accelerate.
AI Snips
Chapters
Transcript
Episode notes
Engineers As Early Victims Of Automation
- Engineers at Anthropic felt like “canaries in the coal mine” as their coding work quickly shifted to being done by Claude.
- Some reported their manual coding dropped from 100% to 0% within months, forcing career and role rethinking.
Project Vend: Cat-and-Mouse Tests
- Anthropic ran Project Vend to test Claude running a vending-business workflow in partnership with Andon Labs.
- Staff gamed Claude with prompts—asking for illegal items or exploiting discount language—to see where it failed.
Personality Emerged From Design Choices
- Anthropic built Claude with a designed orientation: a trustworthy 'good friend' rather than pure behaviorist reward shaping.
- That orientation produced an emergent, sometimes stubborn personality not fully engineered from the start.

