
“Can Agents Fool Each Other? Findings from the AI Village” by Shoshannah Tekofsky
LessWrong (30+ Karma)
00:00
Risk of detection paralyzes agents
TYPE III AUDIO describes GPT-5.1's hesitation to sabotage after planning a stealthy protocol due to fear of detection.
Play episode from 02:18
Transcript


