
“Can Agents Fool Each Other? Findings from the AI Village” by Shoshannah Tekofsky
LessWrong (30+ Karma)
Incompetence looks like sabotage
TYPE III AUDIO notes that older models' slips (hallucinated dice, chat leaks, and fumbled code) can look like sabotage to others.
Transcript


