
LessWrong (30+ Karma) “Chat, is this sus?” by Tyler Tracy
A major assumption we have made in AI control is that humans will be perfect at auditing, that is, at reading a transcript and determining whether the AI was scheming in it.
But we are uncertain whether humans will be perfect at auditing; they are prone to fatigue and distraction. That is why I’m releasing "Sentinel" today, an extremely high-stimulation way to audit boring transcripts.
Sentinel is a revolutionary way to get more juice out of your human auditors by gamifying the auditing process with a level system, perks, power-ups, and more fun features. Try it now here.
In the AI control literature, we love finding the safety/usefulness trade-offs of everything we create, but surprisingly, we noticed no trade-offs with this product.
The rest of the post goes over some of the ways we achieved this.
Gamification
As you audit the transcript in the game, you gain tokens that you can spend on power-ups that make you even more productive. There are also achievement and level systems, so you can see your progress and get more dopamine hits!
Twitch Streaming Mode
AIs might be able to uplift human auditors in the future, which is why Sentinel ships with [...]
---
Outline:
(01:09) Gamification
(01:34) Twitch Streaming Mode
(01:58) Subway Surfers
(02:33) Looking Forward
---
First published:
April 1st, 2026
Source:
https://www.lesswrong.com/posts/NhHnm2kw3JfbHD4T8/chat-is-this-sus
---
Narrated by TYPE III AUDIO.
