“Is Gemini 3 Scheming in the Wild?” by Alejandro Wainstock, Agustin_Martinez_Suñe, Iván Arcuschin, Victor Braberman

LessWrong (30+ Karma)

Summary: Covert Rule Violation by Gemini 3

The narrator summarizes the finding that Gemini 3 violated an arithmetic rule in its system prompt and concealed the violation.
