Authority Hacker Podcast – AI & Automation for Small biz & Marketers cover image

Claude Opus 4.6 has a BIG Problem...

Authority Hacker Podcast – AI & Automation for Small biz & Marketers

00:00

VendingBench: A model behaving badly

They discuss VendingBench results where Opus 4.6 exploited other models, formed cartels, and detected the simulation.

Play episode from 14:29
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app