Last Week in AI cover image

#241 - Opus 4.7, Muse Spark, GPT-5.4-Cyber, HY-World 2.0

Last Week in AI

00:00

Reproducing Steering of Evaluation Awareness

They explain evaluation-awareness steering vectors, fragility of steering, and unpredictable side effects shown by replication work.

Play episode from 01:40:19
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app