"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis cover image

AI Deception, Interpretability, and Affordances with Apollo Research CEO Marius Hobbhahn

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis

00:00

Advancements and Interpretability in AI

This chapter explores the rapid evolution of AI capabilities with a focus on multimodal models and the significance of economic efficiency. The discussion highlights the challenges of interpretability in AI systems, particularly regarding deceptive alignment and long-term goals. The speakers advocate for robust tools to understand AI behavior, reflecting on the implications for accountability and the advancement towards Artificial General Intelligence (AGI).

Play episode from 34:28
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app