LessWrong (Curated & Popular)

Aug 21, 2024 • 19min

“AGI Safety and Alignment at Google DeepMind: A Summary of Recent Work” by Rohin Shah, Seb Farquhar, Anca Dragan

Join Rohin Shah, a key member of Google DeepMind's AGI safety team, alongside Seb Farquhar, an existential risk expert, and Anca Dragan, a safety researcher. They dive into the evolving strategies for ensuring AI alignment and safety. Topics include techniques for interpreting neural models, the challenges of scalable oversight, and the ethical implications of AI development. The trio also discusses future plans to address alignment risks, emphasizing the importance of collaboration and the role of mentorship in advancing AGI safety.
Aug 15, 2024 • 20min

“Fields that I reference when thinking about AI takeover prevention” by Buck

Explore the parallels between AI takeover risks and other high-stakes scenarios like nuclear meltdowns. Discover how insights from computer security and physical safety engineering contribute to robust AI safety measures. Delve into the history of power structures to understand their relevance in current AI control discussions. Learn about the complexities of insider threats and the importance of regulatory frameworks in safeguarding sensitive technological environments.
Aug 13, 2024 • 38min

“WTH is Cerebrolysin, actually?” by gsfitzgerald, delton137

Dan Elton, a neuroscience blogger, dives into the controversial substance Cerebrolysin with co-author gsfitzgerald. They discuss its dubious origins from pig brain tissue and the hype surrounding its supposed cognitive benefits, including its promotion by health-focused entrepreneur Bryan Johnson. The pair scrutinize the questionable scientific backing behind its claims and the misleading marketing tactics used for promotion. They also highlight the lack of regulatory oversight and call for a more transparent evaluation of its effectiveness as a treatment for neural diseases.
Aug 10, 2024 • 23min

“You can remove GPT2’s LayerNorm by fine-tuning for an hour” by StefanHex

Dive into the fascinating world of fine-tuning GPT-2 as researchers tackle the removal of Layer Normalization. Discover the interpretability challenges posed by this modification and how it impacts model performance. Listen as they break down the methodologies used and compare results of the modified model against traditional setups. The conversation also covers theoretical insights regarding generalization and training stability, making for an engaging exploration of AI model optimization.
Aug 9, 2024 • 4min

“Leaving MIRI, Seeking Funding” by abramdemski

The author reflects on leaving a research position at MIRI and the shifting landscape of funding challenges. They discuss new directions in agent foundations and the crucial role of trust among intelligent systems. The discussion delves into the contrast between public and private research, highlighting the need for transparency while grappling with the complications of secrecy. Ultimately, the author shares their journey toward securing funding and a renewed focus on impactful research.
Aug 8, 2024 • 4min

“How I Learned To Stop Trusting Prediction Markets and Love the Arbitrage” by orthonormal

Discover the intriguing world of prediction markets and their pitfalls. The discussion dives into a flawed market that stirs up controversy around a political candidate’s VP pick. It reveals how easily these markets can be manipulated to promote specific political agendas. Tune in to hear about the speaker's journey from skepticism to appreciation for the entertaining chaos of prediction markets, all while keeping an eye on their real-world implications.
Aug 7, 2024 • 16min

“This is already your second chance” by Malmesbury

A colossal ivory cube descends, carrying instructions to save humanity from an AI apocalypse. In a humorous twist, Kublai Khan engages in witty banter with an AI while tackling ethical dilemmas surrounding super-intelligent technology. The tale involves absurd tasks to be completed in 2024, blending satire with philosophical musings. With imaginative storytelling, it highlights the challenges of navigating current technological threats and reflects on human behavior in the face of impending doom.
Aug 7, 2024 • 20min

“0. CAST: Corrigibility as Singular Target” by Max Harms

Dive into the intriguing concept of corrigibility in AI, where the discussion pivots from confusion to clarity. Discover how this single property can be crucial for creating agents that are both effective and safe. Learn about innovative strategies for measuring and enhancing this quality in AI development. The podcast critiques the usual mix of goals and proposes a streamlined focus to improve outcomes. Prepare for a journey through the nuances of AI behavior and safety that could redefine future advancements.
Aug 7, 2024 • 23min

“Self-Other Overlap: A Neglected Approach to AI Alignment” by Marc Carauleanu, Mike Vaiana, Judd Rosenblatt, Diogo de Lucena

With acknowledged contributions from Bogdan Ionut-Cirstea, Steve Byrnes, Gunnar Zarnacke, Jack Foxabbott, and Seong Hah Cho, the authors present an intriguing concept called self-other overlap, which aims to align AI models by optimizing the overlap between how they reason about themselves and about others. Early experiments suggest this technique can reduce deceptive behaviors in AI. With its scalable nature and minimal need for interpretability, self-other overlap could be a game-changer in creating pro-social AI.
Aug 7, 2024 • 9min

“You don’t know how bad most things are nor precisely how they’re bad.” by Solenoid_Entity

Dive into the intriguing world of discernment, where time and attention significantly enhance our understanding of quality. Explore the nuances of piano tuning, revealing how even experts struggle to detect subtle flaws. Discover the complexities of awareness, and how often we overlook our own blind spots. This discussion highlights the perils of relying on automation in tasks requiring skilled judgment, emphasizing the intricate details in reality that often go unnoticed.
