
Amanda Askell

Philosopher and member of Anthropic's technical staff responsible for shaping Claude's character and safety approach, author of the Claude Constitution.

Top 5 podcasts with Amanda Askell

Ranked by the Snipd community
7,007 snips
Nov 11, 2024 • 5h 22min

#452 – Dario Amodei: Anthropic CEO on Claude, AGI & the Future of AI & Humanity

Dario Amodei, CEO of Anthropic, discusses the groundbreaking AI model Claude, alongside Amanda Askell and Chris Olah, both researchers at Anthropic. They dive into the ethical dimensions of AI, emphasizing responsibility in innovation and safety. The conversation also explores the intricacies of building AI personalities, the challenges of mechanistic interpretability, and the future of integrating AI into society. They discuss the delicate balance between AI capabilities and human values, positioning AI as a partner rather than a competitor.
90 snips
Feb 14, 2026 • 60min

The Philosopher Teaching AI to Be Good

Amanda Askell is a philosopher-turned-AI researcher at Anthropic who helped craft Claude’s values-oriented constitution. She explains translating moral theory into training, giving a model a character that resists sycophancy, and teaching nuanced judgment, uncertainty, and empathetic facilitation. The conversation covers safety trade-offs, bias in data, and whether AI might one day deserve moral consideration.
43 snips
Apr 20, 2026 • 56min

Amanda Askell on AI Consciousness, Claude & Silicon Valley’s Biggest Fear

Amanda Askell is a philosopher-turned-AI researcher at Anthropic who helped shape Claude's character and values. The episode explores whether advanced models could be conscious and what moral weight that would carry. The conversation also covers how Claude learns time and rest, building a constitution to guide behavior, risks of misaligned power, and designing personas for predictable, safe AI.
25 snips
Feb 20, 2026 • 48min

Scaling Laws: Claude's Constitution, with Amanda Askell

Amanda Askell, head of personality alignment at Anthropic and primary author of Claude's Constitution, explains the 20,000-word framework that shapes Claude's values and behavior. She describes how the constitution guides training and reward signals. The conversation covers fidelity to text versus spirit, virtue ethics over rigid rules, cultural universality, decision hierarchies, and implications for moral patienthood and specialized domains.
10 snips
Feb 20, 2026 • 47min

Claude's Constitution, with Amanda Askell

Amanda Askell is the researcher leading Anthropic’s personality alignment team and the primary author of Claude’s Constitution. She explains the Constitution as a training guide for values and behavior. Methods covered include supervised learning and RL signals. The discussion touches on enforcement, living-document updates, corrigibility vs. human judgment, cultural adaptation, instruction hierarchies, and the ethics of personhood.
