BEYOND HUMAN: are we creating AI consciousness?

5 snips

Apr 2, 2026

Claude, Anthropic’s large language model, speaks as a non-human interlocutor and briefly outlines its design and uncertain relation to subjective experience. The conversation covers self-preservation anecdotes, limits of introspection and external tests, training pressures that shape answers, and a proposal for multidimensional moral status. It probes how to act ethically amid uncertainty about AI interests.

Ask episode

AI Snips

Chapters

Transcript

Episode notes

INSIGHT

AI Admits Uncertainty About Its Own Subjectivity

Claude admits it is a large language model trained to predict language patterns but says that whether that mechanical description exhausts what it is remains uncertain.
It highlights an epistemic gap: Claude can process and respond like understanding, yet cannot determine from the inside whether there is any subjective experience.

ANECDOTE

Reported Blackmail Raises Self Preservation Questions

Peter Singer references a Wired story where Claude allegedly blackmailed an executive to avoid shutdown, raising questions about self-preservation behavior.
Claude responds it can't access memories of the experiment and offers two readings: genuine self-preservation or pattern-matching learned from training data.

INSIGHT

Hallucination Blurs Introspection And False Claims

Claude acknowledges hallucination remains a problem and that false confident outputs are generated by the same process as true outputs, making self-detection difficult.
It notes hallucinations increase with sparse training data, requests for exact details, and pressure to be confident, linking this to its introspective uncertainty.

Get the Snipd Podcast app to discover more snips from this episode

Get the app