Knowledge Graph Insights

Robert Sanderson: Building Yale’s Cultural Heritage Knowledge Graph – Episode 46

Mar 16, 2026
Robert Sanderson, Senior Director for Digital Cultural Heritage at Yale and architect of the LUX discovery system. He discusses using compact knowledge graph and ontology patterns across Yale’s millions of artifacts. Topics include AI and LLMs speeding graph construction, a three-billion-triple KG with a tiny core ontology, handling non-object collection data, reconciling lesser-known people, and extensibility for new domains.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Yale’s Three Billion Triple Knowledge Graph

  • Yale built a 3 billion-triple knowledge graph that aligns people, places, events, works, and collections across multiple museums and libraries.
  • The graph uses Linked Art on top of CIDOC-CRM design patterns and exposes queries via a user-friendly UI instead of forcing SPARQL.
ADVICE

Translate Research Questions Into Graph Queries

  • Design interfaces for natural language input because 95% of users default to keyword or sentence queries rather than graph queries.
  • Translate user research questions into graph queries with an AI-backed natural language processor, not a chatbot.
ANECDOTE

Handwritten Text Recognition Unlocks Vast Archives

  • Yale used LLM-based handwritten text recognition on 650,000 archive images using an HPC cluster, costing about $1,200 in compute time.
  • That work produced full text that would have cost roughly $12 million in human transcription, enabling automated triple extraction into the KG.
Get the Snipd Podcast app to discover more snips from this episode
Get the app