Data Processing Evolved: OpenLineage with Willy Lulciuc

Nov 18, 2025
Willy Lulciuc, a data engineer and co-creator of OpenLineage, shares his journey in the field from his time at WeWork to founding Oleander, which focuses on AI-enabled data tooling. He discusses the importance of data lineage and the challenges of observability in data processing. Willy explains how OpenLineage emerged from previous projects, the synergy with OpenTelemetry, and how AI can enhance, but not replace, the role of data engineers. He also outlines the future of data platforms and the unique benefits of OpenLineage for engineering teams.
ADVICE

Design Runtime Lineage Events

  • Emit ordered runtime events (start, running, abort, complete) and include SQL and inputs/outputs.
  • Build a consumer backend that handles ordering, scale, and occasional out-of-order events.
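A minimal sketch of the advice above, assuming plain Python dataclasses rather than the OpenLineage client library. The names `RunEvent`, `RunState`, `LineageConsumer`, and `STATE_RANK` are hypothetical and chosen only for illustration. The consumer merges inputs, outputs, and SQL from every event it sees, and only moves a run's state forward, so a late-arriving start event cannot undo a completion that was already recorded.

```python
from dataclasses import dataclass, field
from datetime import datetime, timedelta, timezone
from typing import Optional

# Hypothetical ranking of run states: a run only ever moves to a higher rank.
STATE_RANK = {"START": 0, "RUNNING": 1, "ABORT": 2, "COMPLETE": 2}


@dataclass(frozen=True)
class RunEvent:
    """One runtime lineage event emitted by a job run (illustrative shape)."""
    event_type: str              # one of the keys in STATE_RANK
    event_time: datetime
    run_id: str
    job_name: str
    inputs: tuple = ()           # dataset names read by the run
    outputs: tuple = ()          # dataset names written by the run
    sql: Optional[str] = None    # SQL text, when the job is SQL-based


@dataclass
class RunState:
    """What the backend knows about a single run so far."""
    state: Optional[str] = None
    started_at: Optional[datetime] = None
    ended_at: Optional[datetime] = None
    inputs: set = field(default_factory=set)
    outputs: set = field(default_factory=set)
    sql: Optional[str] = None


class LineageConsumer:
    """Folds events into per-run state and tolerates out-of-order arrival."""

    def __init__(self):
        self.runs = {}  # run_id -> RunState

    def handle(self, event: RunEvent) -> RunState:
        run = self.runs.setdefault(event.run_id, RunState())

        # Metadata is merged no matter when the event arrives.
        run.inputs.update(event.inputs)
        run.outputs.update(event.outputs)
        if event.sql and run.sql is None:
            run.sql = event.sql

        # Timestamps come from the event itself, not from arrival order.
        if event.event_type == "START":
            run.started_at = event.event_time
        elif event.event_type in ("ABORT", "COMPLETE"):
            run.ended_at = event.event_time

        # Only advance the state; a late START never overwrites COMPLETE.
        if run.state is None or STATE_RANK[event.event_type] > STATE_RANK[run.state]:
            run.state = event.event_type
        return run


# Usage: the COMPLETE event is delivered before the START event.
t0 = datetime(2025, 1, 1, 12, 0, tzinfo=timezone.utc)
consumer = LineageConsumer()
consumer.handle(RunEvent("COMPLETE", t0 + timedelta(minutes=5), "run-1",
                         "daily_orders", outputs=("warehouse.orders",)))
state = consumer.handle(RunEvent(
    "START", t0, "run-1", "daily_orders", inputs=("raw.orders",),
    sql="INSERT INTO warehouse.orders SELECT * FROM raw.orders"))
print(state.state, state.started_at, sorted(state.inputs), sorted(state.outputs))
```

In this sketch the run stays in the completed state after the late start event, while its start time, inputs, and SQL are still filled in. A production backend would also need durable storage and partitioning by run ID to handle scale, which is out of scope here.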
ANECDOTE

Acquisition And Astro Observe

  • Datakin grew to a small engineering team and was acquired by Astronomer within two years.
  • The product became Astro Observe, integrating OpenLineage events into Airflow observability for customers.
INSIGHT

Data Platform As A Graph

  • Oleander views the data platform as a graph of nodes and edges where runs produce dataset versions.
  • Combining that graph with LLM context enables automated root-cause analysis of pipeline failures and runtime drift.
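The graph idea above can be sketched in a few lines of Python. This is an illustrative model only, not Oleander's implementation: the `LineageGraph` class, its method names, and the status strings are assumptions. Runs are nodes, dataset versions connect a producing run to its consuming runs, and a breadth-first walk upstream collects the runs that did not complete. Those runs are the kind of context that could then be handed to an LLM for root-cause analysis.

```python
from collections import defaultdict, deque


class LineageGraph:
    """Runs and dataset versions as a graph (hypothetical, in-memory)."""

    def __init__(self):
        self.run_inputs = defaultdict(set)  # run_id -> dataset versions it read
        self.producer = {}                  # dataset version -> run_id that wrote it
        self.status = {}                    # run_id -> last known status

    def add_run(self, run_id, status, inputs=(), outputs=()):
        """Record a run, the dataset versions it read, and the ones it produced."""
        self.status[run_id] = status
        self.run_inputs[run_id].update(inputs)
        for version in outputs:
            self.producer[version] = run_id

    def upstream_runs(self, run_id):
        """Breadth-first walk from a run to every run it transitively depends on."""
        seen = {run_id}
        order = []
        queue = deque([run_id])
        while queue:
            current = queue.popleft()
            for version in self.run_inputs.get(current, ()):
                parent = self.producer.get(version)
                if parent is not None and parent not in seen:
                    seen.add(parent)
                    order.append(parent)
                    queue.append(parent)
        return order

    def root_cause_candidates(self, run_id):
        """Upstream runs that did not complete, nearest first."""
        return [r for r in self.upstream_runs(run_id)
                if self.status.get(r) != "COMPLETE"]


# Usage: a dataset version is a (name, version) pair in this sketch.
graph = LineageGraph()
graph.add_run("ingest", "ABORT", outputs=[("raw.orders", 7)])
graph.add_run("clean", "COMPLETE",
              inputs=[("raw.orders", 7)], outputs=[("clean.orders", 3)])
graph.add_run("report", "COMPLETE", inputs=[("clean.orders", 3)])

upstream = graph.upstream_runs("report")
candidates = graph.root_cause_candidates("report")
print(upstream, candidates)
```

Here the report run looks healthy on its own, but walking the graph shows it depends on a dataset version written by an aborted ingest run. Keying edges on dataset versions rather than dataset names is what lets the walk tie a specific run to the exact data it consumed.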