Data Engineering Podcast

High Performance And Low Overhead Graphs With KuzuDB

14 snips
Aug 18, 2025
Prashanth Rao, an AI engineer at KuzuDB, delves into the cutting-edge features of their embeddable graph database. He explains how KuzuDB tackles performance issues with innovative columnar storage and unique join algorithms. The conversation reveals KuzuDB's potential for enhancing graph applications, especially in edge computing and ephemeral workloads. Prashanth also discusses the growing interest in graph databases for AI integration and how Kuzu can seamlessly work with other data formats like Iceberg and Parquet.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Zero‑Copy Indexing Over External Tables

  • Kuzu can scan external table formats and ingest into its native indices, but zero‑copy modes are being explored to index without copying data.
  • Zero‑copy would keep data in the primary source while Kuzu maintains indices on top.
INSIGHT

Arrow And Parquet Drive Fast Ingestion

  • Kuzu leverages the Arrow/Parquet ecosystem to ingest large datasets efficiently via columnar batches and data frames.
  • Strong typing and Arrow integration let Kuzu apply compression and fast batch ingestion from Polars and Parquet.
INSIGHT

New Joins And Factorization Accelerate Queries

  • Columnar storage enables vectorized, morsel‑driven parallelism and factorized intermediate results to greatly speed graph queries.
  • Kuzu's planner fuses binary and worst‑case‑optimal joins to optimize both acyclic and cyclic queries.
Get the Snipd Podcast app to discover more snips from this episode
Get the app