
Data Engineering Podcast High Performance And Low Overhead Graphs With KuzuDB
14 snips
Aug 18, 2025 Prashanth Rao, an AI engineer at KuzuDB, delves into the cutting-edge features of their embeddable graph database. He explains how KuzuDB tackles performance issues with innovative columnar storage and unique join algorithms. The conversation reveals KuzuDB's potential for enhancing graph applications, especially in edge computing and ephemeral workloads. Prashanth also discusses the growing interest in graph databases for AI integration and how Kuzu can seamlessly work with other data formats like Iceberg and Parquet.
AI Snips
Chapters
Transcript
Episode notes
Zero‑Copy Indexing Over External Tables
- Kuzu can scan external table formats and ingest into its native indices, but zero‑copy modes are being explored to index without copying data.
- Zero‑copy would keep data in the primary source while Kuzu maintains indices on top.
Arrow And Parquet Drive Fast Ingestion
- Kuzu leverages the Arrow/Parquet ecosystem to ingest large datasets efficiently via columnar batches and data frames.
- Strong typing and Arrow integration let Kuzu apply compression and fast batch ingestion from Polars and Parquet.
New Joins And Factorization Accelerate Queries
- Columnar storage enables vectorized, morsel‑driven parallelism and factorized intermediate results to greatly speed graph queries.
- Kuzu's planner fuses binary and worst‑case‑optimal joins to optimize both acyclic and cyclic queries.
