Building Planetary-Scale Data Systems with Venice • Felix GV & Olimpiu Pop

Mar 3, 2026

Félix GV, former LinkedIn engineer who built the Venice planetary-scale derived data system, explains how Venice unbundles components like Kafka and RocksDB into independent distributed systems. He covers client caching patterns, rigorous chaos engineering and load tests, trade-offs of asynchronous writes and CAP theorem in multi-region deployments, and experiments integrating DuckDB for analytics.

Ask episode

AI Snips

Chapters

Books

Transcript

Episode notes

INSIGHT

Unbundled Architecture Makes Each Piece A Distributed System

Venice is built as an unbundled distributed database where each component (pub/sub, servers, control plane, clients) is its own distributed system.
Servers host RocksDB locally and offer an eager-cache client that embeds RocksDB in-app processes to act as follower replicas for lower latency.

ADVICE

Exercise Multi‑DC Failover With Realistic Peak Load

Regularly run aggressive load tests that concentrate traffic into a single data center to validate failover behavior under peak conditions.
Venice ran multi-data-center chaos tests several times a week, draining traffic to one DC during weekday morning peaks to expose weak components.

INSIGHT

Derived Data Systems Favor Asynchronous Ingestion

Venice is a derived data system where ingestion is asynchronous from pub/sub or batch jobs, optimizing for very high throughput rather than immediate visibility.
It supports mixed ingestion (batch + stream) and can orchestrate partial column refresh patterns for different latency needs.

Get the Snipd Podcast app to discover more snips from this episode

Get the app

This interview was recorded for GOTO Unscripted.
https://gotopia.tech

Check out more here:
https://gotopia.tech/articles/421

Félix GV - Current Interests: Multi-Planetary Databases, Data Sovereignty & Lifelogging
Olimpiu Pop - Technologist & Tech Journalist

RESOURCES
Félix
https://bsky.app/profile/felixgv.ninja
https://github.com/FelixGV
https://www.linkedin.com/in/felixgv

Olimpiu
https://x.com/olimpiupop
https://github.com/zroll
https://www.linkedin.com/in/olimpiupop

Links
https://venicedb.org
https://github.com/linkedin/venice
https://rocksdb.org
https://duckdb.org

DESCRIPTION
Félix GV, a former engineer at LinkedIn and architect of the Venice database system, discusses the complexity of building planetary-scale data systems. He explains Venice's unbundled architecture where each component—from Kafka-based pub/sub to RocksDB-powered servers—operates as an independent distributed system. Félix details their rigorous chaos engineering practices, including regular load tests that push data centers beyond normal capacity to ensure reliability.

The discussion covers fundamental distributed systems concepts like the CAP theorem and the trade-offs between consistency and availability in multi-region deployments. He also explains why Venice, as a derived data system, deliberately sacrifices strong consistency for high throughput and availability, and concludes by discussing their experimental integration of DuckDB for SQL-based analytics and data exploration capabilities.

RECOMMENDED BOOKS
Kasun Indrasiri & Danesh Kuruppu • gRPC: Up and Running • https://amzn.to/3sBGBJJ
Tomer Shiran, Jason Hughes & Alex Merced • Apache Iceberg: The Definitive Guide • https://amzn.to/488Z30k
William Smith • Arrow Flight Protocols and Practices • https://amzn.to/4o2Q2fd
Adi Polak • Scaling Machine Learning with Spark • https://amzn.to/3N9vx1H
Mark Needham, Michael Hunger & Michael Simons • DuckDB in Action • https://amzn.to/45QwSli
Simon Aubury & Ned Letcher • Getting Started with DuckDB • https://amzn.to/3VPk4q

Bluesky
Instagram
LinkedIn
Facebook

CHANNEL MEMBERSHIP BONUS
Join this channel to get early access to videos & other perks:
https://www.youtube.com/channel/UCs_tLP3AiwYKwdUHpltJPuA/join

Looking for a unique learning experience?
Attend the next GOTO conference near you! Get your ticket: gotopia.tech

SUBSCRIBE TO OUR YOUTUBE CHANNEL - new videos posted daily!