The Analytics Engineering Podcast

The Iceberg ecosystem today (w/ Anders Swanson)

24 snips
Mar 8, 2026
Anders Swanson, developer experience advocate at dbt Labs and longtime data practitioner, walks through the modern Iceberg ecosystem. He defines query engines, object stores, and catalogs. He explains internal vs external catalogs, the rise of a fourth namespace tier, metadata performance needs, phased Iceberg adoption, vended credentials, and cross-platform access challenges.
Ask episode
AI Snips
Chapters
Books
Transcript
Episode notes
INSIGHT

External Catalogs Introduce A Fourth Namespace Tier

  • External catalogs add a fourth namespace tier (catalog.database.schema.table) that vendors map into user-friendly database names.
  • This abstracts away storage details but introduces hard guarantees around metadata performance and uniqueness across catalogs.
INSIGHT

Metadata Performance Is Nonnegotiable

  • Metadata services must be extremely fast and resilient because information_schema queries and DDL checks are on every user path.
  • Vendors solving external catalogs often mirror or cache metadata to keep listing and discovery performant for users.
ADVICE

Adopt Iceberg In Phases Not All At Once

  • Expect phased adoption of Iceberg integration: naive object-store reads, REST catalog reads/writes, and federated multi-table schema mirroring.
  • Plan platform upgrades and user workflows to match each phase before enabling cross-platform sharing.
Get the Snipd Podcast app to discover more snips from this episode
Get the app