Data Engineering Podcast

From Bits to Tables: The Evolution of S3 Storage

71 snips
Aug 5, 2025
In this discussion, Andy Warfield, an Amazon storage enhancement expert, dives into the evolution of S3 storage. He explores the revolutionary functionalities of S3 Tables and Vectors, crucial for modern data management and analytics. Andy shares insights on how customer feedback has shaped these developments, improving performance for AI workloads. He also discusses the innovative applications of these features in industries like genomics and finance, along with the technical challenges faced in integrating advanced data types.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Understanding Vectors and S3 Vectors Role

  • Vectors represent data in high-dimensional space enabling similarity search and semantic retrieval.
  • S3 Vectors store and query arbitrary dimensional vectors, acting as building blocks linking varied applications and datasets.
INSIGHT

S3 Vectors Indexing Trade-offs

  • Vector indexes using graph traversal are unsuitable for high latency storage like S3.
  • S3 Vectors trade round trips for parallel data reads, balancing performance and cost for bursty and archival workloads.
INSIGHT

API Design Lessons from Iceberg

  • The S3 Tables team learned from Iceberg's evolution to unify file format and REST catalog APIs.
  • For S3 Vectors, starting with a REST API enables flexibility to evolve indexing without breaking clients.
Get the Snipd Podcast app to discover more snips from this episode
Get the app