
Data Engineering Podcast From Bits to Tables: The Evolution of S3 Storage
71 snips
Aug 5, 2025 In this discussion, Andy Warfield, an Amazon storage enhancement expert, dives into the evolution of S3 storage. He explores the revolutionary functionalities of S3 Tables and Vectors, crucial for modern data management and analytics. Andy shares insights on how customer feedback has shaped these developments, improving performance for AI workloads. He also discusses the innovative applications of these features in industries like genomics and finance, along with the technical challenges faced in integrating advanced data types.
AI Snips
Chapters
Transcript
Episode notes
Understanding Vectors and S3 Vectors Role
- Vectors represent data in high-dimensional space enabling similarity search and semantic retrieval.
- S3 Vectors store and query arbitrary dimensional vectors, acting as building blocks linking varied applications and datasets.
S3 Vectors Indexing Trade-offs
- Vector indexes using graph traversal are unsuitable for high latency storage like S3.
- S3 Vectors trade round trips for parallel data reads, balancing performance and cost for bursty and archival workloads.
API Design Lessons from Iceberg
- The S3 Tables team learned from Iceberg's evolution to unify file format and REST catalog APIs.
- For S3 Vectors, starting with a REST API enables flexibility to evolve indexing without breaking clients.
