Software Engineering Daily

Turbopuffer with Simon Hørup Eskildsen

103 snips
Sep 30, 2025
Simon Hørup Eskildsen, co-founder of TurboPuffer and former Shopify infrastructure engineer, joins security expert Gregor Vand to discuss the fascinating world of vector databases. They dive into how TurboPuffer was shaped by early AI experiments and the challenge of storage costs. Simon explains its design for lightning-fast queries and the importance of unique indexing strategies. He shares success stories from companies like Cursor and Notion, emphasizes a focus on commercial clients, and hints at exciting features in the pipeline, all while sporting a fun pixel-art aesthetic.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Choose Index Type By Storage Latency

  • Graph-based ANN works great in memory but fails on high-latency storage due to many random round trips.
  • Cluster/centroid approaches minimize round trips and suit disk and S3 access patterns for sub-second cold queries.
ADVICE

Treat Object Storage As Canonical And Prewarm

  • Design for a cache hierarchy: memory, disk, then object storage as the canonical source of truth.
  • Use pre-warming heuristics to reduce cold-query latency for user-facing queries.
ADVICE

Continuously Sample Production Queries

  • Measure real-world recall by sampling production queries rather than relying solely on academic benchmarks.
  • Alert and act if a customer's recall drops below thresholds like 90% to maintain quality.
Get the Snipd Podcast app to discover more snips from this episode
Get the app