Latent Space: The AI Engineer Podcast

Retrieval After RAG: Hybrid Search, Agents, and Database Design — Simon Hørup Eskildsen of Turbopuffer

802 snips
Mar 12, 2026
Simon Eskildsen, founder and CEO of Turbopuffer and former Shopify infra engineer, digs into why search for unstructured data broke old cost models. He gets into hybrid retrieval for code, agents making parallel tool calls, bold database design around object storage and NVMe, and the wild stories behind Readwise, Notion, and Cursor.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
ANECDOTE

Readwise Proved The Demand But Killed The Economics

  • Turbopuffer started after Simon Eskilden built Readwise article recommendations that worked well but would have raised infra cost from roughly $5k to $30k per month.
  • That pricing gap convinced him latent demand existed if vector search became about 10x cheaper.
INSIGHT

Why Turbopuffer Went All In On Object Storage

  • Simon Eskilden designed Turbopuffer around object storage and NVMe by minimizing round trips and inflating hot data from S3 into SSD and DRAM only when needed.
  • He says S3 strong consistency and modern NVMe made a fully new database architecture possible only recently.
ANECDOTE

The Notion Deal Triggered A Dark Fiber Detour

  • Turbopuffer delayed adding a separate consensus system by using GCP object-store compare-and-swap, then bet S3 would eventually catch up.
  • While serving Notion, Simon Eskilden even bought dark fiber and tuned TCP windows rather than add state in Zookeeper.
Get the Snipd Podcast app to discover more snips from this episode
Get the app