Latent Space: The AI Engineer Podcast

⚡️The Rise and Fall of the Vector DB Category

1850 snips
May 1, 2025
Jo Kristian Bergum, a seasoned search infrastructure expert with two decades at Yahoo and Fast Search & Transfer, dives deep into the evolution of vector databases. He discusses the surge in vector database popularity post-ChatGPT and the misconceptions surrounding embedding-based similarity search. The conversation explores the dynamic interplay between traditional search methods and embedding techniques. Additionally, Joe sheds light on the future of retrieval-augmented generation and the importance of knowledge graphs in AI development.
Ask episode
AI Snips
Chapters
Transcript
Episode notes

Choose Search System by Scale

  • Use PostgreSQL with pgvector for moderate scale vector search if you already use it as your database.
  • For critical search business needs, consider specialized search engines for better search quality.

Embeddings Blur Search and Recommendations

  • Embedding-based retrieval has long been integral to recommender systems and now converges with search technologies.
  • There's a layered approach of retrieving candidates and re-ranking them before final presentation.

Build Search with Hybrid Approach

  • Start search systems with classical keyword algorithms like BM25 to establish baselines.
  • Add embeddings and re-ranking layers as budget and latency allow, tuning sequence by use case.
Get the Snipd Podcast app to discover more snips from this episode
Get the app