AI Agents Podcast

Unlocking AI Vector Databases with James Luan, Zilliz CPO | EP 130

7 snips
Mar 27, 2026
James Luan, Co-founder and VP of Engineering at Zilliz, who helped build Milvus and vector database infrastructure. He explains why vector databases are central to modern AI, how retrieval augments LLMs, and the mechanics behind RAG, hallucinations, and long-term memory for agents. He also discusses scaling production-grade systems, MCP as a tool layer, and practical developer productivity wins.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Why Purpose Built Vector Databases Matter

  • Zilliz built Milvus as a purpose-built vector database because traditional relational systems couldn't understand high-dimensional embeddings.
  • James describes leveraging GPU and SIMD to accelerate dense compute workloads like reverse image search at scale.
ADVICE

Design Vector Systems For Compute And Cloud

  • Use infrastructure designed for vectors and cloud-native patterns to scale vector search cost-effectively.
  • James recommends GPU/CPU SIMD acceleration, Kubernetes deployment, and S3-backed storage to lower ops overhead.
ANECDOTE

Using Vector Search To Find Robot Failure Cases

  • Zilliz helped an embodied AI robotics customer find failure video clips by converting video metadata into embeddings and tags for search.
  • James recounts locating clips where robots missed a stop sign by querying video embeddings to collect fine-tuning data.
Get the Snipd Podcast app to discover more snips from this episode
Get the app