Data Engineering Podcast

From Data Models to Mind Models: Designing AI Memory at Scale

32 snips
Feb 22, 2026
Vasilije Markovich, founder of Cognee and former data engineer turned cognitive-science-informed entrepreneur, builds agentic memory and knowledge-layer systems. He discusses permanent vs session memory, graph+vector architectures, storage and latency trade-offs, metadata and decay strategies, trace-based scoring, multi-tenant isolation, and practical vertical uses like pharma, logistics, and security.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
ADVICE

Use Hot Session Memory For Low Latency Responses

  • Use session (hot) memory for low-latency agent interactions and sync to permanent memory when needed to avoid slow retrievals.
  • Cognee stores transformed embedding triplets in Redis for fast search and syncs to the permanent store for quality.
ADVICE

Start With Files And Upgrade Only When Needed

  • Start simple: use prompt templating or MD files for small projects and only move to vector/graph stores when you need complex relationships or reconciliation.
  • LensDB, Qdrant, Milvus, Neo4j and others are good next steps; LensDB can back up to S3 for fast iteration.
INSIGHT

Separate Graph Metadata From Embedding Content

  • Memory design splits into metadata/graph filters and embedding content; agents often operate mainly on embeddings with the graph storing source and relationships.
  • Cognee uses node sets, timestamps, and post-processing to add edges and enable traversals across concepts.
Get the Snipd Podcast app to discover more snips from this episode
Get the app