From Data Models to Mind Models: Designing AI Memory at Scale

32 snips

Feb 22, 2026

Vasilije Markovich, founder of Cognee and former data engineer turned cognitive-science-informed entrepreneur, builds agentic memory and knowledge-layer systems. He discusses permanent vs session memory, graph+vector architectures, storage and latency trade-offs, metadata and decay strategies, trace-based scoring, multi-tenant isolation, and practical vertical uses like pharma, logistics, and security.

Ask episode

AI Snips

Chapters

Transcript

Episode notes

ADVICE

Use Hot Session Memory For Low Latency Responses

Use session (hot) memory for low-latency agent interactions and sync to permanent memory when needed to avoid slow retrievals.
Cognee stores transformed embedding triplets in Redis for fast search and syncs to the permanent store for quality.

ADVICE

Start With Files And Upgrade Only When Needed

Start simple: use prompt templating or MD files for small projects and only move to vector/graph stores when you need complex relationships or reconciliation.
LensDB, Qdrant, Milvus, Neo4j and others are good next steps; LensDB can back up to S3 for fast iteration.

INSIGHT

Separate Graph Metadata From Embedding Content

Memory design splits into metadata/graph filters and embedding content; agents often operate mainly on embeddings with the graph storing source and relationships.
Cognee uses node sets, timestamps, and post-processing to add edges and enable traversals across concepts.

Get the Snipd Podcast app to discover more snips from this episode

Get the app