
Agents Hour Observational Memory: The Human-Inspired Memory System for AI Agents, with Tyler Barnes
Feb 20, 2026
Tyler Barnes, founding engineer at Mastra and creator of Observational Memory, explains a human-inspired memory system for AI that compresses conversations into dense, cacheable observations. They cover how it beats semantic recall, the reflector and observation mechanics, LongMemEval results, integration tips, and real-world benefits like stability and cost savings.
AI Snips
Cacheable Dense Observations Improve Recall
- Observational Memory combines stable prompt caching with higher accuracy than RAG-style semantic recall: it compresses conversations into dense observations and appends new messages after that stable prefix.
- Tyler reported ~84% with GPT-4 and ~94.87% with GPT-5 mini on LongMemEval, beating prior memory systems while keeping contexts cacheable.
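The cache-friendly layout described above can be sketched as follows. This is an illustrative reconstruction, not Mastra's actual API: the `Observation`, `Message`, and `buildContext` names are assumptions. The key property is that the observation block forms a byte-stable prefix that only changes when observations are rewritten, so providers' prompt caching can reuse it across turns.

```typescript
// Hypothetical sketch: compressed observations form a stable prefix,
// raw recent messages are appended after it. Names are illustrative.

interface Observation {
  text: string; // dense fact distilled from older conversation history
}

interface Message {
  role: "user" | "assistant";
  content: string;
}

function buildContext(observations: Observation[], recent: Message[]): string {
  // Stable prefix: observations change only when the reflector runs,
  // so the provider's prompt cache can be reused across turns.
  const prefix = observations.map((o) => `- ${o.text}`).join("\n");
  // Raw recent messages go after the cached prefix.
  const tail = recent.map((m) => `${m.role}: ${m.content}`).join("\n");
  return `Observations:\n${prefix}\n\nRecent messages:\n${tail}`;
}
```

Because new turns only extend the tail, the expensive-to-recompute part of the prompt stays identical between requests.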
Two-Tier Context With Periodic Reflection
- The system keeps two context buckets: raw recent messages and compressed observations for older history, then periodically runs a reflector to reorganize and prune observations.
- The reflector merges similar observations and drops low-value info, enabling graceful long-term forgetting while keeping a stable cache.
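The two-tier flow above can be sketched as a minimal in-memory model. Everything here is an assumption for illustration: the class name, the `maxRecent` threshold, and the crude string-based compression and merge rules stand in for what would really be LLM calls in a system like the one Tyler describes.

```typescript
// Minimal sketch of the two-tier design: raw recent messages in one
// bucket, compressed observations in the other, plus a periodic
// reflector that merges duplicates and prunes low-value entries.
// All names, thresholds, and heuristics are illustrative assumptions.

class ObservationalMemory {
  observations: string[] = [];
  recent: string[] = [];

  constructor(private maxRecent = 4) {}

  addMessage(msg: string): void {
    this.recent.push(msg);
    if (this.recent.length > this.maxRecent) {
      // Compress overflowed raw messages into one dense observation.
      // A real system would use an LLM here; we just concatenate.
      const old = this.recent.splice(0, this.recent.length - this.maxRecent);
      this.observations.push(`observed: ${old.join("; ")}`);
    }
  }

  reflect(): void {
    // Toy reflector: merge exact duplicates and drop very short
    // (low-value) observations, enabling gradual forgetting while
    // leaving the surviving prefix stable for caching.
    this.observations = [...new Set(this.observations)].filter(
      (o) => o.length > 12
    );
  }
}
```

Running the reflector only periodically, rather than on every turn, is what keeps the observation prefix stable between reflections.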
Idea Born From A Personal Coding Agent
- Tyler built the concept while making a personal coding agent that pinned many files and blew up the context window, inspiring a human-like observational approach.
- He converted long file reads into short observations to keep knowledge while drastically reducing token costs.
