Ep#54: MemER: Scaling Up Memory for Robot Control via Experience Retrieval

6 snips

Dec 17, 2025

Ajay Sridhar, a Robotics PhD student, and Jenny Pan, a visiting researcher, dive into MemER's groundbreaking work on robot memory. They discuss how robots can enhance decision-making by selecting crucial keyframes, improving long-horizon task execution like object search. Jenny explains the innovative training methods for keyframe selection, while Ajay shares insights into robust retry behaviors and the challenges of memory management in dynamic environments. Their vision includes transferable memories across robots, enhancing collaboration in robotic tasks.

Ask episode

AI Snips

Chapters

Transcript

Episode notes

INSIGHT

Hierarchical Memory Scales Robot Control

MemER uses a hierarchical policy: a high-level VLM selects and stores keyframes while a low-level policy executes short-horizon actions.
This lets the system solve long-horizon robotic tasks (object search, counting, dusting) by compressing relevant history into sparse snapshots.

ADVICE

Bound Context And Deduplicate Keyframes

Limit the VLM input to a bounded recent window (eight frames at 2 Hz) and only store nominated keyframes to avoid latency growth.
Use online deduplication (simple clustering) to consolidate repeated nominations into persistent keyframes.

ADVICE

Prefer Sparse Snapshots Over Dense History

Favor sparse, task-relevant snapshots over indiscriminate long histories to reduce training brittleness and covariate shift.
Sparse keyframes keep memory in-distribution during retries and make policies more robust.

Get the Snipd Podcast app to discover more snips from this episode

Get the app