RoboPapers

Ep#54: MemER: Scaling Up Memory for Robot Control via Experience Retrieval

6 snips
Dec 17, 2025
Ajay Sridhar, a Robotics PhD student, and Jenny Pan, a visiting researcher, dive into MemER's groundbreaking work on robot memory. They discuss how robots can enhance decision-making by selecting crucial keyframes, improving long-horizon task execution like object search. Jenny explains the innovative training methods for keyframe selection, while Ajay shares insights into robust retry behaviors and the challenges of memory management in dynamic environments. Their vision includes transferable memories across robots, enhancing collaboration in robotic tasks.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Hierarchical Memory Scales Robot Control

  • MemER uses a hierarchical policy: a high-level VLM selects and stores keyframes while a low-level policy executes short-horizon actions.
  • This lets the system solve long-horizon robotic tasks (object search, counting, dusting) by compressing relevant history into sparse snapshots.
ADVICE

Bound Context And Deduplicate Keyframes

  • Limit the VLM input to a bounded recent window (eight frames at 2 Hz) and only store nominated keyframes to avoid latency growth.
  • Use online deduplication (simple clustering) to consolidate repeated nominations into persistent keyframes.
ADVICE

Prefer Sparse Snapshots Over Dense History

  • Favor sparse, task-relevant snapshots over indiscriminate long histories to reduce training brittleness and covariate shift.
  • Sparse keyframes keep memory in-distribution during retries and make policies more robust.
Get the Snipd Podcast app to discover more snips from this episode
Get the app