Latent Space: The AI Engineer Podcast

SAM 3: The Eyes for AI — Nikhila & Pengchuan (Meta Superintelligence), ft. Joseph Nelson (Roboflow)

531 snips
Dec 18, 2025
Nikhila Ravi leads the Segment Anything project at Meta, with Pengchuan Zhang contributing as a researcher specializing in vision models. They discuss the groundbreaking SAM 3, which enables concept segmentation using natural language prompts. The conversation dives into the impressive real-time performance, the massive SACO benchmark of over 200k concepts, and how SAM 3 revolutionizes data annotation—reducing time from two minutes to just 25 seconds. Joseph Nelson from Roboflow shares insights on real-world applications in fields like cancer research and the automation of complex visual reasoning.
Ask episode
AI Snips
Chapters
Transcript
Episode notes

Data Engine Was The Key Multiplier

  • The data engine, not just the model, drove SAM3's step change by automating annotation and verification at scale.
  • Automating exhaustivity checks let them scale dataset size and diversity efficiently.

Roboflow's Real‑World Impact Metrics

  • Joseph Nelson shared Roboflow's stats: 106M smart polygons created and ~130 human years of labeling time saved.
  • Use cases range from cancer research to underwater trash cleanup and autonomous vehicle perception.

Use Few Positives And A Few Negatives

  • Fine‑tune with as few as ~10 examples and include a few negative examples to shift model priors quickly.
  • Use negative examples especially to teach presence/absence distinctions and reduce false positives.
Get the Snipd Podcast app to discover more snips from this episode
Get the app