"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis

Training the AIs' Eyes: How Roboflow is Making the Real World Programmable, with CEO Joseph Nelson

Apr 4, 2026
Joseph Nelson, Roboflow co-founder and CEO, talks about why computer vision still struggles in the real world. He gets into frontier model failures, the push from cloud APIs to edge models, and the open-source race shaped by Meta, NVIDIA, and China. They also explore world models, smart glasses, aesthetic judgment, and how vision could make everyday life programmable.
ANECDOTE

How Wimbledon Distills Vision To The Edge

  • Wimbledon replay systems use frontier models to label prior footage, then train smaller task-specific models that can run live at the court.
  • Joseph Nelson says the broadcast stack needed sub-10-millisecond performance, so SAM-style open-vocabulary models were too heavy despite strong accuracy.
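The workflow in the snip above — a heavy frontier model labels archival footage offline, then a small task-specific model runs live at the edge — can be sketched in miniature. This is a toy illustration, not Roboflow's or Wimbledon's actual pipeline: random 2-D points stand in for footage, a hard-coded rule stands in for the frontier labeler, and a nearest-centroid classifier stands in for the distilled edge model.

```python
import random

random.seed(0)

def frontier_labeler(x):
    """Stand-in for a large open-vocabulary model: accurate but too slow to run live."""
    return 1 if x[0] + x[1] > 1.0 else 0

# 1. Auto-label the archival "footage" offline with the heavy model.
archive = [(random.random(), random.random()) for _ in range(500)]
labeled = [(x, frontier_labeler(x)) for x in archive]

# 2. Train a tiny student on those labels: a nearest-centroid
#    classifier, which is trivially cheap at inference time.
def centroid(points):
    n = len(points)
    return (sum(p[0] for p in points) / n, sum(p[1] for p in points) / n)

c0 = centroid([x for x, y in labeled if y == 0])
c1 = centroid([x for x, y in labeled if y == 1])

def student(x):
    d0 = (x[0] - c0[0]) ** 2 + (x[1] - c0[1]) ** 2
    d1 = (x[0] - c1[0]) ** 2 + (x[1] - c1[1]) ** 2
    return 0 if d0 < d1 else 1

# 3. On held-out samples, the cheap student should agree with the
#    expensive teacher most of the time.
held_out = [(random.random(), random.random()) for _ in range(200)]
agreement = sum(student(x) == frontier_labeler(x) for x in held_out) / len(held_out)
```

The point of the pattern is that the teacher's accuracy is paid for once, offline, while the student meets the latency budget (sub-10 ms in the broadcast case Nelson describes) at the cost of handling only its one narrow task.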
INSIGHT

Why Vision Data Needs Depend On Variability

  • Data needs in vision depend more on scene variability and required reliability than on any fixed scaling law.
  • Joseph Nelson contrasts self-driving, which needs massive edge-case coverage, with factory inspection, where hundreds of images can already produce useful models in controlled settings.
INSIGHT

Why Open Source Vision Is A Geopolitical Race

  • Joseph Nelson says China has led visual AI more consistently than the US, while American open source depends heavily on Meta and increasingly NVIDIA.
  • He argues Roboflow regained US leadership in real-time segmentation and detection by building RF-DETR on Meta’s DINOv2 backbone.