Training General Robots for Any Task: Physical Intelligence’s Karol Hausman and Tobi Springenberg

147 snips

Jan 6, 2026

Guest

Tobi Springenberg

Guest

Karol Hausman

Karol Hausman and Tobi Springenberg from Physical Intelligence discuss the groundbreaking potential of robotic foundation models. They argue that the intelligence bottleneck, not hardware, limits robotics and explain their mission to create models capable of performing diverse tasks. The duo dives into their end-to-end learning approach, emphasizing recent improvements in reinforcement learning and real-world deployment. Insights into unexpected applications from open-sourced models and the aspiration for continual robot learning highlight a pivotal shift in intelligent machine design.

Ask episode

AI Snips

Chapters

Transcript

Episode notes

ANECDOTE

Teleoperation Shows Hardware Is Capable

Karol recounts teleoperated robots that could clean houses if a human controlled them, proving hardware can already do many tasks.
This example motivated focusing on intelligence as the core bottleneck for robotics progress.

INSIGHT

Three Axes For Scaling Robots

They judge progress by capability, generalization, and performance as separate scaling challenges.
Diversity of data is the primary path to generalization; deployment unlocks massive data collection.

INSIGHT

Vision-Language-Action Architecture

Their architecture extends vision-language models with an action expert to map images and instructions to robot actions.
They pre-train mostly on their own robotics data mixed with internet data and use large transformers as backbones.

Get the Snipd Podcast app to discover more snips from this episode

Get the app