Latent Space: The AI Engineer Podcast

World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI

640 snips
Dec 6, 2025
Pim DeWitte, the visionary Founder and CEO of General Intuition, shares his insights on the future of AI. He discusses turning down a $500M offer from OpenAI to focus on building world models using action-labeled game clips. Pim reveals how training on game highlights fosters superhuman abilities in AI agents, making them capable of real-time action predictions. He explains the importance of episodic memory in learning, and his ambitious goal for spatial-temporal models to revolutionize AI interactions by 2030.
Ask episode
AI Snips
Chapters
Books
Transcript
Episode notes

Frames-To-Actions Enables Video Transfer

  • GI labels internet video by predicting actions from frames, enabling transfer from games to real-world clips.
  • That lets any video become free training data once models map frames to action tokens.

World Models Handle Partial Observability

  • Their world models generate future frames conditioned on actions and handle camera shake, smoke, and partial observability.
  • Models maintain spatial consistency across view changes and different camera dynamics.

Protect Privacy With Action Overlays

  • Preserve privacy by mapping overlays to action labels instead of logging raw key presses.
  • Convert inputs into abstract actions for training while keeping user-level keys private.
Get the Snipd Podcast app to discover more snips from this episode
Get the app