Eye On A.I. cover image

#303 Fei-Fei Li: Spatial Intelligence, World Models & the Future of AI

Eye On A.I.

00:00

Multimodality vs. video-only inputs

Fei-Fei Li argues world models must be multimodal—video, audio, tactile, language and 3D layouts—not just text or video.

Play episode from 05:34
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app