Stefano Ermon

Associate professor at Stanford University and CEO of Inception; researcher and entrepreneur specializing in generative models and diffusion-based approaches for images, text, and code.

Top 5 podcasts with Stefano Ermon

Ranked by the Snipd community
64 snips
Feb 24, 2024 • 35min

Beyond Uncanny Valley: Breaking Down Sora

In this engaging discussion, Stefano Ermon, a leading Professor of Computer Science at Stanford, reveals the inner workings of OpenAI's groundbreaking Sora model for AI-generated video. He discusses the shift from GANs to diffusion models and the significance of high-quality training data. The conversation explores the uncanny valley and how Sora's capabilities could reshape our understanding of video compression and generation. Ermon also hints at the exciting future of personalized video content and its applications in various fields.
39 snips
Jan 4, 2026 • 52min

#310 Stefano Ermon: Why Diffusion Language Models Will Define the Next Generation of LLMs

Stefano Ermon, co-founder and CEO of Inception and former Stanford professor, explores diffusion language models and their potential to revolutionize AI. He explains how these models generate text in parallel, enhancing speed and cost efficiency compared to traditional methods. The discussion delves into the architecture's controllability and safety, the necessity of efficient inference for broader AI applications, and the exciting future of coding workflows and voice agents. Ermon emphasizes the role of these innovative models in shaping the next generation of generative intelligence.
31 snips
Feb 24, 2026 • 49min

Diffusion for Text: Why Mercury Could Make LLMs 10x Faster

Stefano Ermon, Stanford CS professor and founder of Inception Labs, explains Mercury — a diffusion approach that drafts full text then refines it. He discusses why diffusion can edit many tokens in parallel, how that reduces latency and GPU memory bottlenecks, which real-time applications benefit most, and the tradeoffs around quality, context length, and industry adoption.
29 snips
Mar 26, 2026 • 1h 3min

The Race to Production-Grade Diffusion LLMs with Stefano Ermon - #764

Stefano Ermon, Stanford associate professor and CEO of Inception, known for his work on generative models, discusses adapting diffusion methods from images to text and code. He covers the technical hurdles of discrete tokens, Mercury 2’s multi-token, low-latency inference, tradeoffs between denoising iterations and autoregressive sampling, real-world serving challenges, and where diffusion shines, such as editing and fast voice/agent loops.
19 snips
May 24, 2024 • 1h 6min

ARCHIVE: Open Models (with Arthur Mensch) and Video Models (with Stefano Ermon)

Guests Arthur Mensch and Stefano Ermon discuss open foundation models and video models, emphasizing the importance of neutrality in technology and the regulation of AI applications. They explore the evolution of language models, the advantages of open-source collaboration, and the future prospects of specialized AI models. They also delve into the challenges and advancements in generating longer videos, and how physics aids accurate prediction and data compression in 3D models.
