The Information Bottleneck cover image

Diffusion LLM & Why the Future of AI Won't Be Autoregressive - Stefano Ermon (Stanford /Inception)

The Information Bottleneck

00:00

Temperature, sampling and inference decoupling

Stefano explains temperature subtleties for diffusion LLMs and the advantage of decoupling training from inference.

Play episode from 07:32
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app