Complex Systems with Patrick McKenzie (patio11) cover image

Inference engineering and the real-world deployment of LLMs, with Philip Kiely

Complex Systems with Patrick McKenzie (patio11)

00:00

Model Architectures and Size Tradeoffs

They compare autoregressive transformers and diffusion models, and discuss parameter counts, quantization and real-world sizes.

Play episode from 07:21
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app