Complex Systems with Patrick McKenzie (patio11) cover image

Inference engineering and the real-world deployment of LLMs, with Philip Kiely

Complex Systems with Patrick McKenzie (patio11)

00:00

Test-Time Compute and Scaling Inference

Philip describes test-time compute, chain-of-thought, and how extra inference rounds improve generalization in new domains.

Play episode from 41:21
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app