Inference engineering and the real-world deployment of LLMs, with Philip Kiely

Complex Systems with Patrick McKenzie (patio11)

Local Models, Routing, and Hybrid Architectures

Patrick McKenzie and Philip Kiely discuss routing requests to cheap local models, when to call state-of-the-art models instead, and where domain-specialized LLMs fit.

Play episode from 45:27