Tool Use - AI Conversations

How To Build a Hybrid AI System with Any-LLM (ft Nathan Brake)

17 snips

Mar 3, 2026

Nathan Brake, a Mozilla AI engineer and creator of Any-LLM, builds tools for talking to any LLM and hosting models locally. He discusses rapid model switching, running models on local hardware, cost and privacy benefits of small open-source models. He demos Any-LLM tooling, the Platform vault for keys and budgets, and his Porch Songs app that dogfoods model behavior.

Ask episode

AI Snips

Chapters

Transcript

Episode notes

INSIGHT

Any-LLM Uses Provider SDKs To Reduce Fragility

Any-LLM is a thin Python SDK that standardizes calls to many providers by delegating to each provider's official SDK for correctness and quick updates.
Nathan built it to avoid brittle patches across integrations and to surface provider SDK breaking changes fast via integration tests.

ADVICE

Try Smaller Models Before Locking In Big Ones

Regularly switch models during development to find smaller, cheaper models that meet your needs instead of defaulting to the biggest model.
Nathan notes smaller models now can match last year's larger models, saving cost, latency, and enabling self-hosting.

INSIGHT

Three-Part Stack With Privacy-Preserving Key Vault

Any-LLM comprises three pieces: language SDKs, a gateway proxy for enterprise controls, and the Any-LLM Platform vault for encrypted key management and token tracking.
The platform tracks usage and budgets without routing or seeing prompt content by encrypting user keys locally.

Get the Snipd Podcast app to discover more snips from this episode