
Gradient Dissent: Conversations on AI Providing Greater Access to LLMs with Brandon Duderstadt, Co-Founder and CEO of Nomic AI
Jul 27, 2023

Episode notes
Small Models Win On Cost And Privacy
- Smaller models will dominate many production roles because they offer better cost and privacy tradeoffs.
- Domain-specific models plus careful data curation beat brute-force large models for many tasks.
Fine-Tune Efficiently With QLoRA
- Use QLoRA or other adapter methods to fine-tune efficiently on a quantized base model.
- Prefer cheap adapter training before full-weight fine-tuning to cut compute costs dramatically.
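The adapter idea behind QLoRA can be sketched in plain NumPy (an illustrative sketch, not Nomic's code; the layer sizes and rank below are hypothetical): the base weight matrix `W` stays frozen, and only a low-rank update `B @ A` is trained, shrinking the trainable parameter count from `d_out * d_in` to `r * (d_in + d_out)`.

```python
import numpy as np

# LoRA-style adapter sketch (illustrative; layer sizes and rank are assumptions).
rng = np.random.default_rng(0)
d_in, d_out, r = 512, 512, 8            # hypothetical layer width and adapter rank

W = rng.standard_normal((d_out, d_in))  # frozen base weights (quantized in QLoRA)
A = rng.standard_normal((r, d_in)) * 0.01
B = np.zeros((d_out, r))                # B starts at zero, so the adapter is a no-op

def forward(x, scale=1.0):
    """Adapted layer: y = W x + scale * B (A x). Only A and B are trained."""
    return W @ x + scale * (B @ (A @ x))

x = rng.standard_normal(d_in)
# With B = 0 the adapted layer exactly reproduces the frozen base layer.
assert np.allclose(forward(x), W @ x)

full_params = d_out * d_in
adapter_params = r * (d_in + d_out)
print(f"trainable: {adapter_params} vs full fine-tune: {full_params}")
```

Here the adapter trains about 3% of the layer's weights (8,192 vs 262,144), which is why adapter training is so much cheaper than full-weight fine-tuning.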
Quantization Trades Few Bits For Big Gains
- Quantization compresses model weights to lower precision (e.g., 4-bit or 8-bit) to drastically reduce memory and compute.
- Intermediate quantization levels keep most of a model's performance while enabling order-of-magnitude speedups.
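The memory/accuracy tradeoff above can be sketched with simple symmetric per-tensor quantization (an illustrative assumption; production schemes like GPTQ or NF4 use per-group scales and non-uniform bins): round each weight to an integer grid, store only the integers plus one scale, and dequantize on the fly.

```python
import numpy as np

# Symmetric per-tensor quantization sketch (illustrative, not a specific library's scheme).
def quantize(w, bits):
    qmax = 2 ** (bits - 1) - 1                  # 127 for 8-bit, 7 for 4-bit
    scale = np.abs(w).max() / qmax              # one fp scale for the whole tensor
    q = np.clip(np.round(w / scale), -qmax, qmax).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
w = rng.standard_normal(4096).astype(np.float32)  # stand-in for a weight tensor

for bits in (8, 4):
    q, scale = quantize(w, bits)
    err = np.abs(dequantize(q, scale) - w).mean()
    print(f"{bits}-bit: {32 // bits}x smaller than fp32, mean abs error {err:.4f}")
```

The 8-bit version is 4x smaller than fp32 with small reconstruction error; 4-bit is 8x smaller with a larger but often tolerable error, which is the tradeoff that makes quantized local models practical.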
