How AI Is Built

#005 Building Reliable LLM Applications, Production-Ready RAG, Data-Driven Evals

16 snips

May 3, 2024

Creators of Ragas, Shahul and Jithin, discuss challenges in building LLM applications, emphasizing the importance of evaluation, data quality, and continuous RAG evolution. Practical takeaways include starting with a solid testing strategy and embracing synthetic data to automate test data set creation.

Ask episode

AI Snips

Chapters

Transcript

Episode notes

ADVICE

Start With Test Data And Metrics

Start evaluation with a proper test dataset and iterate like traditional ML workflows.
Use Ragas to automate test data generation and metric-driven validation before production.

INSIGHT

Open Models Need Post-Training Work

Open-source foundation models need significant post-training to be useful for specific applications.
Investing in fine-tuning, alignment, or continued pretraining yields practical gains over out-of-the-box use.

ADVICE

Match Training Method To The Goal

Choose post-training techniques (fine-tuning, DPO) for style and alignment and continuous pretraining to change model knowledge.
Use smaller fine-tuned models to cheaply distill tasks like synthetic data generation.

Get the Snipd Podcast app to discover more snips from this episode