How AI Is Built

#005 Building Reliable LLM Applications, Production-Ready RAG, Data-Driven Evals

16 snips
May 3, 2024
Creators of Ragas, Shahul and Jithin, discuss challenges in building LLM applications, emphasizing the importance of evaluation, data quality, and continuous RAG evolution. Practical takeaways include starting with a solid testing strategy and embracing synthetic data to automate test data set creation.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
ADVICE

Start With Test Data And Metrics

  • Start evaluation with a proper test dataset and iterate like traditional ML workflows.
  • Use Ragas to automate test data generation and metric-driven validation before production.
INSIGHT

Open Models Need Post-Training Work

  • Open-source foundation models need significant post-training to be useful for specific applications.
  • Investing in fine-tuning, alignment, or continued pretraining yields practical gains over out-of-the-box use.
ADVICE

Match Training Method To The Goal

  • Choose post-training techniques (fine-tuning, DPO) for style and alignment and continuous pretraining to change model knowledge.
  • Use smaller fine-tuned models to cheaply distill tasks like synthetic data generation.
Get the Snipd Podcast app to discover more snips from this episode
Get the app