

Caterina Constantinescu
Expert in Large Language Models (LLMs)
Best podcasts with Caterina Constantinescu
Ranked by the Snipd community

Aug 18, 2023 • 33min
706: Large Language Model Leaderboards and Benchmarks
Caterina Constantinescu discusses Large Language Models (LLMs), leaderboard comparisons, evaluation challenges, dataset contamination, and platforms like HELM and Chatbot Arena. Learn about LAMA 2, benchmark evolution, user preferences in chatbots, human feedback for model improvement, and the impact of perception on model evaluations.


