
706: Large Language Model Leaderboards and Benchmarks
Super Data Science: ML & AI Podcast with Jon Krohn
00:00
Intro
In this chapter, Katarina Konstantinescu talks about leaderboards for comparing the quality of open-source and commercial large language models. The discussion touches on the advantages and challenges of evaluating these models, as well as Katarina's journey from psychology to data science and involvement in data science meetups.
Transcript


