Super Data Science: ML & AI Podcast with Jon Krohn cover image

706: Large Language Model Leaderboards and Benchmarks

Super Data Science: ML & AI Podcast with Jon Krohn

00:00

Intro

In this chapter, Katarina Konstantinescu talks about leaderboards for comparing the quality of open source and commercial large language models. The discussion touches on the advantages and challenges of evaluating these models, as well as Katarina's journey from psychology to data science and their involvement in data science meetups.

Play episode from 00:00
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app