The PhD students who became the judges of the AI industry

Equity

Benchmarking agents, coding, and expert leaderboards

Wei-Lin explains Arena's move to agent evaluations, CoArena for coding agents, and expert leaderboards for legal and medical use.

Play episode from 19:54
