AI and "Do No Harm"
JAMA+ AI Conversations
00:00
What the live leaderboard does
David details the benchmark design: real specialist questions, gold‑standard answers, and ranking of over 30 models.
Play episode from 03:38
Transcript
David details the benchmark design: real specialist questions, gold‑standard answers, and ranking of over 30 models.