Latent Space: The AI Engineer Podcast cover image

Artificial Analysis: Independent LLM Evals as a Service — with George Cameron and Micah-Hill Smith

Latent Space: The AI Engineer Podcast

00:00

Why They Run Their Own Evals

George describes labs' prompting differences, cherry-picking, and the need for consistent independent evaluation.

Play episode from 05:50
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app