How to design and validate a custom LLM judge

Elena explains turning human labelling criteria into judge prompts and validating judges by comparing to human labels on sample sets.

Play episode from 11:38

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!