AI Evals Explained Simply by Ankit Shukla

The Growth Podcast

Limitations of BLEU/ROUGE and Newer Judging

Ankit explains why traditional BLEU/ROUGE metrics often fail and why LLM-judge prompts are needed for nuance.
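To illustrate the kind of failure the clip describes, here is a minimal sketch in Python. It does not use real BLEU or ROUGE; `unigram_precision` is a hypothetical, simplified stand-in (clipped word overlap, no brevity penalty, no higher-order n-grams), and the example sentences are invented for illustration. It shows only the general idea that word-overlap scoring can penalize a faithful paraphrase while rewarding a sentence that reverses the meaning.

```python
from collections import Counter


def unigram_precision(candidate: str, reference: str) -> float:
    """Clipped unigram precision: a simplified stand-in for an overlap metric.

    This is NOT full BLEU or ROUGE; it only counts shared words.
    """
    cand_tokens = candidate.lower().split()
    if not cand_tokens:
        return 0.0
    ref_counts = Counter(reference.lower().split())
    cand_counts = Counter(cand_tokens)
    # Each candidate word is credited at most as often as it appears in the reference.
    overlap = sum(min(count, ref_counts[tok]) for tok, count in cand_counts.items())
    return overlap / len(cand_tokens)


# Invented example sentences (assumptions, not from the episode).
reference = "the refund was approved"
paraphrase = "your money will be returned"      # same meaning, no shared words
contradiction = "the refund was not approved"   # opposite meaning, 4 of 5 words shared

paraphrase_score = unigram_precision(paraphrase, reference)
contradiction_score = unigram_precision(contradiction, reference)

print(f"paraphrase:    {paraphrase_score:.2f}")
print(f"contradiction: {contradiction_score:.2f}")
```

In this toy case the contradiction outscores the paraphrase, which is the sort of nuance an overlap count cannot see and a judging prompt is meant to catch.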

Play episode from 32:33
