Optimizing Agent Behavior in Production with Gideon Mendels

Software Engineering Daily

LLM-as-judge tradeoffs and costs

They discuss LLM judges' fuzziness, expense, and why eval investment still outperforms ad-hoc production rollouts.

