SE Radio 710: Marc Brooker on Spec-Driven AI Dev

Software Engineering Radio - the podcast for professional software developers

Evaluating Models and Agent Configurations

Marc describes offline evaluations, LLM-as-judge, and gradual online A/B testing to validate model and prompt changes.

Play episode from 41:15
