MLOps.community  cover image

Operationalizing AI Agents: From Experimentation to Production // Databricks Roundtable

MLOps.community

00:00

Eval-Driven Development and Testing

Samraj and Ben describe equating evals to unit tests and using judges, CI, and telemetry to ensure quality.

Play episode from 25:40
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app