The Growth Podcast

Evals are the new PRD. Here is the playbook with the CEO of the leader in the space (Ankur Goyal, Founder and CEO, Braintrust)

Mar 20, 2026
Ankur Goyal, founder and CEO of Braintrust, leads a major eval platform used by top AI product teams. He explains why evals should start the product process and how to build data-task-score evals, and he gives a live demo of creating an eval from scratch. The conversation covers scoring design, connecting evals to tools like Linear, and using failing evals as a roadmap for improvement.
INSIGHT

Vibe Checks Are Early Evals

  • Vibe checks are a form of evaluation using your brain as the scoring function.
  • Manual checks work at an early stage but scale poorly as product usage grows; running consistent evals at that point requires software and process.
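The data-task-score structure mentioned in the episode description can be sketched in a few lines. This is a minimal illustration, not Braintrust's API: the names `Case`, `run_eval`, `toy_task`, and `exact_match` are hypothetical, the task is a stand-in for a real model call, and exact match is just one possible scoring function replacing the human "vibe check".

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Case:
    """One row of eval data: an input and the output we expect (hypothetical shape)."""
    input: str
    expected: str


def run_eval(
    data: List[Case],
    task: Callable[[str], str],
    score: Callable[[str, str], float],
) -> float:
    """Run the task on every case, score each output, and return the mean score."""
    scores = [score(task(case.input), case.expected) for case in data]
    return sum(scores) / len(scores)


def toy_task(text: str) -> str:
    """Stand-in for the system under test (for example, a model call)."""
    return text.strip().lower()


def exact_match(output: str, expected: str) -> float:
    """Scoring function: 1.0 on an exact match, 0.0 otherwise."""
    return 1.0 if output == expected else 0.0


# Data: the third case is written to fail, so it shows up as a gap to fix.
data = [
    Case("  Hello ", "hello"),
    Case("WORLD", "world"),
    Case("Evals", "evals!"),
]

mean_score = run_eval(data, toy_task, exact_match)
print(f"mean score: {mean_score:.2f}")
```

Because the scorer is a function rather than a person, the same cases can be rerun unchanged whenever the task implementation changes.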
INSIGHT

Evals Outlast Model Churn

  • Evals are a durable investment that outlast model and architectural churn.
  • Encoding user needs as evals preserves product intent across frequent model swaps and agent rewrites.
INSIGHT

Distance From Users Determines Eval Need

  • The farther you are from the end user, the more critical structured evals become.
  • Teams like Anthropic can rely on internal feedback loops; healthcare-focused apps cannot and need formal evals with domain experts.