DevOps Paradox

DOP 341: AI Widened the Highway but Nobody Rebuilt the Bridge

Mar 11, 2026
Trevor Stuart, co-founder of Split.io and head of Feature Management & Experimentation at Harness, has deep experience with feature flags and experimentation. He discusses AI-written code creating a six-lane highway into a two-lane bridge of reviews and delivery. Teams are embedding AI configs into flags and A/B testing agents in production. Culture, flag lifecycle, and running revenue-driving experiments receive special focus.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
ANECDOTE

Kiosk Experiments Generated Millions

  • A major fast food chain used experiments on kiosk UI wording and placement to generate several million dollars of incremental sales.
  • Trevor uses this to show execs accept data when experiments clearly move revenue.
INSIGHT

Fear Of Failure Kills Experimentation

  • A core cultural barrier to experimentation is fear of admitting failure, which stops teams after one failed test.
  • Trevor says teams that quit after a first failed experiment lose the learning loop and stop running the next 30 experiments.
ADVICE

Run Prompt Tests Inside Feature Flag Configs

  • Put prompts, token limits and model settings into feature flag configs so you can A/B test agents in production safely.
  • Trevor reports teams run different prompt variants to 5% of traffic to compare hallucination and business impact.
Get the Snipd Podcast app to discover more snips from this episode
Get the app