
DevOps Paradox DOP 341: AI Widened the Highway but Nobody Rebuilt the Bridge
Mar 11, 2026
Trevor Stuart, co-founder of Split.io and head of Feature Management & Experimentation at Harness, has deep experience with feature flags and experimentation. He discusses AI-written code creating a six-lane highway into a two-lane bridge of reviews and delivery. Teams are embedding AI configs into flags and A/B testing agents in production. Culture, flag lifecycle, and running revenue-driving experiments receive special focus.
AI Snips
Chapters
Transcript
Episode notes
Kiosk Experiments Generated Millions
- A major fast food chain used experiments on kiosk UI wording and placement to generate several million dollars of incremental sales.
- Trevor uses this to show execs accept data when experiments clearly move revenue.
Fear Of Failure Kills Experimentation
- A core cultural barrier to experimentation is fear of admitting failure, which stops teams after one failed test.
- Trevor says teams that quit after a first failed experiment lose the learning loop and stop running the next 30 experiments.
Run Prompt Tests Inside Feature Flag Configs
- Put prompts, token limits and model settings into feature flag configs so you can A/B test agents in production safely.
- Trevor reports teams run different prompt variants to 5% of traffic to compare hallucination and business impact.
