S3E21 - A/B Testing Statistics Concepts Experimenters must know with Ronny Kohavi

53 snips

Jun 4, 2023

Data scientist Ronny Kohavi discusses A/B testing statistics concepts all experimenters must know, including the Overall Evaluation Criterion, Twyman's law, and the amount of traffic needed for A/B tests. He shares insights on handling tests that turn out to be wrong and presenting experiment results to the CEO of Amazon. The podcast also highlights the benefits of AV tests and variety in designs for experimentation and CRO, while cautioning against the pitfalls of long-term testing.

Ask episode

AI Snips

Chapters

Transcript

Episode notes

INSIGHT

Most Experiment Ideas Fail More Often Than You Think

Most A/B test ideas fail frequently, revealing that teams overestimate idea quality.
Ronny Kohavi observed 60–70% failure at Microsoft, ~85% at Bing, and 92% at Airbnb, showing failure is widespread even in big tech.

ADVICE

Automate Trust Tests In Your Experiment Platform

Build automated trust checks into your experimentation platform to detect common bugs and validate results.
Kohavi recommends SRM checks, A/A tests, p-value uniformity tests, and correct variance methods for ratio metrics.

ANECDOTE

Removing Jeff Bezos's Favorite Feature Turned Out Better

Jeff Bezos loved bottom-of-page deals, but removing them improved performance and won the experiment.
Kohavi recounts removing the slow feature and Bezos accepted the data-driven recommendation to remove it.

Get the Snipd Podcast app to discover more snips from this episode

Get the app