Debugging AI Products: From Data Leakage to Evals with Hamel Husain

43 snips

Oct 2, 2025

Hamel Husain, a machine learning engineer with over 25 years of experience at GitHub and Airbnb, dives deep into the intricacies of debugging AI products. He shares insights from his work on forecasting Airbnb guest growth, highlighting challenges like data leakage. The conversation uncovers techniques for error analysis in machine learning, the importance of synthetic data, and the pitfalls of AI-generated outputs like hallucinations. Hamel emphasizes the need for systematic improvement and presents practical tips for enhancing AI evaluations.

Ask episode

AI Snips

Chapters

Transcript

Episode notes

INSIGHT

ML Is Prediction, Not Magic

Machine learning is fundamentally predictive: classification, recommendation, forecasting.
The core challenge is making models generalize to unseen data, not complexity of algorithms.

ANECDOTE

Nurture Boss: An AI Leasing Assistant

Nurture Boss built an AI leasing assistant for apartment management with tools and SMS/voice.
The small startup launched agentic features like tour scheduling and resident communications.

ADVICE

Use Targeted Synthetic Data To Test Edges

Use synthetic data targeted at observed errors to explore boundaries and frequency.
Generate variations (formats, vagueness, edge cases) and run them as test cases against your agent.

Get the Snipd Podcast app to discover more snips from this episode

Get the app