Just Now Possible

Debugging AI Products: From Data Leakage to Evals with Hamel Husain

43 snips
Oct 2, 2025
Hamel Husain, a machine learning engineer with over 25 years of experience at GitHub and Airbnb, dives deep into the intricacies of debugging AI products. He shares insights from his work on forecasting Airbnb guest growth, highlighting challenges like data leakage. The conversation uncovers techniques for error analysis in machine learning, the importance of synthetic data, and the pitfalls of AI-generated outputs like hallucinations. Hamel emphasizes the need for systematic improvement and presents practical tips for enhancing AI evaluations.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

ML Is Prediction, Not Magic

  • Machine learning is fundamentally predictive: classification, recommendation, forecasting.
  • The core challenge is making models generalize to unseen data, not complexity of algorithms.
ANECDOTE

Nurture Boss: An AI Leasing Assistant

  • Nurture Boss built an AI leasing assistant for apartment management with tools and SMS/voice.
  • The small startup launched agentic features like tour scheduling and resident communications.
ADVICE

Use Targeted Synthetic Data To Test Edges

  • Use synthetic data targeted at observed errors to explore boundaries and frequency.
  • Generate variations (formats, vagueness, edge cases) and run them as test cases against your agent.
Get the Snipd Podcast app to discover more snips from this episode
Get the app