Vanishing Gradients

9: AutoML, Literate Programming, and Data Tooling Cargo Cults

Jul 19, 2022
ANECDOTE

Copy‑Paste Production Pipeline

  • At Airbnb, Hamel discovered a production pipeline in which the coefficients from an R script were copy-pasted into Excel and then into Airflow SQL.
  • That messy workflow motivated him to focus on ML tooling and better production practices.
ADVICE

Automate Repetitive Work With CI/CD

  • Use CI/CD (like GitHub Actions) to automate repetitive tasks: tests, builds, docs and transforms triggered by repo events.
  • Automate publishing (e.g., convert notebooks to PDFs or blog posts) so engineers focus on code and content, not manual steps.
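As a rough illustration of the publishing advice above, here is a minimal sketch of a script that a CI job could run on each push to convert every notebook in a folder. It assumes the `jupyter nbconvert` command-line tool is installed on the CI runner; the function names, folder names, and the choice of Markdown output are hypothetical and not taken from the episode.

```python
import subprocess
from pathlib import Path


def nbconvert_command(notebook, out_dir, fmt="markdown"):
    """Build the `jupyter nbconvert` invocation for one notebook (hypothetical helper)."""
    return [
        "jupyter", "nbconvert",
        "--to", fmt,                    # e.g. "markdown" for a blog post, "pdf" for a PDF
        "--output-dir", str(out_dir),
        str(notebook),
    ]


def publish_all(src_dir, out_dir, fmt="markdown", run=subprocess.run):
    """Convert every *.ipynb file in src_dir; return the notebook names processed.

    `run` is injectable so the function can be exercised without Jupyter installed.
    """
    converted = []
    for notebook in sorted(Path(src_dir).glob("*.ipynb")):
        # check=True makes a failed conversion raise, which fails the CI job.
        run(nbconvert_command(notebook, out_dir, fmt), check=True)
        converted.append(notebook.name)
    return converted
```

A CI workflow step would then only need to call something like `publish_all("notebooks", "site")` whenever the repository changes, so nobody has to run the conversion by hand.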
ADVICE

Ship Docs And Tests From Your Notebook

  • Try literate programming (nbdev) so code, documentation and tests live together in notebooks and stay up to date.
  • Use literate systems to encourage documentation and testing as natural parts of development.
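The idea of keeping code, documentation, and tests in one place can be sketched without nbdev. The example below swaps in Python's standard-library `doctest` module as a stand-in, not nbdev itself: the usage example in the docstring is both the documentation and the test. The `scale` function and the `run_embedded_tests` helper are hypothetical names chosen for illustration.

```python
import doctest


def scale(values, factor):
    """Multiply every value by ``factor``.

    The example below documents the function and is also executed as a test:

    >>> scale([1, 2, 3], 2)
    [2, 4, 6]
    """
    return [v * factor for v in values]


def run_embedded_tests():
    """Run the docstring examples of `scale`; return (failed, attempted)."""
    finder = doctest.DocTestFinder()
    runner = doctest.DocTestRunner(verbose=False)
    for test in finder.find(scale, "scale", globs={"scale": scale}):
        runner.run(test)
    results = runner.summarize(verbose=False)
    return results.failed, results.attempted
```

If the behaviour of `scale` changes and the docstring is not updated, the embedded example fails, which is the same pressure toward current documentation that the notebook-based approach aims for.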