Software Engineering Radio - the podcast for professional software developers

SE Radio 641: Catherine Nelson on Machine Learning in Data Science

10 snips
Nov 6, 2024
Catherine Nelson, a freelance data scientist and author of "Software Engineering for Data Scientists," dives into the collaboration between data scientists and software engineers in the realm of machine learning. She discusses the essential skills for data scientists, the pivotal role of notebooks in workflows, and the distinct responsibilities in machine learning projects. Nelson emphasizes the importance of data preprocessing, model evaluation, and the balance between technical success and business value, shedding light on the complexities of creating effective machine learning pipelines.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

ML vs. AI

  • Machine learning models solve one particular problem, while AI models can solve many.
  • AI models offer broader applicability across different tasks.
ADVICE

Notebooks vs. Git

  • Jupyter Notebooks are great for initial exploration and data interaction.
  • Refactor into a Git repository when retraining models or needing robustness.
INSIGHT

Data Scientist vs. ML Engineer

  • Data scientists explore problems and determine ML suitability.
  • Machine learning engineers take over for productionization and monitoring.
Get the Snipd Podcast app to discover more snips from this episode
Get the app