
Software Engineering Radio - the podcast for professional software developers SE Radio 641: Catherine Nelson on Machine Learning in Data Science
10 snips
Nov 6, 2024 Catherine Nelson, a freelance data scientist and author of "Software Engineering for Data Scientists," dives into the collaboration between data scientists and software engineers in the realm of machine learning. She discusses the essential skills for data scientists, the pivotal role of notebooks in workflows, and the distinct responsibilities in machine learning projects. Nelson emphasizes the importance of data preprocessing, model evaluation, and the balance between technical success and business value, shedding light on the complexities of creating effective machine learning pipelines.
AI Snips
Chapters
Transcript
Episode notes
ML vs. AI
- Machine learning models solve one particular problem, while AI models can solve many.
- AI models offer broader applicability across different tasks.
Notebooks vs. Git
- Jupyter Notebooks are great for initial exploration and data interaction.
- Refactor into a Git repository when retraining models or needing robustness.
Data Scientist vs. ML Engineer
- Data scientists explore problems and determine ML suitability.
- Machine learning engineers take over for productionization and monitoring.

