Vanishing Gradients

Episode 19: Privacy and Security in Data Science and Machine Learning

12 snips
Aug 14, 2023
Hugo chats with Katharine Jarmul, a Principal Data Scientist at Thoughtworks Germany, specializing in privacy and ethics in data workflows. They dive into the vital distinctions between data privacy and security, demystifying common misconceptions. Katharine highlights the impact of GDPR and CCPA, and explores advanced concepts like federated learning and differential privacy. They also tackle real-world issues like privacy attacks and the ethical responsibilities of data scientists, making a compelling case for prioritizing privacy in data practices.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Marketing Faces The Biggest Privacy Shift

  • Marketing will face the largest disruption from privacy regulation and thus needs new technical patterns.
  • Privacy can become a competitive advantage rather than just a compliance cost.
INSIGHT

Differential Privacy Balances Individual And Aggregate Risk

  • Differential privacy bounds the information gain about any individual while preserving aggregate insights.
  • Local DP adds noise on-device and needs massive scale to recover signal.
ANECDOTE

Apple's Emoji Experiment With Local DP

  • Apple used local differential privacy on phones to gather emoji statistics and then aggregated with device identifiers removed.
  • They found language-specific emoji differences, showing how local DP can work at massive scale.
Get the Snipd Podcast app to discover more snips from this episode
Get the app