
Vanishing Gradients Episode 19: Privacy and Security in Data Science and Machine Learning
12 snips
Aug 14, 2023 Hugo chats with Katharine Jarmul, a Principal Data Scientist at Thoughtworks Germany, specializing in privacy and ethics in data workflows. They dive into the vital distinctions between data privacy and security, demystifying common misconceptions. Katharine highlights the impact of GDPR and CCPA, and explores advanced concepts like federated learning and differential privacy. They also tackle real-world issues like privacy attacks and the ethical responsibilities of data scientists, making a compelling case for prioritizing privacy in data practices.
AI Snips
Chapters
Transcript
Episode notes
Marketing Faces The Biggest Privacy Shift
- Marketing will face the largest disruption from privacy regulation and thus needs new technical patterns.
- Privacy can become a competitive advantage rather than just a compliance cost.
Differential Privacy Balances Individual And Aggregate Risk
- Differential privacy bounds the information gain about any individual while preserving aggregate insights.
- Local DP adds noise on-device and needs massive scale to recover signal.
Apple's Emoji Experiment With Local DP
- Apple used local differential privacy on phones to gather emoji statistics and then aggregated with device identifiers removed.
- They found language-specific emoji differences, showing how local DP can work at massive scale.
