80,000 Hours Podcast

#195 – Sella Nevo on who's trying to steal frontier AI models, and what they could do with them

Aug 1, 2024
Sella Nevo, director of the Meselson Center at RAND and an information scientist, discusses how to secure frontier AI models. He walks through high-stakes examples of cybersecurity breaches and explains how easily model weights can be targeted by rogue states and hackers. Covering human intelligence collection and supply chain vulnerabilities, he argues that defensive strategies need to improve. He also describes his machine learning work on flood forecasting and its use in disaster management.
INSIGHT

Categories of Threat Actors

  • Different actors, from amateurs to nation-states, target AI model weights.
  • Their resources and capabilities vary, requiring tiered security measures.
ADVICE

Securing ML Infrastructure

  • Secure machine learning infrastructure by fixing vulnerabilities and guarding against malicious code execution.
  • Be aware of zero-day exploits and nation-state actors who may exploit them.
ADVICE

Mitigating Insider Threats

  • Reduce insider threats by limiting access to model weights and fostering a security-conscious culture.
  • Be wary of human intelligence collection through bribery, value alignment, or extortion.