CXOTalk

Top Data Scientists Explain Bad Data, Poisoned Datasets, and Other AI Killers | CXOTalk #896

62 snips
Oct 9, 2025
Join Dr. David Bray, a tech policy expert at the Stimson Center, and Dr. Anthony Scriffignano, a data science leader, as they dive into the hidden threats of bad data and poisoned datasets in AI. They discuss the Five Ms framework for identifying AI failures and why organizations often rush into AI adoption without proper vetting. Learn about the risks of generative AI, the importance of critical thinking and ethical oversight, and how to recognize malicious data campaigns that can undermine your AI systems.
Ask episode
AI Snips
Chapters
Books
Transcript
Episode notes
INSIGHT

Data Truth Has A Lifespan

  • Truth in data decays and math doesn't care about temporal validity, so models can regress on stale facts.
  • Organizations must ask hard provenance and agency questions before allowing teams to build AI tools.
INSIGHT

Smaller Specialized Models Over Mega LLMs

  • The future likely favors many smaller specialized models communicating rather than one mega-LLM doing everything.
  • Active inference and agentic systems can model continuous environments and coordinate domain-specific intelligence.
ANECDOTE

Nation-State Data Poisoning Example

  • David described a reported Russian campaign to teach LLMs falsehoods across free societies about history and events.
  • He emphasized that poisoned training signals are hard to undo because models don't forget easily.
Get the Snipd Podcast app to discover more snips from this episode
Get the app