Raising Health

Building the Marketplace for AI’s Most Valuable Asset

16 snips
Feb 17, 2026
Bobby Samuels, cofounder and CEO of Protégé, builds a marketplace linking proprietary real-world data to AI labs. He discusses sourcing and packaging multimodal longitudinal data, why real-world beats synthetic, the rise of eval datasets and benchmarks, legal and rights challenges, and expanding beyond healthcare into video, audio, motion, and biology.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Synthetic Data Is Complementary Not Panacea

  • Synthetic and manufactured data have roles but don't replace real-world exhaust data for representativeness.
  • Use synthetic data for prototyping, privacy-safe previews, or simple extrapolations, but train on real data for fidelity.
ANECDOTE

Stitching Multimodal Patient Journeys

  • Protege built longitudinal, multimodal patient journeys by stitching EHRs, imaging, labs, and pathology across institutions.
  • That capability differentiated them from traditional data players focused mainly on structured pharma-oriented datasets.
ADVICE

Design For Recurring Data Access

  • Expect customers to request recurring access and refreshed datasets rather than one-off buys.
  • Structure agreements for renewals and periodic refreshes so models stay current and useful.
Get the Snipd Podcast app to discover more snips from this episode
Get the app