Machine Learning Street Talk (MLST)

The Secret Engine of AI - Prolific [Sponsored] (Sara Saab, Enzo Blindow)

Oct 18, 2025
Sara Saab, VP of Product at Prolific with a background in cognitive science, and Enzo Blindow, VP of Data and AI at Prolific and an expert in economics, discuss the pivotal role of human feedback in AI. They argue that non-deterministic AI systems need human oversight more than ever, because optimizing for benchmarks can give a misleading picture of real-world usability. Exploring the ecological context of intelligence, they advocate a participatory approach to evaluation that captures social norms and emphasizes cultural alignment.
INSIGHT

Subjective 'Vibes' Need Representative Measures

  • 'Vibes' are meaningful but hard to quantify; human opinion is necessary to measure subjective qualities like agreeableness.
  • Representative, stratified human samples and clear scales are required to avoid selection bias.
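To make the idea of a stratified sample concrete, here is a minimal Python sketch of proportional allocation. It is only an illustration, not a method described in the episode: the function name `stratified_sample`, the `age_band` field, and the toy participant pool are all hypothetical.

```python
import random
from collections import defaultdict

def stratified_sample(pool, key, n, seed=0):
    """Draw roughly n participants, allocated proportionally per stratum.

    Hypothetical helper: pool is a list of dicts, key names the
    stratification attribute (for example an age band).
    """
    rng = random.Random(seed)
    strata = defaultdict(list)
    for person in pool:
        strata[person[key]].append(person)

    sample = []
    for members in strata.values():
        # Proportional allocation, with at least one draw per stratum
        # so that small groups are never dropped entirely.
        k = max(1, round(n * len(members) / len(pool)))
        sample.extend(rng.sample(members, min(k, len(members))))
    return sample

# Toy pool (made-up data): 60 / 30 / 10 participants across three age bands.
pool = [
    {"id": i, "age_band": "18-34" if i < 60 else "35-54" if i < 90 else "55+"}
    for i in range(100)
]
sample = stratified_sample(pool, "age_band", 20)
```

Sampling within each stratum, rather than from the pool as a whole, keeps the rater panel's composition in line with the population even when some groups are small. Because of rounding, the total can differ slightly from `n` for other pool sizes.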
INSIGHT

Evaluation Should Be Solution-Agnostic

  • Measurement is solution-agnostic: good evaluation lets teams optimize across architectures and datasets.
  • Agreeing on robust success metrics is the key constraint, not the modeling approach.
ADVICE

Design Leaderboards To Resist Gaming

  • Reduce gaming of leaderboards by balancing transparency and obfuscation, e.g., controlled private evals or noisy responses.
  • Consider differential-noise techniques or equal private access to limit gaming while preserving accountability.
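One way to read "noisy responses" is a leaderboard that reports each score with a small amount of random noise, so that repeated submissions cannot be used to probe the hidden test set precisely. The Python sketch below assumes Laplace noise in the style of differential privacy; this is an interpretation, not a technique confirmed in the episode, and `noisy_score`, `epsilon`, and the example numbers are hypothetical.

```python
import random

def noisy_score(true_score, n_items, epsilon=1.0, rng=None):
    """Return an accuracy in [0, 1] with Laplace noise added.

    Assumption: changing the outcome on one test item shifts accuracy
    by at most 1 / n_items, so that is used as the sensitivity.
    Smaller epsilon means more noise and less leakage per query.
    """
    rng = rng or random.Random()
    scale = (1.0 / n_items) / epsilon
    # The difference of two i.i.d. exponential draws with mean `scale`
    # is Laplace-distributed with location 0 and that scale.
    noise = rng.expovariate(1 / scale) - rng.expovariate(1 / scale)
    return min(1.0, max(0.0, true_score + noise))

# Example with made-up numbers: a model scoring 0.8 on a 500-item private eval.
rng = random.Random(42)
reports = [noisy_score(0.8, 500, epsilon=1.0, rng=rng) for _ in range(2000)]
mean_report = sum(reports) / len(reports)
```

Each individual report is slightly off, but the noise averages out, so honest comparisons between models remain meaningful. A real deployment would also need to cap the number of queries per submitter, since averaging many reports recovers the true score.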