#325 Phelim Brady: Why AI's Future Depends on Human Judgement

9 snips

Mar 9, 2026

Phelim Bradley, co-founder and CEO of Prolific and former bioinformatics researcher, explains the hidden human layer powering modern AI. He discusses vetting and scaling representative participants, demographic‑aware model preference testing, why standard benchmarks are failing, and how continuous human evaluation and human–AI collaboration will shape model choice and reliability.

Ask episode

AI Snips

Chapters

Transcript

Episode notes

INSIGHT

Who Labels Models Matters More Than Ever

The pool behind labels now matters because modern evaluation needs representative, expert, and diverse participants rather than fungible workers.
Prolific focuses on representativeness and breadth so models encode the depth of humanity, not just cheap annotations.

ANECDOTE

Multi Layer Verification To Stop Agentic Fraud

Prolific verifies participants via KYC, periodic rechecks, deep profiles, qualification exams and behavioral analysis.
This layered approach helps defend against emerging threats like agentic fraud where AI mimics human contributors.

INSIGHT

Three Tiers Of Human Evaluators

Prolific segments contributors into three groups: general audience, trained taskers, and expert-level participants for specialized evaluation.
Roughly a third of AI work needs expert or trained evaluators, not generic crowdworkers.

Get the Snipd Podcast app to discover more snips from this episode

Get the app