
Eye On A.I. #325 Phelim Brady: Why AI's Future Depends on Human Judgement
9 snips
Mar 9, 2026 Phelim Bradley, co-founder and CEO of Prolific and former bioinformatics researcher, explains the hidden human layer powering modern AI. He discusses vetting and scaling representative participants, demographic‑aware model preference testing, why standard benchmarks are failing, and how continuous human evaluation and human–AI collaboration will shape model choice and reliability.
AI Snips
Chapters
Transcript
Episode notes
Who Labels Models Matters More Than Ever
- The pool behind labels now matters because modern evaluation needs representative, expert, and diverse participants rather than fungible workers.
- Prolific focuses on representativeness and breadth so models encode the depth of humanity, not just cheap annotations.
Multi Layer Verification To Stop Agentic Fraud
- Prolific verifies participants via KYC, periodic rechecks, deep profiles, qualification exams and behavioral analysis.
- This layered approach helps defend against emerging threats like agentic fraud where AI mimics human contributors.
Three Tiers Of Human Evaluators
- Prolific segments contributors into three groups: general audience, trained taskers, and expert-level participants for specialized evaluation.
- Roughly a third of AI work needs expert or trained evaluators, not generic crowdworkers.

