
Why AI Evaluation Science Can't Keep Up (with Carina Prunkl)
Future of Life Institute Podcast
00:00
Improving validity and integrity of evals
She urges focus on construct/external validity, evaluation integrity, situational awareness, and real-world experiments.
Play episode from 16:00
Transcript


