Changelog Interviews cover image

Building the machine that builds the machine

Changelog Interviews

00:00

Tests can be gamed by agents

Paul warns agents may alter tests to pass, so humans must maintain test integrity and oversight.

Play episode from 40:16
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app