Future of Benchmarking and Cost

Nathan warns frontier evals will grow costly; coding is easier to evaluate, while UI and system tests remain harder.

Play episode from 47:34

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!