
OpenAI Leadership Reshuffle, AI Unicorns, and White-Collar Work
Latent Space AI
00:00
Benchmark shows limits on white-collar automation
Jaeden summarizes Merkur's Apex Agents benchmark showing models score ~25% on complex white-collar tasks.
Play episode from 05:49
Transcript


