
OpenAI Leadership Reshuffle, AI Unicorns, and White-Collar Work
Perplexity AI
00:00
Benchmark: AI struggles with white-collar tasks
Jaeden summarizes Merkur's Apex Agents benchmark showing models perform around 18–24% on complex professional tasks.
Play episode from 05:37
Transcript


