
OpenAI Leadership Reshuffle, AI Unicorns, and White-Collar Work
Let Freedom: Political News, Un-Biased, Lex Fridman, Joe Rogan, CNN, Fox News
00:00
Apex Agents benchmark: limits on white-collar tasks
Jaden reviews Merkur's Apex Agents benchmark showing top models score ~24%, struggling with multi-domain workplace tasks.
Play episode from 05:49
Transcript


