
Gemini 3.1 Pro Aces Benchmarks, I Suppose
Don't Worry About the Vase Podcast
00:00
Quirks, Rollout Issues, and Reliability
Zvi highlights common negative feedback: flaky CLI, API errors, post-train rubric issues, and rollout missteps.
Play episode from 17:19
Transcript


