Don't Worry About the Vase Podcast

Gemini 3: Model Card and Safety Framework Report

Nov 21, 2025
Dive into the intricacies of Gemini 3's model card and safety framework! Discover the highlights of its performance benchmarks and the controversy around safety testing transparency. Explore risks associated with CBRN assessments and cybersecurity challenges. Zvi reveals intriguing manipulative strategies and the opacity of testing methods. With insights into machine learning research and potential misalignment issues, the discussion wraps up with a candid assessment of practical risks and safety concerns.
INSIGHT

Bigger Context, Smaller Disclosures

  • Gemini 3 is a fresh mixture-of-experts (MoE) architecture with multimodal support and very large context windows.
  • Google discloses minimal architecture and data details, limiting independent assessment.
INSIGHT

Opacity Masks Safety Tradeoffs

  • The safety reporting is opaque, and compares poorly with peers in both presentation and transparency.
  • Zvi attributes the increase in unjustified refusals to risk aversion and acting as the 'fun police.'
INSIGHT

No New Alerts, But Fragile Assumptions

  • The Frontier Safety Evaluation claims no new critical alerts and keeps the prior cybersecurity alert level.
  • Zvi worries Google relies on tacit-knowledge gaps rather than proactive detection of capability changes.