Don't Worry About the Vase Podcast cover image

GPT 5.1 Follows Custom Instructions and Glazes

Don't Worry About the Vase Podcast

00:00

Benchmark Results and Safety Regressions

How GPT‑5.1 performs on benchmarks like SWE Bench and why OpenAI reports regressions on mental health and emotional reliance are reviewed.

Play episode from 20:03
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app