
OpenAI Podcast Episode 15 - Inside the Model Spec
Mar 25, 2026
Jason Wolf, an OpenAI alignment researcher who builds the Model Spec, explains the public framework that defines intended model behavior. He covers how the spec is written and updated, how instruction hierarchies resolve conflicts, how it handles tricky edge cases like kids’ questions, and how training and transparency shape real-world model behavior.
AI Snips
Santa Question Showed Spec In Action
- Jason Wolf described a family moment where his child asked the model if Santa Claus is real.
- The model answered vaguely and in a kid-safe way, demonstrating spec-aligned behavior in a real interaction.
Model Spec Is A Public Behavioral North Star
- The Model Spec is a public articulation of OpenAI's intended high-level model behaviors.
- It explains goals, trade-offs, and important decisions; it is not a full implementation and does not claim perfect compliance.
Explain Policy With Example Answers
- Use concrete examples and ideal-answer snippets to convey tone, style, and edge-case decisions in policy documents.
- The spec includes many borderline examples to show how honesty, politeness, and steerability should be balanced in practice.
