undefined

Olivia Watkins

Member of OpenAI's Frontier Evals team focused on evaluation design and contamination analysis; collaborated on creating and analyzing SWE-Bench Verified and related coding benchmarks.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app