How a 20-Person Startup Won Gold at the Math Olympiad—Tying With OpenAI & DeepMind (Tudor Achim, CEO of Harmonic)

The Generalist

Reinforcement Learning and Synthetic Data

Tudor explains how reinforcement learning (RL) and verifier feedback are used to generate synthetic proofs and improve pattern recognition.

Play episode from 41:57