#324 Sharon Zhou: Inside AMD's Plan to Build Self-Improving AI

Eye On A.I.

Verifiable Rewards from Profiling for RL

Sharon Zhou explains how profiler metrics can serve as verifiable rewards for training models with reinforcement learning.
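
As a rough illustration of the idea, here is a minimal Python sketch of turning profiler output into a scalar reward. The summary does not say which metrics or reward shape are used, so everything here is an assumption: the `ProfileResult` fields, the `profiler_reward` function, the correctness gate, and the capped speedup-over-baseline formula are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class ProfileResult:
    """Hypothetical summary of one profiler run for a model-generated kernel."""
    passed_correctness: bool  # did the outputs match a reference implementation?
    runtime_ms: float         # measured runtime of the candidate


def profiler_reward(result: ProfileResult, baseline_ms: float, cap: float = 10.0) -> float:
    """Map profiler measurements to a scalar reward (assumed scheme, not from the episode)."""
    # An incorrect or unmeasurable candidate earns nothing, however fast it looks.
    if not result.passed_correctness or result.runtime_ms <= 0:
        return 0.0
    # Reward is the speedup over the baseline, capped to limit outliers.
    speedup = baseline_ms / result.runtime_ms
    return min(speedup, cap)
```

The reward is "verifiable" in the sense that it comes from a measurement that can be rerun, rather than from a learned judge; gating on correctness first is one way to keep a policy from being rewarded for fast but wrong output.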

Play episode from 25:53
