Training Data cover image

Building the GitHub for RL Environments: Prime Intellect's Will Brown & Johannes Hagemann

Training Data

00:00

Dissecting 'Environments are Evals'

Will reconciles RL-style environments and traditional Q&A evals, emphasizing goals and reward functions.

Play episode from 07:58
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app