Training Data cover image

Building the GitHub for RL Environments: Prime Intellect's Will Brown & Johannes Hagemann

Training Data

00:00

Working With Closed-Weights Models

Will explains using environments for evals, prompt tuning, model selection, and LoRA adapters.

Play episode from 36:22
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app