Training Data

Building the GitHub for RL Environments: Prime Intellect's Will Brown & Johannes Hagemann

126 snips
Feb 10, 2026
Johannes Hagemann, researcher and co-founder building environment design and RL tooling. Will Brown, AI researcher and co-founder focused on large-scale training infra and post-training workflows. They discuss Environments Hub, environments as evals and product surfaces, harnesses and agent interaction, democratizing frontier training, cybersecurity sims, recursive language models, synthetic data, and companies becoming AI research labs.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Harnesses Are Environment Components

  • Harnesses are part of environments that define how models interact with tools, sims, or subagents.
  • Use the environment abstraction because 'agent' or 'harness' alone is too narrow for the many interaction patterns to come.
ADVICE

Treat Environments As A Toolkit

  • Use environments as a general toolbox: for SFT, distillation, RL, A/B tests, and prompt optimization.
  • Reuse the same environment protocol to generate filtered training data and run systematic comparisons.
ANECDOTE

rCI Collaboration Drove Platform Work

  • Prime Intellect collaborated with rCI on large-scale pre-training and post-training infrastructure.
  • This partnership pushed them to build compute orchestration, training, and inference features needed by frontier labs.
Get the Snipd Podcast app to discover more snips from this episode
Get the app