Chain of Thought | AI Agents, Infrastructure & Engineering cover image

Why LLMs Are Plausibility Engines, Not Truth Engines | Dan Klein

Chain of Thought | AI Agents, Infrastructure & Engineering

00:00

Applying RL to open-ended conversations

Dan explains how RL's gradient works and how they simulated agentic interactions for training.

Play episode from 43:10
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app