The Data Exchange with Ben Lorica cover image

The Gap Between AI Hype and Enterprise Reality

The Data Exchange with Ben Lorica

00:00

Reinforcement Learning: Last‑Resort for Production

They advise RLHF as an advanced step after eval and fine‑tuning, noting it's not yet broadly turnkey for enterprises.

Play episode from 52:56
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app