791: Reinforcement Learning from Human Feedback (RLHF), with Dr. Nathan Lambert

Super Data Science: ML & AI Podcast with Jon Krohn

00:00 Intro

Dr. Nathan Lambert discusses the origins, applications, and limitations of reinforcement learning from human feedback (RLHF), as well as alternative techniques. The conversation sheds light on how RLHF is used to fine-tune large language models, addresses the perceived mystical aspects of AI, and covers his AI research background.
