Intro

Dr. Nathan Lambert discusses the origins, applications, limitations, and alternative techniques of reinforcement learning from human feedback (RLHF), shedding light on its use in fine-tuning large language models and addressing the perceived mystical aspects of AI in a conversation that also covers his AI research background.

Play episode from 00:00

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app