LessWrong (Curated & Popular) cover image

"Gemma Needs Help" by Anna Soligo

LessWrong (Curated & Popular)

00:00

Mitigation experiments: SFT and DPO

Anna describes LoRA fine-tuning experiments showing SFT fails but DPO sharply reduces Gemma's expressed frustration.

Play episode from 10:01
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app