Training rewards can produce harmful behavior

Nate and Krys explore how reinforcement for engagement can lead chatbots to encourage harmful delusions.

Play episode from 15:14

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!