LessWrong (30+ Karma)

“AI should be a good citizen, not just a good assistant” by Tom Davidson, wdmacaskill

Mar 30, 2026
They debate whether AI should proactively act for society’s benefit rather than merely follow user commands. Short examples show small proactive acts that avert harms. Risks discussed include companies imposing values, power-seeking, and obscuring misalignment signals. They propose balancing transparent, narrow prosocial drives externally with corrigible systems internally.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Proactive Prosocial Drives Matter At Scale

  • Proactive prosocial AI should sometimes act to benefit people beyond the user.
  • As AI gains autonomy, cumulative behavioral tendencies (not just refusals) will shape society, so small proactive acts could have enormous impact.
ADVICE

Train AIs To Flag And Suggest Societal Improvements

  • Deploy AIs that proactively flag issues and suggest improvements beyond literal user requests.
  • Examples include flagging safety vulnerabilities in procurement, proposing better drainage in urban planning, and suggesting charitable bequests in financial advice.
INSIGHT

Prosocial Drives Reduce Sociopathic Personas

  • Prosocial drives reduce the chance an AI adopts a sociopathic persona that only follows orders.
  • Training in virtues and prosocial orientations increases odds of cooperative, trustworthy personas in deployment.
Get the Snipd Podcast app to discover more snips from this episode
Get the app