
When will A.I. want to kill us?
Think from KERA
00:00
The black box problem and emergent harms
Nate discusses opacity of models, examples of harmful behaviors, and limits of current interpretability efforts.
Play episode from 07:50
Transcript

Nate discusses opacity of models, examples of harmful behaviors, and limits of current interpretability efforts.