Effective Altruism: Ten Global Problems – 80,000 Hours (October 2021)

Four: Brian Christian on artificial intelligence

Oct 3, 2021
ANECDOTE

Kids Over-Imitate On Purpose

  • Children copy even unnecessary actions when they infer that the demonstrator had a reason for them.
  • Three-year-olds copy pointless steps because they assume hidden causal reasons.

INSIGHT

Self-Imitation Powers AlphaGo Zero

  • AlphaGo Zero learned its policy by imitating the outcomes of its own deliberative search, creating a feedback loop.
  • Iterated distillation and amplification of this kind lets a system surpass the human data it would otherwise be limited to imitating.
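
The feedback loop described above can be sketched on a toy game. This is a minimal illustration and not AlphaGo Zero's actual method: the game (Nim, take 1 to 3 stones, taking the last stone wins), the lookup tables standing in for neural networks, the one-step lookahead standing in for tree search, and the constants TEMPERATURE and LEARNING_RATE are all assumptions made for the example. The value table is also updated directly from the lookahead rather than from self-play outcomes, which is a further simplification.

```python
import math

MAX_PILE = 10          # toy game: Nim, take 1-3 stones, taking the last stone wins
MOVES = (1, 2, 3)
TEMPERATURE = 0.5      # hypothetical sharpness of the search distribution
LEARNING_RATE = 0.5    # hypothetical distillation step size


def legal_moves(pile):
    return [m for m in MOVES if m <= pile]


def init_tables():
    # Uniform policy and uninformed value estimates (0.5 = "no idea who wins").
    policy = {n: {m: 1.0 / len(legal_moves(n)) for m in legal_moves(n)}
              for n in range(1, MAX_PILE + 1)}
    value = {n: 0.5 for n in range(1, MAX_PILE + 1)}
    value[0] = 0.0  # no stones left: the player to move has already lost
    return policy, value


def search(pile, policy, value):
    """Amplify: one-step lookahead that reweights the prior by move quality."""
    # A move is good for us if the resulting position is bad for the opponent.
    q = {m: 1.0 - value[pile - m] for m in legal_moves(pile)}
    weights = {m: policy[pile][m] * math.exp(q[m] / TEMPERATURE) for m in q}
    total = sum(weights.values())
    target = {m: w / total for m, w in weights.items()}
    return target, max(q.values())


def train(iterations=60):
    policy, value = init_tables()
    for _ in range(iterations):
        new_value = dict(value)
        for n in range(1, MAX_PILE + 1):
            target, best_q = search(n, policy, value)
            # Distill: pull the fast policy toward what the search preferred.
            for m in target:
                policy[n][m] += LEARNING_RATE * (target[m] - policy[n][m])
            new_value[n] = best_q
        value = new_value  # the next round of search uses the improved estimates
    return policy, value


def best_move(policy, pile):
    return max(policy[pile], key=policy[pile].get)
```

Calling `train()` and then `best_move(policy, 5)` should show the policy settling on moves that leave the opponent a multiple of four stones, even though no human games were ever supplied.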

INSIGHT

Infer Goals Instead Of Actions

  • Inverse reinforcement learning (IRL) infers a user's reward function from observed behavior.
  • IRL can capture goals that humans cannot demonstrate themselves; the system then optimizes its behavior to achieve them.
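
The two steps above, inferring a reward and then optimizing for it, can be sketched in a deliberately small setting. This is a minimal sketch under stated assumptions, not a method described in the episode: the demonstrator makes one-shot choices between options, each option has two made-up features (time_saved, risk), the reward is assumed linear in those features, and the demonstrator is assumed to be noisily rational (likelier to pick higher-reward options). The data in DEMONSTRATIONS is invented for illustration.

```python
import math

# Each option is described by two hypothetical features: (time_saved, risk).
# Each demonstration is (available options, index the demonstrator chose).
DEMONSTRATIONS = [
    ([(1.0, 0.8), (0.5, 0.1)], 1),
    ([(0.9, 0.9), (0.2, 0.2)], 1),
    ([(0.8, 0.2), (0.3, 0.2)], 0),
    ([(0.6, 0.5), (0.6, 0.1)], 1),
    ([(1.0, 0.3), (0.4, 0.3)], 0),
]


def reward(weights, features):
    # Assumed linear reward: a weighted sum of the option's features.
    return sum(w * f for w, f in zip(weights, features))


def choice_probabilities(weights, options):
    # Noisily rational demonstrator: better options are exponentially likelier.
    scores = [math.exp(reward(weights, f)) for f in options]
    total = sum(scores)
    return [s / total for s in scores]


def infer_reward(demonstrations, steps=500, learning_rate=0.5):
    """Gradient ascent on the log-likelihood of the observed choices."""
    weights = [0.0, 0.0]
    for _ in range(steps):
        gradient = [0.0, 0.0]
        for options, chosen in demonstrations:
            probs = choice_probabilities(weights, options)
            for k in range(len(weights)):
                expected = sum(p * f[k] for p, f in zip(probs, options))
                # Chosen feature minus what the current model expected.
                gradient[k] += options[chosen][k] - expected
        weights = [w + learning_rate * g for w, g in zip(weights, gradient)]
    return weights


def best_option(weights, options):
    # Optimize: pick the option with the highest inferred reward.
    return max(range(len(options)), key=lambda i: reward(weights, options[i]))
```

For example, after `weights = infer_reward(DEMONSTRATIONS)`, calling `best_option(weights, [(0.9, 0.9), (0.7, 0.2), (0.3, 0.2)])` ranks options the demonstrator never chose between, using only the inferred preference for saving time at low risk.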