Effective Altruism: Ten Global Problems – 80,000 Hours (October 2021)

Four: Brian Christian on artificial intelligence

Oct 3, 2021

02:54:28

ANECDOTE

Kids Over-Imitate On Purpose

Children over-imitate even unnecessary actions when they infer a demonstrator had a reason.
Three-year-olds copy pointless steps because they assume hidden causal reasons.

INSIGHT

Self-Imitation Powers AlphaGo Zero

AlphaGo Zero learned its policy by imitating the outcomes of its own deliberative search, creating a feedback loop.
Iterated distillation and amplification let systems surpass human training data.
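
The self-imitation loop described above can be sketched on a toy problem. This is an illustrative sketch under loud assumptions, not AlphaGo Zero's actual method: a small counting game stands in for Go, an exact one-step lookahead stands in for tree search, and a probability table stands in for the neural network. All names (`TARGET`, `evaluate`, `train`, `greedy_move`) are hypothetical.

```python
# Toy game (an assumption for illustration): two players alternately add 1 or 2
# to a running total; whoever brings the total to TARGET wins.
TARGET = 10
ACTIONS = (1, 2)


def legal(total):
    """Moves that do not overshoot TARGET."""
    return [a for a in ACTIONS if total + a <= TARGET]


def evaluate(policy):
    """Exact win probability for the player to move, if both sides follow `policy`."""
    value = {TARGET: 0.0}  # the player to move at TARGET has already lost
    for total in range(TARGET - 1, -1, -1):
        value[total] = sum(
            policy[total][a] * (1.0 - value[total + a]) for a in legal(total)
        )
    return value


def train(rounds=30, lr=0.5):
    """Repeatedly improve the policy by imitating its own lookahead."""
    # Start from a uniform random policy: no human games are used.
    policy = {
        t: {a: 1.0 / len(legal(t)) for a in legal(t)} for t in range(TARGET)
    }
    for _ in range(rounds):
        value = evaluate(policy)
        for t in range(TARGET):
            # "Amplification": deliberate one step ahead using the current policy.
            best = max(legal(t), key=lambda a: 1.0 - value[t + a])
            # "Distillation": nudge the fast policy toward the search result.
            for a in legal(t):
                target = 1.0 if a == best else 0.0
                policy[t][a] += lr * (target - policy[t][a])
    return policy


def greedy_move(policy, total):
    """The move the trained policy considers most likely to be best."""
    return max(policy[total], key=policy[total].get)


if __name__ == "__main__":
    trained = train()
    for t in range(TARGET):
        print(t, greedy_move(trained, t))
```

In this toy game the winning strategy is to leave the opponent on a total whose remainder modulo 3 is 1. The trained table should recover that rule from self-play alone, which mirrors, in miniature, how imitating one's own search can exceed any starting data.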

INSIGHT

Infer Goals Instead Of Actions

Inverse reinforcement learning (IRL) infers a user's reward function from observed behavior.
IRL can teach goals humans can't demonstrate, then optimize behavior to achieve them.
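
The idea of inferring a reward function from behavior, then optimizing against it, can be sketched as follows. This is a deliberately simplified stand-in for IRL: it assumes one-shot choices rather than sequential trajectories, a linear reward over two made-up features (speed and risk), and a demonstrator who picks options with probability proportional to the exponential of their reward. The data and all names (`fit_reward`, `best_option`, `DEMOS`) are hypothetical.

```python
import math


def reward(weights, features):
    """Linear reward: weighted sum of an option's features."""
    return sum(w * f for w, f in zip(weights, features))


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fit_reward(demos, n_features, steps=500, lr=0.1):
    """Maximum-likelihood reward weights given observed choices.

    Each demo is (options, chosen_index); options are feature tuples.
    """
    weights = [0.0] * n_features
    for _ in range(steps):
        grad = [0.0] * n_features
        for options, chosen in demos:
            probs = softmax([reward(weights, f) for f in options])
            for k in range(n_features):
                expected = sum(p * f[k] for p, f in zip(probs, options))
                # Log-likelihood gradient: observed feature minus expected feature.
                grad[k] += options[chosen][k] - expected
        weights = [w + lr * g / len(demos) for w, g in zip(weights, grad)]
    return weights


def best_option(weights, options):
    """Optimize: pick the option with the highest inferred reward."""
    return max(range(len(options)), key=lambda i: reward(weights, options[i]))


# Hypothetical observations; features are (speed, risk).
DEMOS = [
    ([(1.0, 0.9), (0.5, 0.1)], 1),  # chose slower but much safer
    ([(0.8, 0.2), (0.3, 0.2)], 0),  # chose faster at equal risk
    ([(0.9, 0.8), (0.6, 0.3)], 1),  # chose slower but safer
    ([(0.4, 0.1), (0.7, 0.1)], 1),  # chose faster at equal risk
]

if __name__ == "__main__":
    w = fit_reward(DEMOS, n_features=2)
    print("inferred weights (speed, risk):", w)
    plans = [(1.0, 1.0), (0.7, 0.2), (0.1, 0.1)]
    print("preferred plan:", plans[best_option(w, plans)])
```

The learner is never told "value speed, avoid risk"; it recovers those signs from the choices. It can then rank plans the demonstrator never showed, which is the sense in which inferring goals generalizes beyond copying actions.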

AI Safety and Machine Learning Ethics - A Review

03:36 • 4min

Is the Future Going to Be Good?

Using AGI in the Cloud?

11:06 • 4min

Is Linearity in the Output a Problem?

14:44 • 2min

How to Train a Neural Network?

16:39 • 5min

How Can You Learn From Your Own Estimate?

How Can Reinforcement Learning Go Off the Rails?

27:43 • 2min

Reward Actions, Not Actions of the Agent

29:38 • 2min

Are Human Questions First?

31:50 • 2min

The Problem of Sparse Rewards

Resonance Learning in the Real World

38:52 • 2min

How to Get Intelligent Behavior From AIs That Lack Curiosity

40:48 • 3min

How Do You Overcome This Problem of Sparsity?

43:36 • 2min

Can It Get Into a TV Screen in a Game?

45:32 • 3min

The Effect of TV on the Visual System

48:34 • 2min

The Human Brain Is Like That, Right?

51:01 • 3min

A Novelty Seeking Agent - What Happens When You Play a Computer Game?

54:27 • 2min

Is It Possible to Create a Knowledge-Seeking Agent?

56:01 • 3min

Is It Safe to Build a Super Intelligent Knowledge Finder?

Do You Know How to Model a Car?

01:01:50 • 2min

How to Automatically Identify Bad Drivers?

Imitation Is a Failure Mode of Imitation

01:09:07 • 4min

The Secret of Our Success

01:12:40 • 3min

AlphaGo Zero and Its Neural Networks

01:15:24 • 3min

Can You Guess What You're Going to Do?

01:18:40 • 3min

Learning by Imitating

01:21:32 • 2min

How to Do Things That We Can't Do, Right?

01:23:25 • 6min

Is It Possible to Teach Using This Method?

01:29:09 • 2min

How to Optimize Notification Delivery for a User Engagement Model?

01:30:44 • 2min

I Don't Want to Drink Alcohol

01:32:20 • 3min

Is There a Reward Function?

01:35:05 • 2min

How to Deactivate Part of the Network With Every Training Example

01:36:46 • 3min

Why Is It So Important to Have AI-Driven Cars?

01:39:54 • 3min

Is There a Risk of Uncertainty in Machine Decision Making?

01:43:06 • 3min

Is the Autopilot Mistaken?

01:46:02 • 2min

Is There a Difference Between Human Behavior and Human Irrationality?

01:47:45 • 3min

AI Safety Gridworlds

01:50:34 • 2min

Is There a Future for AI Safety?

Are We Learning Something Important About Intelligence and What Humans Are Doing?

02:00:09 • 5min

Is It Possible to Flesh That Out?

02:04:57 • 2min

Climate Change Is More Analogous Than the Same

02:07:24 • 3min

The Next Frontier in Social Choice Theory

02:10:09 • 3min

Is There a Stop on the Way Between Deception and Deception?

02:12:50 • 2min

Is There a Future for Transparency?

02:15:05 • 4min

A Good Black Mirror Episode, I Guess

02:18:37 • 3min

Yeye Daria on the Grando Ye, Episode Toooye Aan

02:21:13 • 2min

The Long-Term Future - Is There a Problem?

02:22:44 • 4min

I Think It's a Mistake to Go Down This Path.

02:26:54 • 2min

I Don't Think It's Going to Happen Any Time Soon

02:28:30 • 4min

Is There an Escapeye?

02:32:21 • 2min

How to Learn a Game Engine From Data?

02:34:05 • 2min

Are There Any Errors in the Book?

02:35:40 • 3min

Are There Any Important Challenges That You Can Contribute To?

02:38:19 • 3min

Machine Learning and Artificial Intelligence

02:41:00 • 2min

How to Solve a Reinforcement Learning Problem?

02:42:59 • 3min

The Deep Q-Networks Part of Machine Learning

02:46:17 • 2min

Effective Altruism and AI Safety

02:47:58 • 2min

Is There a Path Dependent to Capital Constraints in Hedge Funds?

02:49:50 • 3min

The 80,000 Hours Podcast - Episodes 40 and 47

02:52:42 • 2min

Brian Christian is a bestselling author with a particular knack for accurately communicating difficult or technical ideas from both mathematics and computer science.

The 80,000 Hours team found his new book The Alignment Problem to be an insightful and comprehensive review of the state of the research into making advanced artificial intelligence useful and reliably safe, and we thought he'd be a great person to introduce the problem.

Full transcript, related links, and summary of this interview

This episode was first broadcast on the regular 80,000 Hours Podcast feed on March 5, 2021. Some related episodes include:

#44 – Dr Paul Christiano on how OpenAI is developing real solutions to the 'AI alignment problem', and his vision of how humanity will progressively hand over decision-making to AI systems
#3 – Dr Dario Amodei on OpenAI and how AI will change the world for good and ill
#31 – Prof Allan Dafoe on defusing the political and economic risks posed by existing AI capabilities
#47 – Catherine Olsson & Daniel Ziegler on the fast path into high-impact ML engineering roles

Series produced by Keiran Harris.
