

The Nonlinear Library
The Nonlinear Fund
The Nonlinear Library allows you to easily listen to top EA and rationalist content on your podcast player. We use text-to-speech software to create an automatically updating repository of audio content from the EA Forum, Alignment Forum, LessWrong, and other EA blogs. To find out more, please visit us at nonlinear.org
Episodes

Dec 1, 2023 • 12min
EA - Doing Good Effectively is Unusual by Richard Y Chappell
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Doing Good Effectively is Unusual, published by Richard Y Chappell on December 1, 2023 on The Effective Altruism Forum.
tl;dr: It actually seems pretty rare for people to care about the general good as such (i.e., optimizing cause-agnostic impartial well-being), as we can see by prejudged dismissals of EA concern for non-standard beneficiaries and for doing good via indirect means.
Introduction
Moral truisms may still be widely ignored. The moral truism underlying Effective Altruism is that we have strong reasons to do more good, and it's worth adopting the efficient promotion of the impartial good among one's life projects. (One can do this in a "non-totalizing" way, i.e. without it being one's only project.) Anyone who personally adopts that project (to any non-trivial extent) counts, in my book, as an effective altruist (whatever their opinion of the EA movement and its institutions).
Many people don't adopt this explicit goal as a personal priority to any degree, but still do significant good via more particular commitments (to more specific communities, causes, or individuals). That's fine by me, but I do think that even people who aren't themselves effective altruists should recognize the EA project as a good one. We should all generally want people to be more motivated by efficient impartial beneficence (on the margins), even if you don't think it's the only thing that matters.
A popular (but silly) criticism of effective altruism is that it is entirely vacuous. As Freddie deBoer writes:
[T]his sounds like so obvious and general a project that it can hardly denote a specific philosophy or project at all… [T]his is an utterly banal set of goals that are shared by literally everyone who sincerely tries to act charitably.
This is clearly false. As Bentham's Bulldog replies, most people give lip service to doing good effectively. But then they go and donate to local children's hospitals and puppy shelters, while showing no interest in learning about neglected tropical diseases or improving factory-farmed animal welfare.
DeBoer himself dismisses without argument "weird" concerns about shrimp welfare and existential risk reduction, which one very clearly cannot just dismiss as a priori irrelevant if one actually cares about promoting the impartial good. Caring about the impartial good in that way entails a very unusual degree of open-mindedness.
The fact is: open-minded, cause-agnostic concern for promoting the impartial good is vanishingly rare. As a result, the few people who sincerely have and act upon this concern end up striking everyone else as extremely weird. We all know that the way you're supposed to behave is to be a good ally to your social group, do normal socially-approved things that signal conformity and loyalty (and perhaps a non-threatening degree of generosity towards socially-approved recipients).
"Literally everyone" does this much, I guess. But what sort of weirdo starts looking into numbers, and argues on that basis that chickens are a higher priority than puppies? Horrible utilitarian nerds, that's who! Or so the normie social defense mechanism seems to be (never mind that efficient impartial beneficence is not exclusively utilitarian, and ought rather to be a significant component of any reasonable moral view).
Let's be honest
Everyone is motivated to rationalize what they're antecedently inclined to do. I know I do plenty of suboptimal things, due to both (i) failing to care as much as would be objectively warranted about many things (from non-cute animals to distant people), and (ii) being akratic and failing to be sufficiently moved even by things I value, like my own health and well-being. But I try to be honest about it, and recognize that (like everyone) I'm just irrational in a lot of ways, and that's OK, even if it isn't ideal.
Vegans care more about animals than I ...

Dec 1, 2023 • 20min
AF - Thoughts on "AI is easy to control" by Pope & Belrose by Steve Byrnes
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Thoughts on "AI is easy to control" by Pope & Belrose, published by Steve Byrnes on December 1, 2023 on The AI Alignment Forum.
Quintin Pope & Nora Belrose have a new "AI Optimists" website, along with a new essay "AI is easy to control", arguing that the risk of human extinction due to future AI ("AI x-risk") is a mere 1% ("a tail risk worth considering, but not the dominant source of risk in the world"). (I'm much more pessimistic.) It makes lots of interesting arguments, and I'm happy that the authors are engaging in substantive and productive discourse, unlike the ad hominem vibes-based drivel which has grown increasingly common on both sides of the AI x-risk issue in recent months.
This is not a comprehensive rebuttal or anything, but rather picking up on a few threads that seem important for where we disagree, or where I have something I want to say.
Summary / table-of-contents:
Note: I think Sections 1 & 4 are the main reasons that I'm much more pessimistic about AI x-risk than Pope & Belrose, whereas Sections 2 & 3 are more nitpicky.
Section 1 argues that even if controllable AI has an "easy" technical solution, there are still good reasons to be concerned about AI takeover, because of things like competition and coordination issues, and in fact I would still be overall pessimistic about our prospects.
Section 2 talks about the terms "black box" versus "white box".
Section 3 talks about what if anything we learn from "human alignment", including some background on how I think about human innate drives.
Section 4 argues that pretty much the whole essay would need to be thrown out if future AI is trained in a substantially different way from current LLMs. If this strikes you as a bizarre unthinkable hypothetical, yes I am here to tell you that other types of AI do actually exist, and I specifically discuss the example of "brain-like AGI" (a version of actor-critic model-based RL), spelling out a bunch of areas where the essay makes claims that wouldn't apply to that type of AI, and more generally how it would differ from LLMs in safety-relevant ways.
1. Even if controllable AI has an "easy" technical solution, I'd still be pessimistic about AI takeover
Most of Pope & Belrose's essay is on the narrow question of whether the AI control problem has an easy technical solution. That's great! I'm strongly in favor of arguing about narrow questions. And after this section I'll be talking about that narrow question as well. But the authors do also bring up the broader question of whether AI takeover is likely to happen, all things considered. These are not the same question; for example, there could be an easy technical solution, but people don't use it.
So, for this section only, I will assume for the sake of argument that there is in fact an easy technical solution to the AI control and/or alignment problem. Unfortunately, in this world, I would still think future catastrophic takeover by out-of-control AI is not only plausible but likely.
Suppose someone makes an AI that really really wants something in the world to happen, in the same way a person might really really want to get out of debt, or Elon Musk really really wants for there to be a Mars colony - including via means-end reasoning, out-of-the-box solutions, inventing new tools to solve problems, and so on. The classic concern about such an AI is instrumental convergence.
But before we get to that, why might we suppose that someone might make an AI that really really wants something in the world to happen? Well, lots of reasons:
People have been trying to do exactly that since the dawn of AI.
Humans often really really want something in the world to happen (e.g., for there to be more efficient solar cells, for my country to win the war, to make lots of money, to do a certain very impressive thing that will win fame and investors and NeurIPS pape...

Dec 1, 2023 • 5min
EA - Effektiv Spenden's Impact Evaluation 2019-2023 (exec. summary) by Sebastian Schienle
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Effektiv Spenden's Impact Evaluation 2019-2023 (exec. summary), published by Sebastian Schienle on December 1, 2023 on The Effective Altruism Forum.
effektiv-spenden.org is an effective giving platform in Germany and Switzerland that was founded in 2019. To reflect on our past impact, we examine Effektiv Spenden's cost-effectiveness as a "giving multiplier" from 2019 to 2022 in terms of how much money is directed to highly effective charities due to our work. We have two primary reasons for this analysis:
To provide past and future donors with transparent information about our cost-effectiveness;
To hold ourselves accountable, particularly in a situation where we are investing in further growth of our platform.
We provide both a simple multiple (or "leverage ratio") of donations raised for highly effective charities compared to our operating costs, as well as an analysis of the counterfactual (i.e. what would have happened had we never existed).
Our analysis complements our Annual Review 2022 (in German) and builds on previous updates and annual reviews, such as, amongst others, our reviews of 2021 and 2019. In both instances, we also included initial perspectives on our counterfactual impact. Since then, the investigation of Founders Pledge into giving multipliers as well as Giving What We Can (GWWC)'s recent impact evaluation have provided further methodological refinements. In line with GWWC's approach, we shift to 3-year time horizons, which we feel better represents our impact over time and avoids short-term distortions.
However, our attempt to quantify our "giving multiplier" deviates in some parts from the methodologies and assumptions applied by Founders Pledge and GWWC, and is an initial, shallow analysis only that we intend to develop further in the future.
Below, we share the key results of our analysis. We invite you to share any comments or takeaways you may have, either by directly commenting or by reaching out to sebastian.schienle@effektiv-spenden.org.
Key results
In 2022, we moved €15.3 million to highly effective charities, amounting to €37 million in total donations raised since Effektiv Spenden was founded in 2019.
Our leverage ratio, i.e. the money moved to highly effective charities per €1 spent on our operations, was 55.7 and 40.8 for the 2019-2021 and 2020-2022 time periods respectively.[1]
Our best-guess counterfactual giving multiplier is 17.9 and 13.0 for those two time periods, robustly exceeding 10x. This means that for every €1 spent on Effektiv Spenden between 2019-2022, we are confident that we facilitated more than €10 of support for highly effective charities which would not have materialized had Effektiv Spenden not existed.
Our conservative counterfactual giving multiplier is 10.4 for 2019-2021, and 7.5 for 2020-2022.
The decline of our multiplier over time is driven by investment in our team. Over the last year, our team has grown substantially to enable further growth. While this negatively impacts our giving multiplier in the short term, we consider it a necessary prerequisite for that growth.
Our ambition is to return to a best-guess counterfactual multiplier of at least 15x in the coming years. That said, ultimately our goal is not to maximize the multiplier, but to maximize counterfactually raised funds for highly effective charities. (As long as our work remains above a reasonable cost-effectiveness bar.)
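To make the relationship between the leverage ratio and the counterfactual multiplier concrete, here is a minimal sketch of the arithmetic in Python, with purely illustrative numbers (actual operating costs and counterfactuality estimates are not reproduced in this summary):

```python
# Illustrative numbers only -- not Effektiv Spenden's actual figures.
donations_moved = 30_000_000   # EUR moved to highly effective charities over 3 years
operating_costs = 550_000      # EUR spent on operations over the same period
counterfactual_share = 0.32    # assumed fraction of donations that would not have
                               # reached effective charities had the platform not existed

# Leverage ratio: money moved per euro spent on operations.
leverage_ratio = donations_moved / operating_costs  # ~54.5x

# Counterfactual giving multiplier: only counterfactually-raised funds count.
giving_multiplier = (counterfactual_share * donations_moved) / operating_costs  # ~17.5x

print(f"leverage {leverage_ratio:.1f}x, counterfactual multiplier {giving_multiplier:.1f}x")
```

On this reading, the multiplier is simply the leverage ratio discounted by the assumed counterfactuality share, which is why it declines whenever costs grow faster than counterfactually-attributable donations.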
How to interpret our results
We consider our analysis an important stocktake of our impact, and a further contribution to the growing body of giving multiplier analyses in the effective giving space. That said, we also recognize the limitations of our approach and want to call out some caveats to guide interpretation of these results.
Our analysis is largely retrospective, i.e. it compares our past money moved with operating ...

Dec 1, 2023 • 11min
AF - How useful for alignment-relevant work are AIs with short-term goals? (Section 2.2.4.3 of "Scheming AIs") by Joe Carlsmith
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: How useful for alignment-relevant work are AIs with short-term goals? (Section 2.2.4.3 of "Scheming AIs"), published by Joe Carlsmith on December 1, 2023 on The AI Alignment Forum.
This is Section 2.2.4.3 of my report "Scheming AIs: Will AIs fake alignment during training in order to get power?". There's also a summary of the full report here (audio here). The summary covers most of the main points and technical terms, and I'm hoping that it will provide much of the context necessary to understand individual sections of the report on their own.
Audio version of this section here, or search "Joe Carlsmith Audio" on your podcast app.
How much useful, alignment-relevant cognitive work can be done using AIs with short-term goals?
So overall, I think that training our models to pursue long-term goals - whether via long episodes, or via short episodes aimed at inducing long-term optimization - makes the sort of beyond-episode goals that motivate scheming more likely to arise. So this raises the question: do we need to train our models to pursue long-term goals?
Plausibly, there will be strong general incentives to do this. That is: people want optimization power specifically applied to long-term goals like "my company being as profitable as possible in a year." So, plausibly, they'll try to train AIs that optimize in this way. (Though note that this isn't the same as saying that there are strong incentives to create AIs that optimize the state of the galaxies in the year five trillion.)
Indeed, there's a case to be made that even our alignment work, today, is specifically pushing towards the creation of models with long-term - and indeed, beyond-episode - goals. Thus, for example, when a lab trains a model to be "harmless," then even though it is plausibly using fairly "short-episode" training (e.g., RLHF on user interactions), it intends a form of "harmlessness" that extends quite far into the future, rather than cutting off the horizon of its concern after e.g. an interaction with the user is complete.
That is: if a user asks for help building a bomb, the lab wants the model to refuse, even if the bomb in question won't be set off for a decade.[1] And this example is emblematic of a broader dynamic: namely, that even when we aren't actively optimizing for a specific long-term outcome (e.g., "my company makes a lot of money by next year"), we often have in mind a wide variety of long-term outcomes that we want to avoid (e.g., "the drinking water in a century is not poisoned"), and which it wouldn't be acceptable to cause in the course of accomplishing some short-term task.
Humans, after all, care about the state of the future for at least decades in advance (and for some humans: much longer), and we'll want artificial optimization to reflect this concern.
So overall, I think there is indeed quite a bit of pressure to steer our AIs towards various forms of long-term optimization. However, suppose that we're not blindly following this pressure. Rather, we're specifically trying to use our AIs to perform the sort of alignment-relevant cognitive work I discussed above - e.g., work on interpretability, scalable oversight, monitoring, control, coordination amongst humans, the general science of deep learning, alternative (and more controllable/interpretable) AI paradigms, and the like. Do we need to train models with long-term goals for that kind of work?
In many cases, I think the answer is no. In particular: I think that a lot of this sort of alignment-relevant work can be performed by models that are e.g. generating research papers in response to human+AI supervision over fairly short timescales, suggesting/conducting relatively short-term experiments, looking over a codebase and pointing out bugs, conducting relatively short-term security tests and red-teaming attempts, and so on.
We can talk about whether it will be possible to generate rewar...

Dec 1, 2023 • 8min
EA - My Personal Priorities, Charity, Judaism, and Effective Altruism by Davidmanheim
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: My Personal Priorities, Charity, Judaism, and Effective Altruism, published by Davidmanheim on December 1, 2023 on The Effective Altruism Forum.
I've thought a lot about charitable giving over the past decade, both from a universalist and from a Jewish standpoint. I have a few thoughts, including about how my views have evolved over time. This is a very different perspective than many in Effective Altruism have, but I think it's important, as a member of a community that benefits from being diverse rather than monolithic, for those who dissent from community consensus to make it clear that it's acceptable to do so. Hopefully, this can be useful both to other people who are interested in a more Jewish perspective, and for everyone else interested in thinking about balancing different personal views with effective giving.
Background
To start, there is a strong Jewish tradition, and a legal requirement in the Shulchan Aruch, the code of Jewish law, for giving at least ten percent of your income to the poor and to community organizations - and for those who can afford it, ideally, a fifth of their income. (For some reason, no-one ever points out that second part.)
So I always gave a tenth of my income to charity, even before starting my first post-college job, per Jewish customary law. My parents inculcated this as a value and a norm since childhood, and it's one I am grateful for. (One thing I did differently than most, and credit my sister with suggesting, is putting 10% of my paycheck directly into a second account which was exclusively for charity.)
My giving as a child, and as a young adult, largely centered on local Jewish organizations, poverty assistance for local poor people and the poor in Israel, and community organizations I interacted with. In the following years, I started thinking more critically about my giving, and charity to community organizations seemed in tension with a more universalist impulse, what you might call "Tikkun Olam"- a directive to improve the world as a whole. I was very conflicted about this for quite some time, but have come to some tentative conclusions, and I wanted to outline my current views, informed by a combination of the Jewish sources and my other beliefs.
Judaism vs. Utilitarians
I am lucky enough, like most people I know personally, to have significantly more money than is strictly needed to feed, clothe, and house myself and my family. The rest of the money, however, needs to be allocated - for savings, for entertainment, for community, and for charity. And my conclusion, after reflection about the question, is that those last two are separate both conceptually and as a matter of Jewish conception of charity.
My synagogue is a wonderful community institution that I benefit from, and I believe it is proper to pay my fair share. And in Halacha, Jewish law, community organizations are valid recipients of charity. But there is also a strong justification for prioritizing giving to those most in need.
Utilitarian philosophers have advocated for giving on an impartial basis, seeing a contradiction between universalism and their "selfish" impulse to justify keeping more than a minimal amount of their own money. To maximize global utility, all money over a bare minimum should go to those most in need, or otherwise be maximally impactful. In contrast, Halacha is clear that you and your family come first, and giving more than a token amount of charity must wait until your family's needs are met.
More than that, it is clearly opposed to giving more than 20% of your income under usual circumstances, i.e. short of significant excess wealth. And once you are giving to charity, Jewish sources suggest progressively growing moral circles, first giving to family in need, then neighbors, then the community. In contrast to this, Jewish law also contai...

Dec 1, 2023 • 49min
EA - ALLFED's 2023 Highlights by Sonia Cassidy
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: ALLFED's 2023 Highlights, published by Sonia Cassidy on December 1, 2023 on The Effective Altruism Forum.
Executive Summary
Welcome to ALLFED's 2023 Highlights, our annual update on what the Alliance to Feed the Earth in Disasters (ALLFED) has been up to this year. From advising Open Philanthropy on food security, to 6 new papers submitted for peer review, to writing preparedness/response plans for 3 governments, we have made substantial strides towards our mission to increase resilience to global food catastrophes.
There is much more we could do, as we are currently funding-constrained. If you like what you read in this post, please see also our 2023 ALLFED Marginal Funding Appeal and consider donating to us via our website this giving season.
Increasing geopolitical tensions have presented an opportunity to translate ALLFED's scientific research into actionable policy proposals with endeavors such as writing national preparedness and response plans against abrupt sunlight reduction scenarios (ASRS, e.g. volcanic or nuclear winter) for various countries such as the United States, Australia, and Argentina.
We continue to explore further options for governments to plan and develop technology through pilots. We have worked towards producing the evidence base needed to inform decision making prior to and during global catastrophe, with 6 new core ALLFED papers submitted for peer review. We have also redoubled efforts in studying responses to potential mass infrastructure collapse scenarios, such as from large-scale nuclear electromagnetic pulse, AI-powered cyberattacks, or extreme pandemics (e.g. high transmissibility and mortality causing mass absenteeism). On this topic, we have produced around half a dozen papers over the years (including one this year).
Here is what you can read about in these 2023 Highlights:
We kick off with a strategy section and some insights into our top-level thinking and ALLFED's Theory of Change.
We then report on our research, including 6 new papers submitted for peer review and some contraptions we have engineered. According to an analysis of the Cambridge Centre for Existential Risk paper database, ALLFED team members are the second, third, fourteenth, and twenty-first most prolific X-risk academic researchers in the world.
We talk about our policy work next, focusing on engagements with the governments of Australia and Argentina (through partnership with the Spanish-speaking GCR org) as well as United States policy engagement (which included endorsement of Senator Edward Markey's Health Impacts of Nuclear War Act).
We then move to communications, especially our GCR field-building and science communications. It has been gratifying to see ALLFED's work propagating and an increasing use of our field-defining terminology, which we give examples of here.
We follow up with events: circa 20 presentations, plus an account of a recent workshop we gave at EAGx Australia.
We then move to operations, the backbone of ALLFED's day-to-day activities, and an important element of our organizational resilience for response in a GCR (one modality of our Theory of Change).
Our team section comes next, where we celebrate our team. ALLFED's multilingual team members are located around the globe and can talk about our work, and deliver workshops and presentations in a number of languages, including Spanish, German, French, Russian, Czech, Polish, Kannada, Tamil, Hindi, Filipino, Yoruba and more. In the team section, we also share with you a fun seaweed-eating experiment some of our team members participated in to experience a 10% seaweed diet.
We close with thanks and acknowledgements, to all our donors, collaborators and supporters. We would like to take this opportunity to especially thank Greg Colbourn and the Centre for Enabling EA Learning & Research (CEEALA...

Dec 1, 2023 • 39min
LW - How useful is mechanistic interpretability? by ryan greenblatt
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: How useful is mechanistic interpretability?, published by ryan greenblatt on December 1, 2023 on LessWrong.
Opening positions
I'm somewhat skeptical about mech interp (bottom-up or substantial reverse engineering style interp):
Current work seems very far from being useful (it isn't currently useful) or from explaining much of what's going on inside of models in key cases. But it's hard to be very confident that a new field won't work! And things can be far from useful, but become useful via slowly becoming more powerful, etc.
In particular, current work fails to explain much of the performance of models which makes me think that it's quite far from ambitious success and likely also usefulness. I think this even after seeing recent results like dictionary learning results (though results along these lines were a positive update for me overall).
There isn't a story which-makes-much-sense-and-seems-that-plausible-to-me for how mech interp allows for strongly solving core problems like auditing for deception or being able to supervise superhuman models which carry out actions we don't understand (e.g. ELK).
That said, all things considered, mech interp seems like a reasonable bet to put some resources in.
I'm excited about various mech interp projects which either:
Aim to more directly measure and iterate on key metrics of usefulness for mech interp
Try to use mech interp to do something useful and compare to other methods (I'm fine with substantial mech interp industrial policy, but we do actually care about the final comparison. By industrial policy, I mean subsidizing current work even if mech interp isn't competitive yet because it seems promising.)
I'm excited about two main outcomes from this dialogue:
Figuring out whether or not we agree on the core claims I wrote above. (Either get consensus or find crux ideally)
Figuring out which projects we'd be excited about which would substantially positively update us about mech interp.
Maybe another question which is interesting: even if mech interp isn't that good for safety, maybe it's pretty close to stuff which is great and is good practice.
Another outcome that I'm interested in is personally figuring out how to better articulate and communicate various takes around mech interp.
By mech interp I mean "A subfield of interpretability that uses bottom-up or reverse engineering approaches, generally by corresponding low-level components such as circuits or neurons to components of human-understandable algorithms and then working upward to build an overall understanding."
I feel pretty on board with this definition.
Our arguments here do in fact have immediate implications for your research, and the research of your scholars, implying that you should prioritize projects of the following forms:
Doing immediately useful stuff with mech interp (and probably non-mech interp), to get us closer to model-internals-based techniques adding value. This would improve the health of the field, because it's much better for a field to be able to evaluate work in simple ways.
Work which tries to establish the core ambitious hopes for mech interp, rather than work which scales up mediocre-quality results to be more complicated or on bigger models.
What I want from this dialogue:
Mostly an excuse to form more coherent takes on why mech interp matters, limitations, priorities, etc
I'd be excited if this results in us identifying concrete cruxes
I'd be even more excited if we identify concrete projects that could help illuminate these cruxes (especially things I could give to my new army of MATS scholars!)
I'd like to explicitly note I'm excited to find great concrete projects!
Stream of ...

Nov 30, 2023 • 22min
AF - FixDT by Abram Demski
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: FixDT, published by Abram Demski on November 30, 2023 on The AI Alignment Forum.
FixDT is not a very new decision theory, but little has been written about it afaict, and it's interesting. So I'm going to write about it.
TJ asked me to write this article to "offset" not engaging with Active Inference more. The name "fixDT" is due to Scott Garrabrant, and stands for "fixed-point decision theory". Ideas here are due to Scott Garrabrant, Sam Eisenstat, me, Daniel Hermann, TJ, Sahil, and Martin Soto, in roughly that priority order; but heavily filtered through my own lens.
This post may provide some useful formalism for thinking about issues raised in The Parable of Predict-O-Matic.
Self-fulfilling prophecies & other spooky map-territory connections.
A common trope is for magic to work only when you believe in it. For example, in Harry Potter, you can only get to the magical train platform 9¾ if you believe that you can pass through the wall to get there.
A plausible normative-rationality rule, when faced with such problems: if you want the magic to work, you should believe that it will work (and you should not believe it will work, if you want it not to work).
Can we sketch a formal decision theory which handles such problems?
We can't start by imagining that the agent has a prior probability distribution, like we normally would, since the agent would already be stuck -- either it lucked into a prior which believed the magic could work, or, it didn't.
Instead, the "beliefs" of the agent start out as maps from probability distributions to probability distributions. I'll use "P" as the type for probability distributions (little p for a specific probability distribution). So the type of "beliefs", B, is a function type: b:PP (little b for a specific belief). You can think of these as "map-territory connections": b is a (causal?) story about what actually happens, if we believe p. A "normal" prior, where we don't think our beliefs influence the world, would just be a constant function: it always outputs the same p no matter what the input is.
Given a belief b, the agent then somehow settles on a probability distribution p. We can now formalize our rationality criteria:
Epistemic Constraint: The probability distribution p which the agent settles on cannot be self-refuting according to the beliefs. It must be a fixed point of b: a p such that b(p)=p.
Instrumental Constraint: Out of the options allowed by the epistemic constraint, p should be as good as possible; that is, it should maximize expected utility: p := argmax_{p : b(p) = p} E_p[U].
We can also require that b be a continuous function, to guarantee the existence of a fixed point[1], so that the agent is definitely able to satisfy these requirements. This might seem like an arbitrary requirement, from the perspective where b is a story about map-territory connections; why should they be required to be continuous? But remember that b is representing the subjective belief-formation process of the agent, not a true objective story. Continuity can be thought of as a limit to the agent's own self-knowledge.
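As a toy illustration of how the two constraints fit together, here is a minimal Python sketch (an illustrative construction, not from the original post). It represents P by a single number p = Pr(X) for the proposition X = "the magic works", and uses a hypothetical continuous belief map with several fixed points:

```python
import numpy as np

def b(p):
    # Self-fulfilling toy belief: the more the agent expects the magic to work,
    # the likelier it is to work. Steep but continuous, so fixed points exist.
    return 1.0 / (1.0 + np.exp(-12.0 * (p - 0.5)))

def expected_utility(p):
    return p  # suppose U = 1 if the magic works and 0 otherwise, so E_p[U] = p

# Epistemic constraint: locate fixed points b(p) = p via sign changes of b(p) - p.
grid = np.linspace(0.0, 1.0, 100_001)
g = b(grid) - grid
crossings = np.where(np.sign(g[:-1]) * np.sign(g[1:]) <= 0)[0]
fixed_points = sorted({round((grid[i] + grid[i + 1]) / 2, 4) for i in crossings})

# Instrumental constraint: among the fixed points, settle on the best one.
p_star = max(fixed_points, key=expected_utility)
print(fixed_points, p_star)  # roughly [0.003, 0.5, 0.997]; p* ≈ 0.997
```

An agent satisfying both constraints "believes in the magic": it settles on the fixed point nearest 1, since that maximizes its expected utility among the self-consistent options.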
For example, the self-referential statement X: "p(X) < 1/2" suggests an "objectively true" belief which maps p(X) to 1 if it's below 1/2, and maps it to 0 if it's above or equal to 1/2. But this belief has no fixed point; an agent with this belief cannot satisfy the epistemic constraint on its rationality. If we require b to be continuous, we can only approximate the "objectively true" belief function, by rapidly but not instantly transitioning from 1 to 0 as p(X) rises from slightly less than 1/2 to slightly more.
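The same point in code (again a toy rendering, not from the post): the discontinuous belief suggested by X has no fixed point, while a steep continuous approximation does:

```python
import math

def b_objective(p):
    # The "objectively true" belief for X: "p(X) < 1/2".
    # b(p) = p is unsatisfiable: the output jumps from 1 to 0 at p = 1/2.
    return 1.0 if p < 0.5 else 0.0

def b_approx(p, k=100.0):
    # Continuous approximation: transitions rapidly, but not instantly, at 1/2.
    return 1.0 / (1.0 + math.exp(k * (p - 0.5)))

print(b_approx(0.5))  # 0.5 -- a fixed point, so the epistemic constraint is satisfiable
```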
These "beliefs" are a lot like "trading strategies" from Garrabrant Induction.
We can also replace the continuity requirement with a Kakutani requirement, to get something more like Paul's self-referential probabili...

Nov 30, 2023 • 16min
LW - What's next for the field of Agent Foundations? by Nora Ammann
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: What's next for the field of Agent Foundations?, published by Nora Ammann on November 30, 2023 on LessWrong.
Alexander, Matt and I want to chat about the field of Agent Foundations (AF), where it's at and how to strengthen and grow it going forward.
We will kick off by each of us making a first message outlining some of our key beliefs and open questions at the moment. Rather than giving a comprehensive take, the idea is to pick out 1-3 things we each care about/think are important, and/or that we are confused about/would like to discuss. We may respond to some subset of the following prompts:
Where is the field of AF at in your view? How do you see the role of AF in the larger alignment landscape/with respect to making AI futures go well? Where would you like to see it go? What do you see as some of the key bottlenecks for getting there? What are some ideas you have about how we might overcome them?
Before we launch in properly, just a few things that seem worth clarifying:
By Agent Foundations, we mean roughly speaking conceptual and formal work towards understanding the foundations of agency, intelligent behavior and alignment. In particular, we mean something broader than what one might call "old-school MIRI-type Agent Foundations", typically informed by fields such as decision theory and logic.
We will not specifically be discussing the value or theory of change behind Agent Foundations research in general. We think these are important conversations to have, but in this specific dialogue, our goal is a different one, namely: assuming AF is valuable, how can we strengthen the field?
Should it look more like a normal research field?
The main question I'm interested in about agent foundations at the moment is whether it should continue in its idiosyncratic current form, or whether it should start to look more like an ordinary academic field.
I'm also interested in discussing theories of change, to the extent it has bearing on the other question.
Why agent foundations?
My own reasoning for foundational work on agency being a potentially fruitful direction for alignment research is:
Most misalignment threat models are about agents pursuing goals that we'd prefer they didn't pursue (I think this is not controversial)
Existing formalisms about agency don't seem all that useful for understanding or avoiding those threats (again probably not that controversial)
Developing new and more useful ones seems tractable (this is probably more controversial)
The main reason I think it might be tractable is that so far not that many person-hours have gone into trying to do it. A priori it seems like the sort of thing you can get a nice mathematical formalism for, and so far I don't think that we've collected much evidence that you can't.
So I think I'd like to get a large number of people with various different areas of expertise thinking about it, and I'd hope that some small fraction of them discovered something fundamentally important. And a key question is whether the way the field currently works is conducive to that.
Does it need a new name?
Does Agent Foundations-in-the-broad-sense need a new name?
Is the name 'Agent Foundations' cursed?
Suggestions I've heard are
'What are minds', 'what are agents', 'mathematical alignment', 'Agent Mechanics'.
Epistemic Pluralism and Path to Impact
Some thought snippets:
(1) Clarifying and creating common knowledge about the scope of Agent Foundations and strengthening epistemic pluralism
I think it's important for the endeavors of meaningfully improving our understanding of such fundamental phenomena as agency, intelligent behavior, etc. that one has a relatively pluralistic portfolio of angles on it. The world is very detailed, phenomena like agency/intelligent behavior/etc. seem like maybe particularly "messy"/detailed phenomena. Insofar ...

Nov 30, 2023 • 12min
LW - Scaling laws for dominant assurance contracts by jessicata
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Scaling laws for dominant assurance contracts, published by jessicata on November 30, 2023 on LessWrong.
(note: this post is high in economics math, probably of narrow interest)
Dominant assurance contracts are a mechanism proposed by Alex Tabarrok for funding public goods. The following summarizes a 2012 class paper of mine on dominant assurance contracts. Mainly, I will be determining how much money a dominant assurance contract can raise as a function of how much value is created for how many parties, under uncertainty about how much different parties value the public good. Briefly, the conclusion is that, while Tabarrok asserts that the entrepreneur's profit is proportional to the number of consumers under some assumptions, I find it is proportional to the square root of the number of consumers under these same assumptions.
The basic idea of assurance contracts is easy to explain. Suppose there are N people ("consumers") who would each benefit by more than $S > 0 from a given public good being created, e.g. a park or a piece of public domain music (note that we are assuming linear utility in money, which is approximately true on the margin, but can't be true at limits). An entrepreneur who is considering creating the public good can then make an offer to these consumers. They say, everyone has the option of signing a contract; this contract states that, if each other consumer signs the contract, then every consumer pays $S, and the entrepreneur creates the public good, which presumably costs no more than $NS to build (so the entrepreneur does not take a loss).
Under these assumptions, there is a Nash equilibrium of the game, in which each consumer signs the contract. To show this is a Nash equilibrium, consider whether a single consumer would benefit by unilaterally deciding not to sign the contract in a case where everyone else signs it. They would save $S by not signing the contract. However, since they don't sign the contract, the public good will not be created, and so they will lose over $S of value.
Therefore, everyone signing is a Nash equilibrium. Everyone can rationally believe themselves to be pivotal: the good is created if and only if they sign the contract, creating a strong incentive to sign.
Tabarrok seeks to solve the problem that, while this is a Nash equilibrium, signing the contract is not a dominant strategy. A dominant strategy is one where one would benefit by choosing that strategy (signing or not signing) regardless of what strategy everyone else takes. Even if it would be best for everyone if everyone signed, signing won't make a difference if at least one other person doesn't sign.
Tabarrok solves this by setting a failure payment $F > 0, and modifying the contract so that if the public good is not created, the entrepreneur pays every consumer who signed the contract $F. This requires the entrepreneur to take on risk, although that risk may be small if consumers have a sufficient incentive for signing the contract.
Here's the argument that signing the contract is a dominant strategy for each consumer. Pick out a single consumer and suppose everyone else signs the contract. Then the remaining consumer benefits by signing, by the previous logic (the failure payment is irrelevant, since the public good is created whenever the remaining consumer signs the contract).
Now consider a case where not everyone else signs the contract. Then by signing the contract, the remaining consumer gains $F, since the public good is not created. If they don't sign the contract, they get nothing and the public good is still not created. This is still better for them. Therefore, signing the contract is a dominant strategy.
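To make the dominance argument concrete, here is a minimal payoff sketch in Python (toy numbers chosen for illustration, not from the paper):

```python
# Toy parameters: each consumer values the good at V, pays S if it is created,
# and receives the failure payment F if they signed but it is not created.
V, S, F = 10.0, 6.0, 1.0  # requires V > S > 0 and F > 0

def consumer_payoff(i_sign: bool, all_others_sign: bool) -> float:
    good_created = i_sign and all_others_sign  # creation requires every signature
    if good_created:
        return V - S   # enjoy the good, pay the price
    if i_sign:
        return F       # contract failed: the entrepreneur pays signers $F
    return 0.0         # didn't sign; the good is not created either way

# Signing yields a strictly higher payoff whatever the others do:
assert consumer_payoff(True, True) > consumer_payoff(False, True)    # V - S > 0
assert consumer_payoff(True, False) > consumer_payoff(False, False)  # F > 0
```

With V > S and F > 0, both assertions hold, matching the two cases in the argument above.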
What if there is uncertainty about how much the different consumers value the public good? This can be modeled as a Bayesi...


