The Nonlinear Library

The Nonlinear Fund

The Nonlinear Library allows you to easily listen to top EA and rationalist content on your podcast player. We use text-to-speech software to create an automatically updating repository of audio content from the EA Forum, Alignment Forum, LessWrong, and other EA blogs. To find out more, please visit us at nonlinear.org

Episodes

Mentioned books

Dec 15, 2023 • 7min

EA - My quick thoughts on donating to EA Funds' Global Health and Development Fund and what it should do by Vasco Grilo

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: My quick thoughts on donating to EA Funds' Global Health and Development Fund and what it should do, published by Vasco Grilo on December 15, 2023 on The Effective Altruism Forum. I think there is a strong case for donating to EA Funds' Global Health and Development Fund (GHDF) if one wants to support interventions in global health and development without attending to their effects on animals. On the other hand, given this goal, I believe one had better donate to GiveWell's All Grants Fund (AGF) or unrestricted funds (GWUF), or Giving What We Can's (GWWC's) Global Health and Wellbeing Fund (GHWF). In addition, I encourage GHDF to: Let its donors know that donating to GHDF in its current form has a similar effect to donating to AGF (if that is in fact the case). Consider appointing additional fund managers independent from GiveWell. Consider accepting applications. In any case, the goal of this post is mostly about starting a discussion about the future of GHDF rather than providing super informed takes about it. So feel free to share your thoughts or vision below! Case for donating to GiveWell's All Grants Fund or unrestricted funds Donating to AGF or GWUF instead of GHDF seems better if one highly trusts GiveWell's prioritisation: Donating to GHDF in its current form appears to have the same effect as donating to AGF or GWUF: Like AGF and GWUF, GHDF "aims to improve the health or economic empowerment of people around the world as effectively as possible". My understanding is that GHDF makes more uncertain or riskier grants than GiveWell's Top Charities Fund[1] (TCF), but AGF, launched in August 2022, now makes such grants too. AGF funds: GiveWell's top charities. Organisations implementing potentially cost-effective and scalable programs. Established organisations implementing cost-effective programs that GiveWell does not expect to scale. Organisations aiming to influence public health policy. Organisations producing research to aid our grantmaking process. Organizations that raise funds for our recommended charities. GHDF "is managed by Elie Hassenfeld, GiveWell's co-founder [and CEO]". GHDF does not accept applications, and neither does AGF. People in the United Kingdom can support GiveWell's funds and top charities through tax deductible donations via GiveWell UK, which was launched in August 2022 as AGF. Having EA Funds as an additional intermediary seems unnecessary unless it is doing some extra evaluation, which does not appear to be the case. As a side note, I would also say there is a pretty small difference between which one of GiveWell's funds, TCF, AGF or GWUF, one donates to: Due to funging, more donations to TCF will result in AGF granting less money to GiveWell's top charities. GiveWell arguably has tiny room for more funding given Open Philanthropy's support, so donating to GWUF is similar to donating to AGF[2]. However, if you highly trust GiveWell's prioritisation, donating to GWUF is the best option given its greatest flexibility, followed by the AGF and TCF. Yet, donors may prefer donating to TCF to facilitate explanations of their effective giving (e.g. skipping the need to go into expected value or funging). Case for donating to Giving What We Can's Global Health and Wellbeing Fund Donating to GHWF instead of GHDF seems better if one: Welcomes further evaluation of the process behind the recommendations of GiveWell and other evaluators in the global health and wellbeing space (e.g. Happier Lives Institute), trusts GWWC's research team to identify evaluators to rely on, and wants the evaluations to be published, as in GWWC's evaluations of evaluators. These would be my main reasons for donating to GHWF instead of GHDF, which has not produced public evaluations of GiveWell's recommendations. Is open to donating to funds or organisations not suppo...

Dec 15, 2023 • 13min

AF - Current AIs Provide Nearly No Data Relevant to AGI Alignment by Thane Ruthenis

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Current AIs Provide Nearly No Data Relevant to AGI Alignment, published by Thane Ruthenis on December 15, 2023 on The AI Alignment Forum. Recently, there's been a fair amount of pushback on the "canonical" views towards the difficulty of AGI Alignment (the views I call the "least forgiving" take). Said pushback is based on empirical studies of how the most powerful AIs at our disposal currently work, and is supported by fairly convincing theoretical basis of its own. By comparison, the "canonical" takes are almost purely theoretical. At a glance, not updating away from them in the face of ground-truth empirical evidence is a failure of rationality: entrenched beliefs fortified by rationalizations. I believe this is invalid, and that the two views are much more compatible than might seem. I think the issue lies in the mismatch between their subject matters. It's clearer if you taboo the word "AI": The "canonical" views are concerned with scarily powerful artificial agents: with systems that are human-like in their ability to model the world and take consequentialist actions in it, but inhuman in their processing power and in their value systems. The novel views are concerned with the systems generated by any process broadly encompassed by the current ML training paradigm. It is not at all obvious that they're one and the same. Indeed, I would say that to claim that the two classes of systems overlap is to make a very strong statement regarding how cognition and intelligence work. A statement we do not have much empirical evidence on, but which often gets unknowingly, implicitly snuck-in when people extrapolate findings from LLM studies to superintelligences. It's an easy mistake to make: both things are called "AI", after all. But you wouldn't study manually-written FPS bots circa 2000s, or MNIST-classifier CNNs circa 2010s, and claim that your findings generalize to how LLMs circa 2020s work. By the same token, LLM findings do not necessarily generalize to AGI. What the Fuss Is All About To start off, let's consider where all the concerns about the AGI Omnicide Risk came from in the first place. Consider humans. Some facts: Humans posses an outstanding ability to steer the world towards their goals, and that ability grows sharply with their "intelligence". Sure, there are specific talents, and "idiot savants". But broadly, there does seem to be a single variable that mediates a human's competence in all domains. An IQ 140 human would dramatically outperform an IQ 90 human at basically any cognitive task, and crucially, be much better at achieving their real-life goals. Humans have the ability to plot against and deceive others. That ability grows fast with their g-factor. A brilliant social manipulator can quickly maneuver their way into having power over millions of people, out-plotting and dispatching even those that are actively trying to stop them or compete with them. Human values are complex and fragile, and the process of moral philosophy is more complex still. Humans often arrive at weird conclusions that don't neatly correspond to their innate instincts or basic values. Intricate moral frameworks, weird bullet-biting philosophies, and even essentially-arbitrary ideologies like cults. And when people with different values interact... People who differ in their values even just a bit are often vicious, bitter enemies. Consider the history of heresies, or of long-standing political rifts between factions that are essentially indistinguishable from the outside. People whose cultures evolved in mutual isolation often don't even view each other as human. Consider the history of xenophobia, colonization, culture shocks. So, we have an existence proof of systems able to powerfully steer the world towards their goals. Some of these system can be strictly more powerfu...

Dec 15, 2023 • 5min

LW - "AI Alignment" is a Dangerously Overloaded Term by Roko

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: "AI Alignment" is a Dangerously Overloaded Term, published by Roko on December 15, 2023 on LessWrong. Alignment as Aimability or as Goalcraft? The Less Wrong and AI risk communities have obviously had a huge role in mainstreaming the concept of risks from artificial intelligence, but we have a serious terminology problem. The term "AI Alignment" has become popular, but people cannot agree whether it means something like making "Good" AI or whether it means something like making "Aimable" AI. We can define the terms as follows: AI Aimability = Create AI systems that will do what the creator/developer/owner/user intends them to do, whether or not that thing is good or bad AI Goalcraft = Create goals for AI systems that we ultimately think lead to the best outcomes Aimability is a relatively well-defined technical problem and in practice almost all of the technical work on AI Alignment is actually work on AI Aimability. Less Wrong has for a long time been concerned with Aimability failures (what Yudkowsky in the early days would have called "Technical Failures of Friendly AI") rather than failures of Goalcraft (old-school MIRI terminology would be "Friendliness Content"). The problem is that as the term "AI Alignment" has gained popularity, people have started to completely merge the definitions of Aimability and Goalcraft under the term "Alignment". I recently ran some Twitter polls on this subject, and it seems that people are relatively evenly split between the two definitions. This is a relatively bad state of affairs. We should not have the fate of the universe partially determined by how people interpret an ambiguous word. In particular, the way we are using the term AI Alignment right now means that it's hard to solve the AI Goalcraft problem and easy to solve the Aimability problem, because there is a part of AI that is distinct from Aimability which the current terminology doesn't have a word for. Not having a word for what goals to give the most powerful AI system in the universe is certainly a problem, and it means that everyone will be attracted to the easier Aimability research where one can quickly get stuck in and show a concrete improvement on a metric and publish a paper. Why doesn't the Less Wrong / AI risk community have good terminology for the right hand side of the diagram? Well, this (I think) goes back to a decision by Eliezer from the SL4 mailing list days that one should not discuss what the world would be like after the singularity, because a lot of time would be wasted arguing about politics, instead of the then more urgent problem of solving the AI Aimability problem (which was then called the control problem). At the time this decision was probably correct, but times have changed. There are now quite a few people working on Aimability, and far more are surely to come, and it also seems quite likely (though not certain) that Eliezer was wrong about how hard Aimability/Control actually is. Words Have Consequences This decision to not talk about AI goals or content might eventually result in some unscrupulous actors getting to define the actual content and goals of superintelligence, cutting the X-risk and LW community out of the only part of the AI saga that actually matters in the end. For example, the recent popularity of the e/acc movement has been associated with the Landian strain of AI goal content - acceleration towards a deliberate and final extermination of humanity, in order to appease the Thermodynamic God. And the field that calls itself AI Ethics has been tainted with extremist far-left ideology around DIE (Diversity, Inclusion and Equity) that is perhaps even more frightening than the Landian Accelerationist strain. By not having mainstream terminology for AI goals and content, we may cede the future of the universe to extremis...

Dec 15, 2023 • 3min

EA - Announcing Surveys on Community Health, Causes, and Harassment by David Moss

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Announcing Surveys on Community Health, Causes, and Harassment, published by David Moss on December 15, 2023 on The Effective Altruism Forum. We are announcing a supplementary survey to gather timely information from the EA community before the next EA Survey in 2024. This survey will contain questions related to: Community health and satisfaction with the EA community Cause prioritization and how EA resources should be allocated Demographics (which can optionally be skipped if you provided your email address last time and opt for us to link your responses) We are also sending out a separate survey, requested by CEA's Community Health and Special Projects team, focusing primarily on sexual harassment and gender-related experiences: 4. EA Climate and Harassment Survey You can take the first survey here. This will give you the option to take the Climate and Harassment Survey immediately afterwards, without having to answer the demographic questions twice. Alternatively, you can just take the Climate and Harassment survey here. If you wish to share links to either of these surveys with others, please use the following links: Both surveys: https://rethinkpriorities.qualtrics.com/jfe/form/SV_1G37guBPVAl9TtI?source=sharing Climate and Harassment Survey alone: https://rethinkpriorities.qualtrics.com/jfe/form/SV_bxD0wtmuuXw4KUe?source=sharing The first survey should be significantly shorter than the main EA Survey, depending on how much detail you choose to provide in the open comment questions and whether you skip the demographic section by providing your email address. The EA Climate and Harassment Survey is estimated to take between 5 and 30 minutes depending on how much detail you choose to provide. Both surveys are planned to close on 1st January 2024. Acknowledgements The post is a project of Rethink Priorities, a global priority think-and-do tank, aiming to do good at scale. We research and implement pressing opportunities to make the world better. We act upon these opportunities by developing and implementing strategies, projects, and solutions to key issues. We do this work in close partnership with foundations and impact-focused non-profits or other entities. If you're interested in Rethink Priorities' work, please consider subscribing to our newsletter. You can explore our completed public work here. Thanks for listening. To help us out with The Nonlinear Library or to learn more, please visit nonlinear.org

Dec 15, 2023 • 12min

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app

The Nonlinear Library

Episodes

Mentioned books

EA - My quick thoughts on donating to EA Funds' Global Health and Development Fund and what it should do by Vasco Grilo

AF - Current AIs Provide Nearly No Data Relevant to AGI Alignment by Thane Ruthenis

LW - "AI Alignment" is a Dangerously Overloaded Term by Roko

EA - Announcing Surveys on Community Health, Causes, and Harassment by David Moss

LW - EU policymakers reach an agreement on the AI Act by tlevin

LW - Some for-profit AI alignment org ideas by Eric Ho

LW - Love, Reverence, and Life by Elizabeth

EA - On-Ramps for Biosecurity - A Model by Sofya Lebedeva

LW - Bayesian Injustice by Kevin Dorst

EA - Risk Aversion in Wild Animal Welfare by Rethink Priorities

The AI-powered Podcast Player