The Nonlinear Library

The Nonlinear Fund
Mar 8, 2024 • 3min

AF - Scenario Forecasting Workshop: Materials and Learnings by elifland

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Scenario Forecasting Workshop: Materials and Learnings, published by elifland on March 8, 2024 on The AI Alignment Forum. Disclaimer: While some participants and organizers of this exercise work in industry, no proprietary info was used to inform these scenarios, and they represent the views of their individual authors alone. Overview In the vein of What 2026 Looks Like and AI Timelines discussion, we recently hosted a scenario forecasting workshop. Participants first wrote a 5-stage scenario forecasting what will happen between now and ASI. Then, they reviewed, discussed, and revised scenarios in groups of 3. The discussion was guided by forecasts like "If I were to observe this person's scenario through stage X, what would my ASI timelines median be?". Instructions for running the workshop, including notes on what we would do differently, are available here. We've put 6 shared scenarios from our workshop in a publicly viewable folder here. Motivation Writing scenarios may help to: Clarify views, e.g. by realizing an abstract view is hard to concretize, or realizing that two views you hold don't seem very compatible. Surface new considerations, e.g. realizing a subquestion is more important than you thought, or that an actor might behave in a way you hadn't considered. Communicate views to others, e.g. clarifying what you mean by "AGI", "slow takeoff", or the singularity. Register qualitative forecasts, which can then be compared against reality. This has advantages and disadvantages vs. more resolvable forecasts (though scenarios can include some resolvable forecasts as well!). Running the workshop Materials and instructions for running the workshop, including notes on what we would do differently, are available here. The schedule for the workshop looked like: Session 1 involved writing a 5-stage scenario forecasting what will happen between now and ASI. Session 2 involved reviewing, discussing, and revising scenarios in groups of 3. The discussion was guided by forecasts like "If I were to observe this person's scenario through stage X, what would my ASI timelines median be?". There were analogous questions for p(disempowerment) and p(good future). Session 3 was freeform discussion and revision within groups, then there was a brief session for feedback. Workshop outputs and learnings 6 people (3 anonymous, 3 named) have agreed to share their scenarios. We've put them in a publicly viewable folder here. We received overall positive feedback, with nearly all 23 people who filled out the feedback survey saying it was a good use of their time. In general, people found the writing portion more valuable than the discussion. We've included some ideas on how to improve future similar workshops, based on this and a few other pieces of feedback, in our instructions for organizers. It's possible that a workshop that is much more focused on the writing relative to the discussion would be more valuable. Speaking for myself (as Eli), I think it was mostly valuable as a forcing function to get people to do an activity they had wanted to do anyway. And scenario writing seems like a good thing for people to spend marginal time on (especially if they find it fun/energizing). It seems worthwhile to experiment with the format (in the ways we suggest above, or other ways people are excited about).
It feels like there might be something nearby that is substantially more valuable than our initial pilot. Thanks for listening. To help us out with The Nonlinear Library or to learn more, please visit nonlinear.org.
Mar 8, 2024 • 3min

AF - Forecasting future gains due to post-training enhancements by elifland

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Forecasting future gains due to post-training enhancements, published by elifland on March 8, 2024 on The AI Alignment Forum. This work has been done in the context of SaferAI's work on risk assessment. Equal contribution by Eli and Joel. I'm sharing this writeup in the form of a Google Doc and reproducing the summary below. Disclaimer: this writeup is context for upcoming experiments, not complete work. As such it contains a lot of (not always well-justified) guess-work and untidy conceptual choices. We are publishing now despite this to get feedback. If you are interested in this work - perhaps as a future collaborator or funder, or because this work could provide helpful input into e.g. risk assessments or RSPs - please get in touch with us at joel@qallys.com and/or simeon@safer-ai.org. Summary A recent report documented how the performance of AI models can be improved after training, via post-training enhancements (PTEs) such as external tools, scaffolding, and fine-tuning. The gain from a PTE is measured in compute-equivalent gains (CEG): the multiplier on training compute required to achieve equivalent performance to a model combined with a PTE. We are interested in understanding the contribution that PTEs make to AI system capabilities over time. This question in turn is motivated by SaferAI's work on quantitative risk assessments of frontier models. In particular, any risk assessment of open-sourcing models or of having closed-source models stolen or leaked should take into account system capabilities, which we might expect to increase over time as PTEs are added to the system built on top of a given base model. We extend a recent analysis of PTEs in order to understand the trend in CEG over time. There are serious limitations in our preliminary analysis, including: problems with the CEG metric, many uninformed parameter estimates, and reliance on an ill-defined "average task". High-priority future work includes running experiments to get more evidence on important uncertainties for our forecasts of capability gains due to PTEs. In particular, we think it will be important to understand how well different PTEs combine, as well as to directly study performance on benchmarks relevant to dangerous capabilities rather than relying on the CEG and average task abstractions. In this write-up, we will: Outline our methodology. (More.) Present CEG estimates for various PTEs. (More.) Aggregate total CEG, using subjective estimates of 'composability.' (More.) Note limitations of our analysis and important future work. (More.) Thanks for listening. To help us out with The Nonlinear Library or to learn more, please visit nonlinear.org.
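As a side note on the arithmetic of the CEG metric defined in the summary above: since a CEG is a multiplier on training compute, perfectly composable PTEs would simply multiply their CEGs together. Below is a toy Python sketch of one way to fold in subjective composability estimates, by discounting each factor in log space. This is my own illustration, not the write-up's actual aggregation method, and every number in it is made up.

```python
import math

# Made-up CEG estimates for a few post-training enhancements (illustrative only):
ceg = {"tools": 3.0, "scaffolding": 2.0, "fine-tuning": 4.0}
# Made-up composability weights in [0, 1]: 1 = composes fully with the rest,
# 0 = contributes nothing once the other PTEs are in place.
composability = {"tools": 1.0, "scaffolding": 0.7, "fine-tuning": 0.5}

# Discount each CEG in log space by its composability weight, then recombine.
total_ceg = math.exp(sum(composability[p] * math.log(g) for p, g in ceg.items()))
print(round(total_ceg, 2))  # -> 9.75x here, vs. 3 * 2 * 4 = 24x if fully composable
```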
Mar 8, 2024 • 9min

LW - Woods' new preprint on object permanence by Steven Byrnes

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Woods' new preprint on object permanence, published by Steven Byrnes on March 8, 2024 on LessWrong. Quick poorly-researched post, probably only of interest to neuroscientists. The experiment Justin Wood at Indiana University has, over many years with great effort, developed a system for raising baby chicks such that all the light hitting their retina is experimentally controlled right from when they're an embryo - the chicks are incubated and hatched in darkness, then moved to a room with video screens, head-tracking and so on. For a much better description of how this works and how he got into this line of work, check out his recent appearance on the Brain Inspired podcast. He and collaborators posted a new paper last week: "Object permanence in newborn chicks is robust against opposing evidence" by Wood, Ullman, Wood, Spelke, and Wood. I just read it today. It's really cool! In their paper, they are using the system above to study "object permanence", the idea that things don't disappear when they go out of sight behind an occluder. The headline result is that baby chicks continue to act as if object permanence is true, even if they have seen thousands of examples where it is false and zero where it is true over the course of their short lives. They describe two main experiments. Experiment 1 is the warmup, and Experiment 2 is the headline result I just mentioned. In experiment 1, the chicks are raised in a VR visual world where they never see anything occlude anything, ever. They only see one virtual object move around an otherwise-empty virtual room. The chicks of course imprint on the object. This phase lasts 4 days. Then we move into the test phase. The test initializes when the chick moves towards the virtual object, which starts in the center of the room. Two virtual opaque screens appear on the sides of the room. In the easier variant of the test, the object moves behind one of the screens, and then nothing else happens for a few minutes. The experimenters measure which screen the chick looks at more. The result: all 8 chicks looked more-than-chance at the screen that the virtual object would be behind than at the other screen, at least for the first 30 seconds or so after the object disappeared from view. In the harder variant, one of the screens moves to the object, occludes the object, then moves back to its starting point. Again, the experimenters measure which screen the chick looks at more. Here, 7 of the 8 chicks looked more-than-chance towards the screen that the virtual object would be behind, at least for 15ish seconds. Moving on to experiment 2, the test phase was the same as the easier variant above - the object moved to behind one of the two opaque virtual screens on the sides. But the preceding 4-day training phase was different for these chicks: instead of never seeing any occlusion events, they witnessed thousands of occlusion events, where the object would go behind a virtual opaque screen, and then after a variable amount of time (0-20 seconds), the screens would lower to reveal that the object was where we might expect (for the "natural world" chicks), or had magically teleported to behind the "wrong" screen (the "unnatural world" chicks). (There was no randomization - each chick lived its whole training-phase in either the natural or unnatural world.)
Remarkably, all four chicks in the "natural world" and all four chicks in the "unnatural world" spent more time looking at the screen that the object had disappeared behind, rather than the other one, more than chance, at least for the first 15-30 seconds. In fact, remarkably, there was no difference between the natural-world and unnatural-world chicks! How do we make sense of these results? It's always worth asking: maybe the experiment is garbage? I'm far from an expert, but the methodol...
Mar 8, 2024 • 1h 20min

LW - AI #54: Clauding Along by Zvi

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: AI #54: Clauding Along, published by Zvi on March 8, 2024 on LessWrong. The big news this week was of course the release of Claude 3.0 Opus, likely in some ways the best available model right now. Anthropic now has a highly impressive model, impressive enough that it seems as if it breaks at least the spirit of their past commitments on how far they will push the frontier. We will learn more about its ultimate full capabilities over time. We also got quite the conversation about big questions of one's role in events, which I immortalized as Read the Roon. Since publication Roon has responded, which I have edited into the post along with some additional notes. That still leaves plenty of fun for the full roundup. We have spies. We have accusations of covert racism. We have Elon Musk suing OpenAI. We have a new summary of simulator theory. We have NIST, tasked with AI regulation, literally struggling to keep a roof over their head. And more.

Table of Contents

Introduction.
Table of Contents.
Language Models Offer Mundane Utility. Predict the future.
Language Models Don't Offer Mundane Utility. Provide basic info.
LLMs: How Do They Work? Emmett Shear rederives simulators, summarizes.
Copyright Confrontation. China finds a copyright violation. Curious.
Oh Elon. He sues OpenAI to… force it to change its name? Kind of, yeah.
DNA Is All You Need. Was I not sufficiently impressed with Evo last week?
GPT-4 Real This Time. A question of intelligence.
Fun With Image Generation. Be careful not to have too much fun.
Deepfaketown and Botpocalypse Soon. This will not give you a hand.
They Took Our Jobs. They gave us a few back. For now, at least.
Get Involved. Davidad will have direct reports, it could be you.
Introducing. An AI-based RPG will never work, until one does.
In Other AI News. The fallout continues, also other stuff.
More on Self-Awareness. Not the main thing to worry about.
Racism Remains a Problem for LLMs. Covert is a generous word for this.
Project Maven. Yes, we are putting the AIs in charge of weapon targeting.
Quiet Speculations. Claimed portents of various forms of doom.
The Quest for Sane Regulation. NIST might need a little help.
The Week in Audio. Sergey Brin Q&A.
Rhetorical Innovation. It is not progress. We still keep trying.
Another Open Letter. Also not really progress. We still keep trying.
Aligning a Smarter Than Human Intelligence is Difficult. Recent roundup.
Security is Also Difficult. This too is not so covert, it turns out.
The Lighter Side. It's me, would you like a fries with that?

Language Models Offer Mundane Utility

Forecast almost as well, or sometimes better, than the wisdom of crowds using GPT-4? Paper says yes. Prompt they used is here. This does require an intensive process. First, we generate search queries that are used to invoke news APIs to retrieve historical articles. We initially implement a straightforward query expansion prompt (Figure 12a), instructing the model to create queries based on the question and its background. However, we find that this overlooks sub-considerations that often contribute to accurate forecasting. To achieve broader coverage, we prompt the model to decompose the forecasting question into sub-questions and use each to generate a search query (Min et al., 2019); see Figure 12b for the prompt.
For instance, when forecasting election outcomes, the first approach searches directly for polling data, while the latter creates sub-questions that cover campaign finances, economic indicators, and geopolitical events. We combine both approaches for comprehensive coverage. Next, the system retrieves articles from news APIs using the LM-generated search queries. We evaluate 5 APIs on the relevance of the articles retrieved and select NewsCatcher and Google News (Section E.2). Our initial retrieval provides wide covera...
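The retrieval recipe quoted above (decompose the question into sub-questions, generate one search query per sub-question, then hit news APIs) is easy to picture in code. Here is a minimal Python sketch under stated assumptions: llm and search_news are hypothetical stubs standing in for a language-model call and a news-API client; this is my illustration of the described pipeline, not the paper's actual code.

```python
from typing import List

def llm(prompt: str) -> str:
    """Hypothetical stand-in for a language-model completion call."""
    raise NotImplementedError

def search_news(query: str) -> List[str]:
    """Hypothetical stand-in for a news-API article search (e.g. NewsCatcher)."""
    raise NotImplementedError

def retrieve_articles(question: str, background: str) -> List[str]:
    # Decompose the forecasting question: sub-questions surface sub-considerations
    # (e.g. campaign finances or economic indicators for an election question).
    subqs = llm("Decompose this forecasting question into sub-questions, "
                f"one per line.\nQuestion: {question}\nBackground: {background}"
                ).splitlines()
    # One search query per sub-question, plus a direct query on the question
    # itself; the paper combines both approaches for comprehensive coverage.
    queries = [llm(f"Write a news search query for: {q}") for q in [question, *subqs]]
    # Retrieve candidate articles for every query.
    return [article for q in queries for article in search_news(q)]
```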
Mar 8, 2024 • 27min

LW - MATS AI Safety Strategy Curriculum by Ryan Kidd

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: MATS AI Safety Strategy Curriculum, published by Ryan Kidd on March 8, 2024 on LessWrong. As part of the MATS Winter 2023-24 Program, scholars were invited to take part in a series of weekly discussion groups on AI safety strategy. Each strategy discussion focused on a specific crux we deemed relevant to prioritizing AI safety interventions and was accompanied by a reading list and suggested discussion questions. The discussion groups were facilitated by several MATS alumni and other AI safety community members and generally ran for 1-1.5 hours. As assessed by our alumni reviewers, scholars in our Summer 2023 Program were much better at writing concrete plans for their research than they were at explaining their research's theory of change. We think it is generally important for researchers, even those early in their career, to critically evaluate the impact of their work, to: Choose high-impact research directions and career pathways; Conduct adequate risk analyses to mitigate unnecessary safety hazards and avoid research with a poor safety-capabilities advancement ratio; Discover blindspots and biases in their research strategy. We expect that the majority of improvements to the above areas occur through repeated practice, ideally with high-quality feedback from a mentor or research peers. However, we also think that engaging with some core literature and discussing with peers is beneficial. This is our attempt to create a list of core literature for AI safety strategy appropriate for the average MATS scholar, who should have completed the AISF Alignment Course. We are not confident that the reading lists and discussion questions below are the best possible version of this project, but we thought they were worth publishing anyways. MATS welcomes feedback and suggestions for improvement.

Week 1: How will AGI arise?

What is AGI?
Karnofsky - Forecasting Transformative AI, Part 1: What Kind of AI? (13 min)
Metaculus - When will the first general AI system be devised, tested, and publicly announced? (read Resolution Criteria) (5 min)

How large will models need to be and when will they be that large?
Alexander - Biological Anchors: The Trick that Might or Might Not Work (read Parts I-II) (27 min)
Optional: Davidson - What a compute-centric framework says about AI takeoff speeds (20 min)
Optional: Habryka et al. - AI Timelines (dialogue between Ajeya Cotra, Daniel Kokotajlo, and Ege Erdil) (61 min)
Optional: Halperin, Chow, Mazlish - AGI and the EMH: markets are not expecting aligned or unaligned AI in the next 30 years (31 min)

How far can current architectures scale?
Patel - Will Scaling Work? (16 min)
Epoch - AI Trends (5 min)
Optional: Nostalgebraist - Chinchilla's Wild Implications (13 min)
Optional: Porby - Why I think strong general AI is coming soon (40 min)

What observations might make us update?
Ngo - Clarifying and predicting AGI (5 min)
Optional: Berglund et al. - Taken out of context: On measuring situational awareness in LLMs (33 min)
Optional: Cremer, Whittlestone - Artificial Canaries: Early Warning Signs for Anticipatory and Democratic Governance of AI (34 min)

Suggested discussion questions

If you look at any of the outside view models linked in "Biological Anchors: The Trick that Might or Might Not Work" (e.g., Ajeya Cotra's and Tom Davidson's models), which of their quantitative estimates do you agree or disagree with?
Do your disagreements make your timelines longer or shorter? Do you disagree with the models used to forecast AGI? That is, rather than disagree with their estimates of particular variables, do you disagree with any more fundamental assumptions of the model? How does that change your timelines, if at all? If you had to make a probabilistic model to forecast AGI, what quantitative variables would you use and what fundamental assumptions would ...
Mar 7, 2024 • 5min

LW - Simple Kelly betting in prediction markets by jessicata

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Simple Kelly betting in prediction markets, published by jessicata on March 7, 2024 on LessWrong. Kelly betting is a strategy for gambling, maximizing one's log(money) every round, by betting a fixed fraction of one's wealth. I will define Kelly betting for a certain class of discrete prediction markets, give a simple Kelly betting rule for these prediction markets, and show it equivalent to the original Kelly formula in a two-outcome case. A prediction market consists of a finite set of outcomes O, and a probability measure Q(O) on these outcomes. Participants may buy, for some outcome o, a contract that pays out $1 if o comes true, for a price of $Q(o). This assumes no transaction fees. Suppose you have m money. You are going to spend all your money on these contracts, with R being a probability measure over O, and R(o) being the portion of money you spend on each type of contract. Note that you can buy some of each contract as an equivalent to holding on to money (e.g. to "hold on" to $2, buy 2 copies of each contract o, costing $2 in total; these contracts combined will always pay out $2). This means it's fine to assume that spending all your money on contracts doesn't compromise optimality. If your subjective probabilities of the outcomes are defined by a probability measure P(O), what is the optimal R(O) that maximizes your log-money at the end of this round? Your money conditional on outcome o is mR(o)/Q(o), since you are spending mR(o) on contracts costing Q(o) each. Therefore your expected log-money is: f(R) := Σ_{o∈O} P(o) log(mR(o)/Q(o)) = Σ_{o∈O} P(o)(log m + log R(o) - log Q(o)). Note that the log m and log Q(o) terms do not depend on R. We can therefore ignore these terms when taking the partial derivatives with respect to each R(o): ∂f(R)/∂R(o) = ∂(P(o) log R(o))/∂R(o) = P(o)/R(o). If any of these partial derivatives are greater than any other, then expected log-money can be increased by moving a small amount of money from the outcome with the lower partial derivative to the one with the higher partial derivative (since f is continuous). Therefore, at the maximum of f, these partial derivatives all equal some constant c, i.e., P(o)/R(o) = c for some c. (Formally proving this might require some additional work, using the fact that f is concave and R(o) has to be positive whenever P(o) is positive; I'll omit this for brevity.) Equivalently, R(o) = P(o)/c. But this must imply c = 1, since R and P are both probability measures; any other c value would result in R not summing to 1. This implies R = P. What this means is that the optimal Kelly betting strategy involves spending a P(o) portion of your money on contracts paying out conditional on each outcome o. Interestingly, this is entirely independent of Q. This can also be seen by noticing that Q only contributes additive terms to f that do not depend on R, such that the gradient does not depend on Q. Is this equivalent to the original Kelly rule in a two-outcome case? This rule is given by: f* = p - (1 - p)/b, where f* is the optimal portion of your money to bet, p is the probability of a win, and b is the ratio between how much is gained on a win versus how much is lost on a loss (e.g. on a triple-or-nothing coin toss, b = 2, because twice as much is gained on a win as is lost on a loss). We can set O = {w, l} (w is win, l is loss) and determine Q as a function of b.
Specifically, we set Q(w) = 1/(b+1) and Q(l) = 1 - 1/(b+1) = b/(b+1). These are the implied house odds for b. If you spend x money on contracts paying out conditional on w, these contracts pay out x(b+1), corresponding to a net gain of xb money, whereas if you lose you simply lose x money; this therefore adequately translates b to a prediction market. Our rule says to spend a P(w) = p portion of your money on w contracts, and a 1 - p portion of your money on l contracts. Suppose your starting money is m. If you win, your e...
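As a quick sanity check of the two claims above (the optimum at R = P, and the equivalence with the classical Kelly fraction), here is a small Python sketch. The numbers p = 0.6 and b = 2 are illustrative choices of mine, not from the post.

```python
import math

def expected_log_money(P, Q, R, m=1.0):
    """Expected log-wealth when a fraction R[o] of wealth m buys contracts on
    outcome o at price Q[o]; each contract pays $1 if o occurs."""
    return sum(p * math.log(m * R[o] / Q[o]) for o, p in P.items() if p > 0)

p, b = 0.6, 2.0                               # win probability, net odds
P = {"w": p, "l": 1 - p}                      # subjective probabilities
Q = {"w": 1 / (b + 1), "l": b / (b + 1)}      # implied house odds, as in the post

# A grid search confirms expected log-wealth is maximized at R = P, whatever Q is.
grid = [i / 1000 for i in range(1, 1000)]
best_rw = max(grid, key=lambda rw: expected_log_money(P, Q, {"w": rw, "l": 1 - rw}))
print(best_rw)                                # -> 0.6, i.e. R(w) = P(w)

# Equivalence with the classical Kelly fraction f* = p - (1 - p)/b:
f_star = p - (1 - p) / b
win_market, loss_market = p / Q["w"], (1 - p) / Q["l"]  # wealth after R = P
win_kelly, loss_kelly = 1 + f_star * b, 1 - f_star      # wealth after betting f*
assert math.isclose(win_market, win_kelly) and math.isclose(loss_market, loss_kelly)
```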
Mar 7, 2024 • 4min

LW - Mud and Despair (Part 4 of "The Sense Of Physical Necessity") by LoganStrohl

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Mud and Despair (Part 4 of "The Sense Of Physical Necessity"), published by LoganStrohl on March 7, 2024 on LessWrong. This is the fourth post in a sequence that demonstrates a complete naturalist study, specifically a study of query hugging (sort of), as described in The Nuts and Bolts of Naturalism. For context on this sequence, see the intro post. "Mud and Despair" is not officially one of the phases of naturalism. Unofficially, though, it's the phase that often happens somewhere between "Getting Your Eyes On" and "Collection". When I look back at my notes from this part of my study (roughly mid-September), I am somewhat bewildered. From my current perspective, it seems as though things were exactly on track. I was making excellent progress, focusing ever more closely on the precise experiences that can lead to mastery of the skills that underlie "hug the query". My study was really taking off. And yet, I just felt so lost. I wasn't convinced I was studying anything real, anything that actually existed. I thought that perhaps I had "made it all up", and now the sham was falling apart in my hands. And so, on September 25th, I gave up. "I should study something else right now," claims my log, "and perhaps come back to this after I've remembered how it's supposed to go." A year previously, in "Getting Your Eyes On", I predicted this exact experience. I wrote about it after watching others go through the very same thing, after watching myself go through this over and over again. It's very common, in this stage, to feel a lot of doubt and confusion about what you're trying to study. (...) People sometimes respond to this kind of deep confusion with despair. They don't like feeling more lost than when they started. But in fact, it is usually an excellent sign to feel deeply confused at this point, and here is why. Naturalism is especially likely to be the right approach when you're not exactly wrong about the truth value of some proposition, so much as not even wrong. It's especially useful when you are thinking about things from the wrong direction, asking the wrong questions, using concepts that do not or cannot match the territory. When you're beginning from a place of not even wrong, you will likely find, in your first moments of direct observation, that you cannot make sense of what you are seeing. Why? Because the sense you are accustomed to making is not the sense that the actual world makes. When you look directly for the first time and do not understand what you see, it means that you may well be actually looking instead of just making things up. In this phase, things that seemed obvious and straightforward before often become perplexing. The most useful responses to this are curiosity and patience. If you stick it out, if you just keep observing through the doubt and confusion, you will begin to form new concepts, and this time they'll develop through intimate contact with the territory. Clarity may come later in the procedure, but things may have to get very muddy first. Surely it's not impossible that feeling lost and confused can mean that your project really is hopeless and you should give up, right? No, it's not impossible. It's just that those signals are not at all reliable indicators.
Due to the concept-dissolving nature of naturalism, indications that it's time to abandon the project are not "confusion", "frustration", or "despair." All of these tend to be good signs in context, and your odds of eventual success depend a lot on your tolerance for these feelings. If you're wondering whether to give up (temporarily or for good), I recommend looking instead for "not caring anymore", "having new priorities", or "having underestimated the scope of your project, and considering the value incommensurate with the true scope". I've experienced all of these at...
Mar 7, 2024 • 27min

AF - Evidential Correlations are Subjective, and it might be a problem by Martín Soto

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Evidential Correlations are Subjective, and it might be a problem, published by Martín Soto on March 7, 2024 on The AI Alignment Forum. I explain (in layman's terms) a realization that might make acausal trade hard or impossible in practice. Summary: We know that if players believe different Evidential Correlations, they might miscoordinate. But clearly they will eventually learn to have the correct Evidential Correlations, right? Not necessarily, because there is no objective notion of correct here (in the way that there is for math or physics). Thus, selection pressures might be much weaker, and different agents might systematically converge on different ways of assigning Evidential Correlations. Epistemic status: Confident that this realization is true, but the quantitative question of exactly how weak the selection pressures are remains open. What are Evidential Correlations, really? Skippable if you know the answer to the question. Alice and Bob are playing a Prisoner's Dilemma, and they know each other's algorithms: Alice.source and Bob.source.[1] Since their algorithms are approximately equally complex, neither of them can easily assess what the other will output. Alice might notice something like "hmm, Bob.source seems to default to Defection when it throws an exception, so this should update me slightly in the direction of Bob Defecting". But she doesn't know exactly how often Bob.source throws an exception, or what it does when that doesn't happen. Imagine, though, that Alice notices Alice.source and Bob.source are pretty similar in some relevant ways (maybe the overall logical structure seems very close, or the depth of the for loops is the same, or she learns the training algorithm that shaped them is the same one). She's still uncertain about what either of these two algorithms outputs[2], but this updates her in the direction of "both algorithms outputting the same action". If Alice implements/endorses Evidential Decision Theory, she will reason as follows: Conditional on Alice.source outputting Defect, it seems very likely Bob.source also outputs Defect, thus my payoff will be low. But conditional on Alice.source outputting Cooperate, it seems very likely Bob.source also outputs Cooperate, thus my payoff will be high. So I (Alice) should output Cooperate, thus (very probably) obtaining a high payoff. To the extent Alice's belief about similarity was justified, it seems like she will perform pretty well in these situations (obtaining high payoffs). When you take this reasoning to the extreme, maybe both Alice and Bob are aware that they both know this kind of cooperation bootstrapping is possible (if they both believe they are similar enough), and thus (even if they are causally disconnected, and just simulating each other's code) they can coordinate on some pretty complex trades. This is Evidential Cooperation in Large worlds. But wait a second: How could this happen, without them being causally connected? What was this mysterious similarity, this spooky correlation at a distance, that allowed them to create cooperation from thin air? Well, in the words of Daniel Kokotajlo: it's just your credences, bro! The bit required for this to work is that they believe that "it is very likely we both output the same thing".
Said another way, they have high probability on the possible worlds "Alice.source = C, Bob.source = C" and "Alice.source = D, Bob.source = D", but low probability on the possible worlds "Alice.source = D, Bob.source = C" and "Alice.source = C, Bob.source = D". This can also be phrased in terms of logical counterfactuals: if Alice.source = C, then it is very likely that Bob.source = C.[3] This is a logical counterfactual: there is, ultimately, a logical fact of the matter about what Alice.source outputs, but since she doesn't know it yet, she entertains what s...
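To make the reasoning above concrete, here is a toy Python sketch of the EDT calculation. The joint credences and the Prisoner's Dilemma payoffs are illustrative numbers of mine, not from the post; the point is just that conditioning on your own action, under a "we probably output the same thing" credence, favors Cooperate.

```python
# Alice's credences over (Alice.source output, Bob.source output):
# high probability on the "same output" worlds, low on the "different" ones.
joint = {("C", "C"): 0.45, ("D", "D"): 0.45,
         ("C", "D"): 0.05, ("D", "C"): 0.05}

# Standard Prisoner's Dilemma payoffs for Alice, keyed by (her action, Bob's):
payoff = {("C", "C"): 3, ("C", "D"): 0, ("D", "C"): 4, ("D", "D"): 1}

def edt_value(a):
    """Alice's expected payoff conditional on her own action being a."""
    p_a = sum(p for (x, _), p in joint.items() if x == a)
    return sum((p / p_a) * payoff[(x, y)] for (x, y), p in joint.items() if x == a)

print({a: round(edt_value(a), 2) for a in ("C", "D")})
# -> {'C': 2.7, 'D': 1.3}: EDT outputs Cooperate. With independent credences
# instead, Defect would dominate; the believed correlation does all the work.
```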
Mar 7, 2024 • 46min

LW - Social status part 1/2: negotiations over object-level preferences by Steven Byrnes

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Social status part 1/2: negotiations over object-level preferences, published by Steven Byrnes on March 7, 2024 on LessWrong. 1.1 Summary & contents This is the first of two blog posts where I try to make sense of the whole universe of social-status-related behaviors and phenomena: This post is focused on a special case of two people interacting, where they have different object-level preferences - maybe one wants to order pizza for dinner while the other wants sushi. This gets us into various topics like "leading and following", averaging different people's utility functions, being more or less "pushy", "ask culture versus guess culture", plausible deniability, politeness arms-races, and more. Then the next post, "Social status part 2/2: everything else", will layer on another heap of complexity on top of all that, related to the fact that people also have preferences related to the interaction itself, like "a preference not to be rude". That gets us into topics like dominance, prestige, getting offended, passive-aggressiveness, status, self-deprecation, and more. Some context for how I came to write this: While I often write about neuroscience and brain algorithms, these two posts have essentially none of that. They're just about systematizing everyday behavior and folk psychology, and I hope they will be generally useful as such. As it happens, my own larger project is to understand the neuroscience underlying social status behaviors (as part of this even larger project related to AI alignment). But I have no hope of figuring out the neuroscience underlying social status behaviors, if I don't understand social status behaviors in the first place. Hence these posts. I previously attempted to talk about social status a couple months ago here. I still think I was pointing towards something important and true in that old post, but it was just one little piece of the puzzle, and I described it very poorly because I was confused about the bigger picture. Anyway, I neither expect nor recommend that you read that; these two posts will hopefully be self-contained. This post is organized as follows: Section 1.2 describes the setting and some basic terminology. In particular, I use the word "negotiation" very broadly to include most everyday interactions, including making plans and decisions as a group, requesting favors, divvying up responsibilities, and even things like taking turns speaking and changing conversation topics. Section 1.3 defines two key terms for this post: "leading" and "following". If two people, Alice & Beth, are interacting, and Alice is "mostly leading" while Beth is "mostly following", that means that, when Alice & Beth have conflicting object-level preferences, the group will make decisions that follow Alice's preferences more than Beth's. I then argue that the idea of "leading" and "following" are equally applicable to both "dominance" and "prestige" interactions (in the terminology of dual strategies theory). Section 1.4 offers a toy model for the dynamic above, where Alice & Beth each has a utility function for their object-level preferences, and the group decisions are based on a weighted average of Alice's and Beth's utilities, and more "leading" simply means that your preferences get more weight in the weighted average. 
Thus, "leading-ness" always sums to 100%: if Alice is "70% leading" within the interaction, then Beth must be "30% leading", and so on. I discuss some insights that we get from this toy model, and also clarify a technical issue related to the incommensurability of different people's desires. Section 1.5 offers another related toy model, where there's an objective scale of "pushiness" - ranging from making strong explicit demands, to subtly hinting at one's own preferences - and where "leading" and "following" correspond respecti...
Mar 7, 2024 • 3min

LW - Movie posters by KatjaGrace

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: Movie posters, published by KatjaGrace on March 7, 2024 on LessWrong. Life involves anticipations. Hopes, dreads, lookings forward. Looking forward and hoping seem pretty nice, but people are often wary of them, because hoping and then having your hopes fold can be miserable to the point of offsetting the original hope's sweetness. Even with very minor hopes: he who has harbored an inchoate desire to eat ice cream all day, coming home to find no ice cream in the freezer, may be more miffed than he who never tasted such hopes. And this problem is made worse by that old fact that reality is just never like how you imagined it. If you fantasize, you can safely bet that whatever the future is is not your fantasy. I have never suffered from any of this enough to put me off hoping and dreaming one noticeable iota, but the gap between high hopes and reality can still hurt. I sometimes like to think about these valenced imaginings of the future in a different way from that which comes naturally. I think of them as 'movie posters'. When you look fondly on a possible future thing, you have an image of it in your mind, and you like the image. The image isn't the real thing. It's its own thing. It's like a movie poster for the real thing. Looking at a movie poster just isn't like watching the movie. Not just because it's shorter - it's just totally different - in style, in content, in being a still image rather than a two-hour video. You can like the movie poster or not totally independently of liking the movie. It's fine to like the movie poster for living in New York and not like the movie. You don't even have to stop liking the poster. It's fine to adore the movie poster for 'marrying Bob' and not want to see the movie. If you thrill at the movie poster for 'starting a startup', it just doesn't tell you much about how the movie will be for you. It doesn't mean you should like it, or that you have to try to do it, or are a failure if you love the movie poster your whole life and never go. (It's like five thousand hours long, after all.) This should happen a lot. A lot of movie posters should look great, and you should decide not to see the movies. A person who looks fondly on the movie poster for 'having children' while being perpetually childless could see themselves as a sad creature reaching in vain for something they may not get. Or they could see themselves as right there with an image that is theirs, that they have and love. And that they can never really have more of, even if they were to see the movie. The poster was evidence about the movie, but there were other considerations, and the movie was a different thing. Perhaps they still then bet their happiness on making it to the movie, or not. But they can make such choices separate from cherishing the poster. This is related to the general point that 'wanting' as an input to your decisions (e.g. 'I feel an urge for x') should be different to 'wanting' as an output (e.g. 'on consideration I'm going to try to get x'). This is obvious in the abstract, but I think people look in their heart to answer the question of what they are on consideration pursuing. Here as in other places, it is important to drive a wedge between them and fit a decision process in there, and not treat one as semi-implying the other.
This is also part of a much more general point: it's useful to be able to observe stuff that happens in your mind without its occurrence auto-committing you to anything. Having a thought doesn't mean you have to believe it. Having a feeling doesn't mean you have to change your values or your behavior. Having a persistent positive sentiment toward an imaginary future doesn't mean you have to choose between pursuing it or counting it as a loss. You are allowed to decide what you are going to do, regardless of what you find...
