LessWrong (30+ Karma)

LessWrong

Audio narrations of LessWrong posts.

Episodes

Mentioned books

Mar 18, 2026 • 7min

“PSA: Predictions markets often have very low liquidity; be careful citing them.” by Eye You

I see people repeatedly make the mistake of referencing a very low liquidity prediction market and using it to make a nontrivial point. Usually the implication when a market is cited is that it's number should be taken somewhat seriously, that it's giving us a highly informed probability. Sometimes a market is used to analyze some event that recently occurred; reasoning here looks like "the market on outcome O was trading at X%, then event E happened and the market quickly moved to Y%, thus event E made O less/more likely." Who do I see make this mistake? Rationalists, both casually and gasp in blog posts. Scott Alexander and Zvi (and I really appreciate their work, seriously!) are guilty of this. I'll give a recent example from each of them. From Scott's Mantic Monday post on March 2: Having Your Own Government Try To Destroy You Is (At Least Temporarily) Good For Business On Friday, the Pentagon declared AI company Anthropic a “supply chain risk”, a designation never before given to an American firm. This unprecedented move was seen as an attempt to punish, maybe destroy the company. How effective was it? Anthropic isn’t publicly traded, so we [...] --- First published: March 16th, 2026 Source: https://www.lesswrong.com/posts/SrtoF6PcbHpzcT82T/psa-predictions-markets-often-have-very-low-liquidity-be --- Narrated by TYPE III AUDIO. ---Images from the article:Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

Mar 16, 2026 • 8min

“AICRAFT: DARPA-Funded AI Alignment Researchers — Applications Open” by Mike Vaiana, Diogo de Lucena, Judd Rosenblatt

AICRAFT: DARPA-Funded AI Alignment Researchers — Applications Open TL;DR: We hypothesize that most alignment researchers have more ideas than they have engineering bandwidth to test. AICRAFT is a DARPA-funded project that pairs researchers with a fully managed professional engineering team for two-week pilot sprints, designed specifically for high-risk ideas that might otherwise go untested. We will select 6 applicants and execute a 2 week pilot with each, the most promising pilot may be given a 3 month extension. This is the first MVP for engaging DARPA directly with the alignment community to our knowledge, and if successful can catalyze government scale investment in alignment R&D. Apply here. Applications close March 27, 2026 at 11 PM PST. What is AICRAFT? AICRAFT (Artificial Intelligence Control Research Amplification & Framework for Talent) is a DARPA-funded seedling project executed by AE Studio. The premise is straightforward: we hypothesize that alignment research could progress faster if the best researchers had more leverage. We believe that researchers currently are bottlenecked on either execution (i.e. they are doing the hands-on experiments themselves) or management (i.e. they are managing teams that are executing the work). Management is higher leverage but what if we could push that much [...] ---Outline:(00:15) AICRAFT: DARPA-Funded AI Alignment Researchers -- Applications Open(01:08) What is AICRAFT?(02:49) The Bigger Picture(03:56) Who should apply?(04:26) How it works(05:21) The application(06:11) FAQ --- First published: March 16th, 2026 Source: https://www.lesswrong.com/posts/nmMdtZveC38atLnDm/aicraft-darpa-funded-ai-alignment-researchers-applications --- Narrated by TYPE III AUDIO.

Mar 16, 2026 • 24min

“Customer Satisfaction Opportunities” by Tomás B.

I am monitoring surveillance camera V84A. A tall man is walking towards me. He is roughly twenty-five. <faceprint> His name is Damion Prescott. He has a room booked for a whole month. His facial symmetry scores show he is in the 99th percentile. This is in accordance with my holistic impression. <search> School records show both truancy and perfect grades, suggesting high intelligence and disagreeableness. Searching social media. <search>. No record of modeling or acting experience, fame. I will assign him to our tier C high-value client list, based solely on his facial symmetry score and wealth. Reminder to recommend seating him in a high-visibility table, should he be heading to the restaurant. <search> I found a forum post mentioning him on swipeshare.com. Several women are sharing pictures, having seen him on a dating app. I recall Hinge uses highly attractive profiles to entice new users. They appear to be using Damion Prescott's profile heavily in this capacity. The women on the site are memeing about him. They are wondering why almost none of them have matched, apparently this is rare even for the most attractive men. Only one appears to have gone on a date with him. She [...] --- First published: March 16th, 2026 Source: https://www.lesswrong.com/posts/LTKfRovaJ6jcwDJia/customer-satisfaction-opportunities-1 --- Narrated by TYPE III AUDIO.

Mar 16, 2026 • 7min

“LLM Misalignment Can be One Gradient Step Away, and Blackbox Evaluation Cannot Detect It.” by Yavuz Bakman

Models that appear aligned under black-box evaluation may conceal substantial latent misalignment beneath their observable behavior. Let's say you downloaded a language model from Huggingface. You do all the blackbox evaluation for the safety/alignment, and you are convinced that the model is safe/aligned. But how badly can things go after you update the model? Our recent work shows, both theoretically and empirically, that a language model (or more generally, a neural network) can appear perfectly aligned under black-box evaluation but become arbitrarily misaligned after just a single gradient step on an update set. Strikingly, this observation can happen under any definition of blackbox alignment and for any update set (benign or adversarial). In this post, I will deep dive into this observation and talk about its implications. Theory: Same Forward Computation, Different Backward Computation LLMs or NNs in general are overparameterized. This overparameterization can lead to an interesting case: 2 differently parameterized models can have the exact forward pass. Think about a simple example: the two-layer linear model and the model . Both models output the input x directly, but backward computations are totally different. Now consider a model that is perfectly aligned under blackbox evaluation, i.e. [...] ---Outline:(01:07) Theory: Same Forward Computation, Different Backward Computation(03:18) Hair-Trigger Aligned LLMs(05:44) Whats Next? --- First published: March 14th, 2026 Source: https://www.lesswrong.com/posts/uSgw9muqRZpjpxKDA/llm-misalignment-can-be-one-gradient-step-away-and-blackbox-1 --- Narrated by TYPE III AUDIO. ---Images from the article:Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

Mar 16, 2026 • 40min

“Compradorization” by Benquo

Previously: Is GDP a Kind of Factory? There is a word, "convergence," which economists use when they want to say that poor countries are becoming less poor relative to rich ones. There is a phrase, "the resource curse," for the tendency of countries with valuable natural resources to stay poor despite their resources. There is a phrase, "Dutch disease," for the way that selling one commodity too profitably can destroy the ability to sell other things. When an economist says "Dutch disease," they are choosing not to say "Chinese industrial policy combined with structural adjustment conditionality." When they say "the resource curse," they are choosing not to say "extraction concessions negotiated under debt pressure, with domestic officials whose personal interests had already been oriented toward the extraction rather than toward their own population, in conditions created by international creditors who collectively benefited from those terms." When they say "convergence," they are choosing not to say "a temporary windfall from China's industrial buildout, recorded in a measure that cannot distinguish liquidation from accumulation, in countries whose productive capacity was simultaneously being eroded by the same process that temporarily raised their GDP." These words name phenomena while drawing [...] ---Outline:(01:44) Dutch Disease(04:01) The Restructuring of Interests(13:25) Compradorization: The Separation of Interest from Duty(15:50) Reflexive Compradorization: The Prodigal Son(19:24) Construals of Corruption: Fawkes or Villiers?(21:45) Development Consulting: a Case Study(22:44) The Instruments and the Flinch(22:48) The Roles(25:55) The Pervert(26:42) The Hysteric(27:34) The Neurotic(30:22) The Bargain(33:10) Basilisk(34:56) Punctuated Equilibrium(36:15) Outside the Asylum(37:50) What Does This Have to Do with Solow Convergence? The original text contained 4 footnotes which were omitted from this narration. --- First published: March 16th, 2026 Source: https://www.lesswrong.com/posts/8P8bLbNHvC8cHXsBs/compradorization --- Narrated by TYPE III AUDIO.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app

LessWrong (30+ Karma)

Episodes

Mentioned books

“Adding Typos Made Haiku’s Accuracy Go Up” by bira

“LLMs as Giant Lookup-Tables of Shallow Circuits” by niplav, Claude+

“Medical Roundup #7” by Zvi

“Types of Handoff to AIs” by Daniel Kokotajlo

“You can’t imitation-learn how to continual-learn” by Steven Byrnes

“PSA: Predictions markets often have very low liquidity; be careful citing them.” by Eye You

“AICRAFT: DARPA-Funded AI Alignment Researchers — Applications Open” by Mike Vaiana, Diogo de Lucena, Judd Rosenblatt

“Customer Satisfaction Opportunities” by Tomás B.

“LLM Misalignment Can be One Gradient Step Away, and Blackbox Evaluation Cannot Detect It.” by Yavuz Bakman

“Compradorization” by Benquo

The AI-powered Podcast Player