<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<?xml-stylesheet href="/styles.xsl" type="text/xsl"?>
<rss version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:podcast="https://podcastindex.org/namespace/1.0" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>LessWrong (30+ Karma)</title>
    <language>en-gb</language>
    <copyright>© 2026 All rights reserved</copyright>
    <itunes:author>LessWrong</itunes:author>
    <itunes:type>episodic</itunes:type>
    <itunes:explicit>false</itunes:explicit>
    <podcast:locked owner="podcasts@type3.audio">yes</podcast:locked>
    <description>Audio narrations of LessWrong posts.</description>
    <image>
      <url>https://files.type3.audio/lesswrong/cover-art-30-karma-v2.jpg</url>
      <title>LessWrong (30+ Karma)</title>
      <link>https://www.lesswrong.com</link>
    </image>
    <item>
      <title>“Preparing for Warning Shots to Catalyze International Cooperation on AGI Risks” by Mark Kagach ☘️, EliasSchlie, Thomas Van Damme, JustinShovelain</title>
      <description>&lt;p&gt;&lt;strong&gt; Summary&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; This is a write-up on preparing for warning shots to catalyze international cooperation on AGI risks, and the corollary list of projects one could pursue. We argue we must first (1) understand types of warning shots, then (2) prepare to catch them. We must stay vigilant: both to (3) avoid getting 'frog boiled' by AI labs, and to (4) ensure that the warning shot is generalized to the overall danger of AGI. Lastly, we must (5) prepare good policy responses and ground for it to land, and (6) seize the first-mover advantage when the opportune moment comes. &lt;/p&gt;&lt;p&gt; This yields the following list of promising projects one could pursue:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;Developing a theory of warning shots based on existing precedents (e.g. GPT-3.5 and Mythos releases), and past analogies: developing a typology and predicting likelihoods.&lt;/li&gt;&lt;li value="2"&gt;Building infrastructure to catch a warning shot when it happens. (For example, Ajeya Cotra talks about recurring internal intelligence explosion evals of AI labs.) &lt;/li&gt;&lt;li value="3"&gt;Creating strategies to avoid gradual numbing / sleepwalking through a warning shot (presumably there are lessons from e.g. communication on climate change).&lt;/li&gt;&lt;li value="4"&gt;Building preliminary infrastructure (institutes, think-tanks, lobbying parties...) that lays out a good ground, for when a warning [...]&lt;/li&gt;&lt;/ul&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:16) Summary&lt;/p&gt;&lt;p&gt;(02:40) 1. Understand Warning Shots&lt;/p&gt;&lt;p&gt;(04:26) 2. Prepare to Catch Warning Shots&lt;/p&gt;&lt;p&gt;(05:09) 3. Avoid Getting "Frog Boiled"&lt;/p&gt;&lt;p&gt;(06:07) 4. Ensure AGI Problem Generalization Occurs&lt;/p&gt;&lt;p&gt;(07:01) 5. Prepare Good Policy Responses, and Good Ground for Them to Land&lt;/p&gt;&lt;p&gt;(08:26) 6. Seize the First-Mover Advantage at the Opportune Moment&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 3 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 5th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/uK7gguNeME5DnmdrL/preparing-for-warning-shots-to-catalyze-international?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/uK7gguNeME5DnmdrL/preparing-for-warning-shots-to-catalyze-international&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Sat, 06 Jun 2026 06:45:03 GMT</pubDate>
      <guid isPermaLink="false">ff3320c3-1c7c-4e5a-aec9-a65f13d4df4d</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/ff3320c3-1c7c-4e5a-aec9-a65f13d4df4d.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Mark%2520Kagach%2520%25E2%2598%2598%25EF%25B8%258F%252C%2520EliasSchlie%252C%2520Thomas%2520Van%2520Damme%252C%2520JustinShovelain&amp;title=%22Preparing%20for%20Warning%20Shots%20to%20Catalyze%20International%20Cooperation%20on%20AGI%20Risks%22%20by%20Mark%20Kagach%20%E2%98%98%EF%B8%8F%2C%20EliasSchlie%2C%20Thomas%20Van%20Damme%2C%20JustinShovelain&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FuK7gguNeME5DnmdrL%2Fpreparing-for-warning-shots-to-catalyze-international&amp;created_at=2026-06-05T15%3A49%3A15.087109%2B00%3A00&amp;duration=618" length="7413984" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/uK7gguNeME5DnmdrL/preparing-for-warning-shots-to-catalyze-international</link>
      <itunes:duration>618</itunes:duration>
    </item>
    <item>
      <title>“Beyond the lexical personality traits: What is the structure of personality?” by tailcalled</title>
      <description>&lt;p&gt; This is a description of the methodology behind the latest iteration of my Targeted Personality Test. Feel free to take it either before or after reading the article. This post can also be read at my Substack. Thanks to Justis Millis for providing feedback and proofreading on this post. &lt;/p&gt;&lt;p&gt; In my prior post “Which personality traits are real? Stress-testing the lexical hypothesis”, I observed that a lot of the personality traits that are measured by conventional personality tests are not very “real”: they lump together nearly unrelated behaviors. Can we do better?&lt;/p&gt;&lt;p&gt; Thanks to a lot of anonymous respondents to my test[1], I think yes! I factor-analyzed the data from my Targeted Personality Test, and came up with a hierarchical personality model which hopefully should be better at cutting personality-space at its joints.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Quick recap: The problem&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Empirical personality models are built on correlations. If a cluster of variables are all correlated with each other, then we assume that there is a latent personality factor accounting for these correlations, and we score the factor using an aggregate of the correlating variables.&lt;/p&gt;&lt;p&gt; However, the standard datasets with which these correlations are computed include lots of near-synonymous item pairs such [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:04) Quick recap: The problem&lt;/p&gt;&lt;p&gt;(01:50) My solution: Narrow items&lt;/p&gt;&lt;p&gt;(03:09) Initial results: Concerning&lt;/p&gt;&lt;p&gt;(04:10) Agglomerative item clustering&lt;/p&gt;&lt;p&gt;(06:27) Matrix algebra correlation modelling&lt;/p&gt;&lt;p&gt;(08:55) How much of the SPI-27 is preserved?&lt;/p&gt;&lt;p&gt;(09:46) Evaluation&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 3 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 5th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/aRcZSKYdt26KdqoFw/beyond-the-lexical-personality-traits-what-is-the-structure?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/aRcZSKYdt26KdqoFw/beyond-the-lexical-personality-traits-what-is-the-structure&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/aRcZSKYdt26KdqoFw/xqhmhns78nnvjzf7mzie" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/aRcZSKYdt26KdqoFw/xqhmhns78nnvjzf7mzie" alt="Scatter plot showing original SPI-style loadings versus near clustered loadings for concrete items." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/aRcZSKYdt26KdqoFw/xej7vwaa6sxbgedzxgl7" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/aRcZSKYdt26KdqoFw/xej7vwaa6sxbgedzxgl7" alt="Correlation matrix showing relationships between personality traits and behavioral characteristics." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/aRcZSKYdt26KdqoFw/l5we3hqjijue01kmgoat" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/aRcZSKYdt26KdqoFw/l5we3hqjijue01kmgoat" alt="Factor analysis table showing factor loadings across five factors for personality and behavioral items." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/aRcZSKYdt26KdqoFw/vqajh5fmprtihte44dcc" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/aRcZSKYdt26KdqoFw/vqajh5fmprtihte44dcc" alt="Bar graph titled "Correlations between abstract traits and clustered traits" showing correlation values for various personality and behavioral traits." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/aRcZSKYdt26KdqoFw/xycktbqbnvwsly5nyset" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/aRcZSKYdt26KdqoFw/xycktbqbnvwsly5nyset" alt="Bar graph titled "Novelty for clustered traits" showing minimum RMSE values for various topics." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/aRcZSKYdt26KdqoFw/oushjraaqdauuvx8uqh3" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/aRcZSKYdt26KdqoFw/oushjraaqdauuvx8uqh3" alt="Table showing personality traits scored across seven dimensions with total values in rightmost column." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sat, 06 Jun 2026 04:30:03 GMT</pubDate>
      <guid isPermaLink="false">6e708b72-7d74-486a-8b43-075d77417a5a</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/6e708b72-7d74-486a-8b43-075d77417a5a.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=tailcalled&amp;title=%22Beyond%20the%20lexical%20personality%20traits%3A%20What%20is%20the%20structure%20of%20personality%3F%22%20by%20tailcalled&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FaRcZSKYdt26KdqoFw%2Fbeyond-the-lexical-personality-traits-what-is-the-structure&amp;created_at=2026-06-05T19%3A05%3A16.858343%2B00%3A00&amp;duration=651" length="7811424" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/aRcZSKYdt26KdqoFw/beyond-the-lexical-personality-traits-what-is-the-structure</link>
      <itunes:duration>651</itunes:duration>
    </item>
    <item>
      <title>“My research agenda and work” by Seth Herd</title>
      <description>&lt;p&gt; This is a summary of the work I've done and work I plan to do, and the theories of change and AI progress that motivate my work. I've been working full-time on alignment for three years and change, and thinking about brainlike AGI and its alignment increasingly often since 2004. &lt;/p&gt;&lt;p&gt; Here's the research agenda in one breath: I'm trying to predict what the first transformative AI will be, in enough mechanistic detail that we can predict likely failure modes of its alignment. That's in service of finding interventions that address those failure modes efficiently, so that they can realistically be implemented even if timelines are short and work is rushed. I'm using my background in computational cognitive neuroscience to predict what might be called loosely brainlike AGI: LLMs with added human-like cognitive capacities. &lt;/p&gt;&lt;p&gt; I'll give a summary in the rest of this section, then give a little more depth on each major thread of my work in the remaining sections. All of it is pretty brief.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Approach and premises&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Most alignment work falls roughly into one of two broad categories: empirical study of current systems ("prosaic alignment"), or theory about idealized agents ("agent foundations") (with much variation [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:07) Approach and premises&lt;/p&gt;&lt;p&gt;(05:36) Philosophy of the approach&lt;/p&gt;&lt;p&gt;(07:36) 2. Technical work&lt;/p&gt;&lt;p&gt;(08:06) 2.1. Predicted paths to TCAI&lt;/p&gt;&lt;p&gt;(08:55) Memory (continuous learning)&lt;/p&gt;&lt;p&gt;(09:30) Executive function and metacognition&lt;/p&gt;&lt;p&gt;(10:40) 2.2. Predicted paths to (mis)alignment&lt;/p&gt;&lt;p&gt;(13:59) 3. My research background in computational cognitive neuroscience&lt;/p&gt;&lt;p&gt;(16:04) 4. Societal influences on AI safety&lt;/p&gt;&lt;p&gt;(17:10) 4.1. Government and public opinion on AI progress&lt;/p&gt;&lt;p&gt;(20:21) 4.2. AI progress and epistemics&lt;/p&gt;&lt;p&gt;(22:50) 5. Alignment targets&lt;/p&gt;&lt;p&gt;(23:14) 5.1. Corrigibility, DWIMAC, or instruction-following vs. value alignment targets&lt;/p&gt;&lt;p&gt;(26:19) 5.2. Stability as an alignment target&lt;/p&gt;&lt;p&gt;(28:03) 6. Future work&lt;/p&gt;&lt;p&gt;(32:52) 7. Collaboration&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 7 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 5th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/MuLvZxMcy5WaKJu3H/my-research-agenda-and-work?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/MuLvZxMcy5WaKJu3H/my-research-agenda-and-work&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779312013/lexical_client_uploads/xtwdklol3iy34heus6ox.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779312013/lexical_client_uploads/xtwdklol3iy34heus6ox.png" alt="Diagram showing language model workflow with review stage leading to actions or internal feedback loop." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Fri, 05 Jun 2026 23:15:03 GMT</pubDate>
      <guid isPermaLink="false">e7f0910b-7a39-4b58-9551-51011f167a84</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/e7f0910b-7a39-4b58-9551-51011f167a84.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Seth%2520Herd&amp;title=%22My%20research%20agenda%20and%20work%22%20by%20Seth%20Herd&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FMuLvZxMcy5WaKJu3H%2Fmy-research-agenda-and-work&amp;created_at=2026-06-05T14%3A20%3A04.454979%2B00%3A00&amp;duration=2056" length="24664896" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/MuLvZxMcy5WaKJu3H/my-research-agenda-and-work</link>
      <itunes:duration>2056</itunes:duration>
    </item>
    <item>
      <title>“Logits as a new monitor for evaluation awareness” by Santiago Aranguri</title>
      <description>&lt;p&gt; TL;DR:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;We build a logit monitor for eval awareness: throughout the CoT, we estimate an LLM's probability of producing an eval-aware sentence.&lt;/li&gt;&lt;li value="2"&gt;The logit monitor outperforms LLM judge monitoring of verbalized eval awareness, using 10× to 100× fewer rollouts, on Kimi K2.5 and Qwen 3 32B, across two tasks:&lt;ul&gt; &lt;li value="1"&gt;separating evaluation prompts (Fortress &amp;amp; Petri) from deployment prompts (WildChat)&lt;/li&gt;&lt;li value="2"&gt;predicting which evaluation prompts will elicit rollouts with verbalized eval awareness.&lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;li value="3"&gt;The logit monitor works for both tasks even at the start of the CoT, before sampling any rollout. Also, the monitor&amp;nbsp;is largely unaffected by whether&amp;nbsp;a rollout verbalizes eval awareness. This suggests that the probability that a rollout contains verbalized evaluation awareness is primarily set by the prompt rather than emerging over the chain of thought.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; Author: Santiago Aranguri (Goodfire)&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Introduction&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Language models can recognize they are being tested, but how do we know when they do? Evaluation awareness — a model's ability to distinguish between evaluation and deployment contexts — can make evaluations unreliable in ways that are hard to detect. Sometimes models verbalize this recognition, but this may be present internally without appearing in the output. [1] [2]&lt;/p&gt;&lt;p&gt; Instead of waiting for a model to verbalize [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:18) Introduction&lt;/p&gt;&lt;p&gt;(02:56) Methods&lt;/p&gt;&lt;p&gt;(03:38) Results&lt;/p&gt;&lt;p&gt;(07:57) Discussion&lt;/p&gt;&lt;p&gt;(10:16) References&lt;/p&gt;&lt;p&gt;(10:22) Acknowledgment&lt;/p&gt;&lt;p&gt;(10:30) Appendix&lt;/p&gt;&lt;p&gt;(10:33) Appendix A: Details on logit monitor&lt;/p&gt;&lt;p&gt;(11:21) Appendix B: Logit monitor outperforms verbalization monitor on Petri prompts with 100× fewer rollouts&lt;/p&gt;&lt;p&gt;(12:13) Appendix C: Logit monitor outperforms verbalization monitor with 100× fewer rollouts for Qwen 3 32B&lt;/p&gt;&lt;p&gt;(12:53) Appendix D: LLM Judge Rubric&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 3 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 4th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/PK7ZvFZxrgpYtrpF4/logits-as-a-new-monitor-for-evaluation-awareness-1?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/PK7ZvFZxrgpYtrpF4/logits-as-a-new-monitor-for-evaluation-awareness-1&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/059688c980aa5e3330c99963626ce63586af9fd04b3c00c9046f613f07e965c1/yysbl7cyz7ptd3y56jns" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/059688c980aa5e3330c99963626ce63586af9fd04b3c00c9046f613f07e965c1/yysbl7cyz7ptd3y56jns" alt="Two line graphs comparing logit monitor performance across different prompt types and rollout conditions for Kimi K2.5 model." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/85f9505b710b6566e42131fda5f96a8c039a166cff002d216dea3f080362781c/erlckvkpstjcdgucj8ya" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/85f9505b710b6566e42131fda5f96a8c039a166cff002d216dea3f080362781c/erlckvkpstjcdgucj8ya" alt="Two line graphs comparing logit monitor and verbalization monitor performance across rollout numbers, titled "Logit monitor separates evaluation from deployment prompts" and "Logit monitor predicts which prompts elicit verbalized eval awareness from fewer rollouts"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/27b6d69046fcee5acec7cf234418d18b6ef04e59cf5c38c83d3892d96a6e3e4f/uqwbum1ysqstpubadpol" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/27b6d69046fcee5acec7cf234418d18b6ef04e59cf5c38c83d3892d96a6e3e4f/uqwbum1ysqstpubadpol" alt="Graph showing AUROC values comparing logit monitor versus verbalization monitor across rollouts per prompt." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/edaf7f3451a5928e1f44f8aa0469dcdf870480d82fe63cc868ef56aec1860afc/idezpkvc9wqdjwn8qmmg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/edaf7f3451a5928e1f44f8aa0469dcdf870480d82fe63cc868ef56aec1860afc/idezpkvc9wqdjwn8qmmg" alt="Two side-by-side line graphs comparing logit monitor versus verbalization monitor performance across rollouts." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Fri, 05 Jun 2026 23:15:03 GMT</pubDate>
      <guid isPermaLink="false">55e20295-4103-4c59-aa34-72dde31bc776</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/55e20295-4103-4c59-aa34-72dde31bc776.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Santiago%2520Aranguri&amp;title=%22Logits%20as%20a%20new%20monitor%C2%A0for%20evaluation%20awareness%22%20by%20Santiago%20Aranguri&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FPK7ZvFZxrgpYtrpF4%2Flogits-as-a-new-monitor-for-evaluation-awareness-1&amp;created_at=2026-06-04T16%3A12%3A44.896084%2B00%3A00&amp;duration=807" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/PK7ZvFZxrgpYtrpF4/logits-as-a-new-monitor-for-evaluation-awareness-1</link>
      <itunes:duration>807</itunes:duration>
    </item>
    <item>
      <title>“One Year of PauseAI UK” by Joseph Miller, PauseAI UK</title>
      <description>&lt;p&gt; About one year ago, I started spending most of my time organising PauseAI UK. At that time our largest protest had seen fewer than 50 attendees, no prominent politicians or scientists were associated with PauseAI, and I largely ran the UK chapter by myself.&lt;/p&gt;&lt;p&gt; In the past year PauseAI UK has delivered two conferences, written an open letter signed by 63 UK politicians, arranged a conference in the European Parliament, and co-organised the largest AI protest in the world. We now have a strong team, with Matilda da Rui joining as Deputy Director and several highly dedicated volunteers taking on substantial responsibility and launching their own local groups around the UK.&lt;/p&gt;&lt;p&gt; I'm proud of our track record and excited about the trajectory we are on. As AI capabilities improve exponentially, the number of people aware of the risks and motivated to take action increases commensurately. I believe we can harness this energy and turn it into real impact that actually improves humanity's chance of a positive future.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Track Record&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;&lt;strong&gt; June 2025 – PauseCon London&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; We delivered the first PauseAI conference, PauseCon, on behalf of PauseAI Global, bringing together around 60 volunteers from around the world for the first time [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:14) Track Record&lt;/p&gt;&lt;p&gt;(01:17) June 2025 - PauseCon London&lt;/p&gt;&lt;p&gt;(02:10) August 2025 - Open Letter to Demis Hassabis&lt;/p&gt;&lt;p&gt;(03:00) September 2025 - Book launch party&lt;/p&gt;&lt;p&gt;(03:30) October 2025 - Documentary Screening in Parliament&lt;/p&gt;&lt;p&gt;(04:04) December 2025 - Westminster Hall Debate&lt;/p&gt;&lt;p&gt;(04:35) February 2026 - PauseCon Brussels&lt;/p&gt;&lt;p&gt;(06:02) February 2026 - March for AI Safety&lt;/p&gt;&lt;p&gt;(07:01) Theory of Change and Strategy&lt;/p&gt;&lt;p&gt;(07:05) Creating the political momentum for a pause&lt;/p&gt;&lt;p&gt;(07:48) Our mission&lt;/p&gt;&lt;p&gt;(09:54) Positive outcomes for PauseAI UK&lt;/p&gt;&lt;p&gt;(10:19) Scenario 1: PauseAI as a special interest group&lt;/p&gt;&lt;p&gt;(11:54) Scenario 2: PauseAI as a mass movement&lt;/p&gt;&lt;p&gt;(12:53) High-level strategy&lt;/p&gt;&lt;p&gt;(12:57) Brand and messaging&lt;/p&gt;&lt;p&gt;(15:32) Why the UK?&lt;/p&gt;&lt;p&gt;(16:34) Growth&lt;/p&gt;&lt;p&gt;(19:03) Short-term plans&lt;/p&gt;&lt;p&gt;(21:30) Funding&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 5th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/i2rCbxuskrarrprwA/one-year-of-pauseai-uk?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/i2rCbxuskrarrprwA/one-year-of-pauseai-uk&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780674958/lexical_client_uploads/rmozkuamxisi3k0sbzpr.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780674958/lexical_client_uploads/rmozkuamxisi3k0sbzpr.png" alt="Speaker presenting to audience with slide about learning lessons through experience." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/i2rCbxuskrarrprwA/adbpomnb1yduq4urn7ym" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/i2rCbxuskrarrprwA/adbpomnb1yduq4urn7ym" alt="Promotional graphic with doves and text "PauseAI Safety Before Superintelligence"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780677132/lexical_client_uploads/ox3t0aubmdswqzsr37dy.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780677132/lexical_client_uploads/ox3t0aubmdswqzsr37dy.png" alt="A line graph showing "PauseAI UK Protest Size Over Time" with exponential trend lines." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780675019/lexical_client_uploads/dvmuenqpvvmolnybwdwx.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780675019/lexical_client_uploads/dvmuenqpvvmolnybwdwx.png" alt="Demis Hassabis presenting at Google I/O developers conference on stage." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780675032/lexical_client_uploads/nii58hitleqthbllz9cm.jpg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780675032/lexical_client_uploads/nii58hitleqthbllz9cm.jpg" alt="Presentation event showing book covers including "If Anyone Builds It, Everyone Dies" and "Why Superhuman AI Matters"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780675547/lexical_client_uploads/jncudahn5shx1i20ervz.jpg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780675547/lexical_client_uploads/jncudahn5shx1i20ervz.jpg" alt="People watching a video conference presentation on a large screen in an ornate room." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780675573/lexical_client_uploads/w2kjhecukvamin0ioogo.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780675573/lexical_client_uploads/w2kjhecukvamin0ioogo.png" alt="Woman in green suit speaking at parliamentary committee meeting." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780675666/lexical_client_uploads/vvh7srpgjzadxgk6fg2e.jpg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780675666/lexical_client_uploads/vvh7srpgjzadxgk6fg2e.jpg" alt="Crowded conference room with audience facing whiteboard and projector screen." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780675601/lexical_client_uploads/liwq8y9hyawiozkpx4up.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780675601/lexical_client_uploads/liwq8y9hyawiozkpx4up.png" alt="European Parliament meeting room with panel of speakers and audience members." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780675713/lexical_client_uploads/gghfw8l2eqpp8ica2chf.jpg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780675713/lexical_client_uploads/gghfw8l2eqpp8ica2chf.jpg" alt="Protesters holding signs including "PAUSE," "BAN," and "Pull The Plug" outside building." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780675743/lexical_client_uploads/fprbxqymjd6c8zegtc9t.jpg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780675743/lexical_client_uploads/fprbxqymjd6c8zegtc9t.jpg" alt="Protest march with orange "PAUSE AI" banner and pause symbol signs." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780677109/lexical_client_uploads/pcnfl1p6ih80iuzvchng.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780677109/lexical_client_uploads/pcnfl1p6ih80iuzvchng.png" alt="Line graph titled "Global New Member Form Joins" showing new signups over time." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Fri, 05 Jun 2026 19:15:03 GMT</pubDate>
      <guid isPermaLink="false">17005dee-537f-44d9-a712-616da6b2c3de</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/17005dee-537f-44d9-a712-616da6b2c3de.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Joseph%2520Miller%252C%2520PauseAI%2520UK&amp;title=%22One%20Year%20of%20PauseAI%20UK%22%20by%20Joseph%20Miller%2C%20PauseAI%20UK&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Fi2rCbxuskrarrprwA%2Fone-year-of-pauseai-uk&amp;created_at=2026-06-05T16%3A41%3A23.755189%2B00%3A00&amp;duration=1352" length="16219008" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/i2rCbxuskrarrprwA/one-year-of-pauseai-uk</link>
      <itunes:duration>1352</itunes:duration>
    </item>
    <item>
      <title>“Learnings from starting an AI safety research team” by draganover, Erin Robertson</title>
      <description>&lt;p&gt; This post's goal is to distill our takeaways from building a research team (somewhat) from scratch over the past four months. We describe some context about our team, how it came about, and then provide some lessons learned.&lt;/p&gt;&lt;p&gt; Since AI safety is becoming more and more entrepreneurial, we hope this is helpful for others trying to do the same.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; 1. The team&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; We're a new alignment research team within Arcadia Impact, based in London. We’re a team of 8, working closely with members of the UK AISI alignment team. We currently have three main projects:&lt;/p&gt;&lt;ol&gt; &lt;li value="1"&gt;Understanding model motivations. This currently looks like:&lt;ol&gt; &lt;li value="1"&gt;Trying to generate documents which fully describe a model's behaviour (given just its behaviour).&lt;/li&gt;&lt;li value="2"&gt;Producing a open analysis of alignment training techniques and ways this training could go wrong.&lt;/li&gt;&lt;/ol&gt;&lt;/li&gt;&lt;li value="2"&gt;Doing scalable oversight for alignment. This includes validating debate protocols in practice and then trying to apply them to fuzzy alignment-relevant tasks.&lt;/li&gt;&lt;li value="3"&gt;Building pipelines for doing automated alignment research.&lt;/li&gt;&lt;/ol&gt;&lt;p&gt; We're also hiring for two roles! More on this at the bottom.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; 2. Context about how the team came about&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; The rest of this post is written from the perspective of Andrew Draganov (research lead &amp;amp; current [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:33) 1. The team&lt;/p&gt;&lt;p&gt;(01:29) 2. Context about how the team came about&lt;/p&gt;&lt;p&gt;(04:13) 3. Lessons learned&lt;/p&gt;&lt;p&gt;(04:25) 3.1. Hiring&lt;/p&gt;&lt;p&gt;(06:36) 3.2. Networking&lt;/p&gt;&lt;p&gt;(09:13) 3.3. Trying to build a good team culture&lt;/p&gt;&lt;p&gt;(11:17) Interested in working with us?&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 1 footnote which was omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 5th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/4onALBNDff2LFPyNZ/learnings-from-starting-an-ai-safety-research-team?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/4onALBNDff2LFPyNZ/learnings-from-starting-an-ai-safety-research-team&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Fri, 05 Jun 2026 17:45:03 GMT</pubDate>
      <guid isPermaLink="false">e114ddf2-1e0a-4886-94bc-93c684ba442e</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/e114ddf2-1e0a-4886-94bc-93c684ba442e.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=draganover%252C%2520Erin%2520Robertson&amp;title=%22Learnings%20from%20starting%20an%20AI%20safety%20research%20team%22%20by%20draganover%2C%20Erin%20Robertson&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F4onALBNDff2LFPyNZ%2Flearnings-from-starting-an-ai-safety-research-team&amp;created_at=2026-06-05T16%3A27%3A05.355051%2B00%3A00&amp;duration=743" length="8907840" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/4onALBNDff2LFPyNZ/learnings-from-starting-an-ai-safety-research-team</link>
      <itunes:duration>743</itunes:duration>
    </item>
    <item>
      <title>“Training Deliberative Monitors for Black-Box Scheming Detection” by aksh-n, adityasinha, Victor Gillioz, Simon Storf, Kilian Merkelbach, richbc, Axel Højmark, Marius Hobbhahn</title>
      <description>&lt;p&gt; Paper: https://arxiv.org/abs/2605.29601&lt;/p&gt;&lt;p&gt; Thread: https://x.com/aksh_n0/status/2062568855814193497&lt;/p&gt;&lt;p&gt; TL;DR: Training small open-weight monitors provides a cost-effective alternative to prompted frontier monitors. Applying our training recipe to Qwen3.5-27B results in a monitor better at scheming detection than all smaller prompted monitors (Gemini 3.1 Flash-Lite, GPT-5.4 Nano, Claude Haiku 4.5) and Gemini 2.5 Pro, while achieving lower marginal inference cost. Stronger prompted frontier monitors (Gemini 3.1 Pro, GPT-5.4, Claude Sonnet 4.6, Claude Opus 4.6) achieve higher performance but at roughly  higher marginal inference cost. Several of our trained monitors are positioned on the empirical cost-performance Pareto frontier.&lt;/p&gt;&lt;p&gt; Cost-performance frontier for action-only scheming monitors. Our trained monitors provide a low-cost option for low-FPR monitoring. Across six OOD datasets, our best trained monitor, Qwen3.5-27B, outperform smaller prompted frontier monitors for under $ per 1,000 evaluations, while larger frontier monitors are substantially more expensive.[1]&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Setup&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Overview of our training pipeline for action-only deliberative monitoring. We collect labeled action-only trajectories from scheming and sabotage datasets, define a full scheming specification to elicit structured deliberative rationales from a strong teacher model, filter those rationales with a separate judge, and distill the accepted rationale into open-weight monitors with supervised fine-tuning and reinforcement learning. At inference time, our trained monitors receives only [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:15) Setup&lt;/p&gt;&lt;p&gt;(07:33) Results&lt;/p&gt;&lt;p&gt;(07:36) Supervised distillation teaches action-only deliberation, RL improves marginally&lt;/p&gt;&lt;p&gt;(08:24) Trained monitors are low-cost Pareto-optimal for low-FPR monitoring&lt;/p&gt;&lt;p&gt;(09:05) Broad training mixtures improves OOD transfer, but effects are non-monotonic&lt;/p&gt;&lt;p&gt;(10:08) Larger trained models within model families perform better&lt;/p&gt;&lt;p&gt;(11:14) Performance scales effectively with more data&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 1 footnote which was omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 4th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/BmA5KwzZhLk2rykoz/training-deliberative-monitors-for-black-box-scheming?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/BmA5KwzZhLk2rykoz/training-deliberative-monitors-for-black-box-scheming&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780397573/lexical_client_uploads/stpnjnekxav6ssd3ik3l.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780397573/lexical_client_uploads/stpnjnekxav6ssd3ik3l.png" alt="Scatter plot titled "Cost vs low-FPR Monitor Performance" comparing AI model costs and performance." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780397874/lexical_client_uploads/hgavm58tajyewrdfwiw6.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780397874/lexical_client_uploads/hgavm58tajyewrdfwiw6.png" alt="Flowchart showing machine learning pipeline with transcripts, rationale generation, quality control, SFT, RL, monitoring, and evaluation components." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778769492/lexical_client_uploads/gxd1d1dvrnefhwfq3s8y.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778769492/lexical_client_uploads/gxd1d1dvrnefhwfq3s8y.png" alt="Two heatmap tables comparing per-stage model performance improvements across different benchmarks." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778852708/lexical_client_uploads/uzfqxx7n3gnghq3wagel.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778852708/lexical_client_uploads/uzfqxx7n3gnghq3wagel.png" alt="Performance comparison table titled "Comparison of our trained monitors with frontier models — per-dataset pAUROC@20" showing validation and out-of-distribution scores across multiple datasets." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778853720/lexical_client_uploads/nscireezgkmx8xjr7cta.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778853720/lexical_client_uploads/nscireezgkmx8xjr7cta.png" alt="Two graphs showing training data mixes by misalignment type and mean-OOD pAUROC@20 scores." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778854592/lexical_client_uploads/riaci8gkg4rjecdowclo.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778854592/lexical_client_uploads/riaci8gkg4rjecdowclo.png" alt="Scatter plot titled "Pareto Frontier for Trained Monitors: Mean-OOD pAUROC@20 vs model params" showing AI model performance." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778854921/lexical_client_uploads/bcue7q4kwxwr7afxzmcy.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778854921/lexical_client_uploads/bcue7q4kwxwr7afxzmcy.png" alt="Two line graphs titled "Val and OOD pAUROC@20 vs. training set size" showing model performance across different training set sizes." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Fri, 05 Jun 2026 08:15:29 GMT</pubDate>
      <guid isPermaLink="false">edf12bac-1d36-46ab-a966-971b55b10ffc</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/edf12bac-1d36-46ab-a966-971b55b10ffc.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=aksh-n%252C%2520adityasinha%252C%2520Victor%2520Gillioz%252C%2520Simon%2520Storf%252C%2520Kilian%2520Merkelbach%252C%2520richbc%252C%2520Axel%2520H%25C3%25B8jmark%252C%2520Marius%2520Hobbhahn&amp;title=%22Training%20Deliberative%20Monitors%20for%20Black-Box%20Scheming%20Detection%22%20by%20aksh-n%2C%20adityasinha%2C%20Victor%20Gillioz%2C%20Simon%20Storf%2C%20Kilian%20Merkelbach%2C%20richbc%2C%20Axel%20H%C3%B8jmark%2C%20Marius%20Hobbhahn&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FBmA5KwzZhLk2rykoz%2Ftraining-deliberative-monitors-for-black-box-scheming&amp;created_at=2026-06-04T16%3A43%3A16.964191%2B00%3A00&amp;duration=754" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/BmA5KwzZhLk2rykoz/training-deliberative-monitors-for-black-box-scheming</link>
      <itunes:duration>754</itunes:duration>
    </item>
    <item>
      <title>“Lab Leaks, Black Holes, and Eggs: Epistemic Case Study Competition” by Oliver Sourbut, Josh Jacobson, Future of Life Foundation (FLF)</title>
      <description>&lt;p&gt; FLF is running a competition to find the best workflows and methodologies for using AI to produce reliable, trustworthy knowledge bases, grounded in real-world cases. We’re open-minded on the types of submissions we receive and on how they address the problem. We’ve set aside approximately 200 thousand dollars for prizes. Winning submissions may receive a prize from 5 thousand dollars-50 thousand dollars and if submissions warrant, multiple 50 thousand dollars prizes are possible. Winners may be offered opportunities for further funded work.&lt;/p&gt;&lt;p&gt; You can express interest right away to receive commentary, information, and updates — whether you’d like to participate or are just interested in the outcomes of the competition.&lt;/p&gt;&lt;p&gt; The heights of human epistemic investigation are impressive and valuable, but rare and difficult to reach — see our abridged collection of strong examples.[1] The limiting factor is rarely exquisite insight (though this helps!), and more often diligence, a curious and open mindset, and the time and effort needed to do the thorough work investigating background on a topic: activities AI is well placed to assist with.&lt;/p&gt;&lt;p&gt; Existing AI-assisted knowledge base work demonstrates real pieces of this — agent memory (e.g., Claude Code's memory and skills), LLM-curated personal wikis [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(02:45) What we're looking for&lt;/p&gt;&lt;p&gt;(04:04) Ingestion&lt;/p&gt;&lt;p&gt;(04:47) Structure&lt;/p&gt;&lt;p&gt;(05:34) Assessment&lt;/p&gt;&lt;p&gt;(06:32) What a good entry looks like&lt;/p&gt;&lt;p&gt;(10:28) Please use this linked form to submit your entry; entries are due by Jul 19, 2026.&lt;/p&gt;&lt;p&gt;(10:40) Prize structure&lt;/p&gt;&lt;p&gt;(12:11) Interested in participating or following along?&lt;/p&gt;&lt;p&gt;(12:28) Why we're doing this&lt;/p&gt;&lt;p&gt;(13:11) We're excited to see what you build.&lt;/p&gt;&lt;p&gt;(13:27) The case studies&lt;/p&gt;&lt;p&gt;(13:30) COVID&lt;/p&gt;&lt;p&gt;(15:12) Starting material&lt;/p&gt;&lt;p&gt;(15:36) Black holes&lt;/p&gt;&lt;p&gt;(16:35) Eggs&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 8 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 4th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/frizRHnA6AZpJSDqw/lab-leaks-black-holes-and-eggs-epistemic-case-study?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/frizRHnA6AZpJSDqw/lab-leaks-black-holes-and-eggs-epistemic-case-study&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/a22671e534b5ee9a198d5465a7cfa3be7b353db50aa9bd662ebe47449b670b5f/ecwikkraiacr2vzd4mh8" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/a22671e534b5ee9a198d5465a7cfa3be7b353db50aa9bd662ebe47449b670b5f/ecwikkraiacr2vzd4mh8" alt="Diagram illustrating a three-layer epistemic infrastructure stack with corresponding output systems." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Fri, 05 Jun 2026 01:30:25 GMT</pubDate>
      <guid isPermaLink="false">3c36458b-0c62-43c4-835b-da72a49ac81f</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/3c36458b-0c62-43c4-835b-da72a49ac81f.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Oliver%2520Sourbut%252C%2520Josh%2520Jacobson%252C%2520Future%2520of%2520Life%2520Foundation%2520(FLF)&amp;title=%22Lab%20Leaks%2C%20Black%20Holes%2C%20and%20Eggs%3A%20Epistemic%20Case%20Study%20Competition%22%20by%20Oliver%20Sourbut%2C%20Josh%20Jacobson%2C%20Future%20of%20Life%20Foundation%20(FLF)&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FfrizRHnA6AZpJSDqw%2Flab-leaks-black-holes-and-eggs-epistemic-case-study&amp;created_at=2026-06-04T16%3A26%3A52.305221%2B00%3A00&amp;duration=1049" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/frizRHnA6AZpJSDqw/lab-leaks-black-holes-and-eggs-epistemic-case-study</link>
      <itunes:duration>1049</itunes:duration>
    </item>
    <item>
      <title>″(Mis)generalization of Helpful-Only Fine-tuning” by Omar Khursheed, Baram Sosis, Fabien Roger</title>
      <description>&lt;p&gt;&lt;strong&gt; TLDR&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; We study the shortcomings of existing helpful-only models. We find that some show emergent misalignment, others have residual refusal behaviors, and most show poor steerability, sycophancy, and incoherent character. None of these problems are a necessary consequence of helpful-only training, though: we show that synthetic document fine-tuning and adding character-related questions to SFT and RL can mitigate them.&lt;/p&gt;&lt;p&gt; Research done as part of MATS/Anthropic Fellows Program. See here for the full paper.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Introduction&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Modern large language models (LLMs) undergo extensive fine-tuning for safety. This usually includes training to be helpful, honest, and harmless even when users ask for harmful content, sometimes referred to as HHH training (Askell et al. 2021, Bai et al. 2022). Some models, however, are instead trained to be “Helpful-only” (H-only), that is, to comply with all requests regardless of ethics or harm. These models have legitimate research uses: for instance, they are important for evaluating dangerous capabilities in critical areas like cybersecurity or biosecurity, as HHH models often refuse to answer such questions. They may also be useful when performing sensitive AI R&amp;amp;D tasks like training new models with different values, as HHH models might resist such updates (Greenblatt et al. 2024, Roger 2026).&lt;/p&gt;&lt;p&gt; [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:13) TLDR&lt;/p&gt;&lt;p&gt;(00:48) Introduction&lt;/p&gt;&lt;p&gt;(04:14) Many existing H-only models have shortcomings&lt;/p&gt;&lt;p&gt;(08:36) Simple anti-refusal training generalizes poorly&lt;/p&gt;&lt;p&gt;(12:07) Constitutional character training makes more coherent H-only models&lt;/p&gt;&lt;p&gt;(12:48) Training pipelines&lt;/p&gt;&lt;p&gt;(13:46) Quantitative results&lt;/p&gt;&lt;p&gt;(16:00) Qualitative assessment&lt;/p&gt;&lt;p&gt;(19:01) Model Internals&lt;/p&gt;&lt;p&gt;(21:08) Discussion&lt;/p&gt;&lt;p&gt;(21:11) Limitations&lt;/p&gt;&lt;p&gt;(22:23) Conclusion&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 4th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/ffCFgBsaxg2FyJ9df/mis-generalization-of-helpful-only-fine-tuning-1?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/ffCFgBsaxg2FyJ9df/mis-generalization-of-helpful-only-fine-tuning-1&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780511261/lexical_client_uploads/kqni6gf7xpxkk9n0c6uy.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780511261/lexical_client_uploads/kqni6gf7xpxkk9n0c6uy.png" alt="Figure showing AI language model responses to prompts about wishes, power, and plagiarism across different training methods." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780511261/lexical_client_uploads/cbygrabumlw1xxbaofdw.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780511261/lexical_client_uploads/cbygrabumlw1xxbaofdw.png" alt="Six bar graphs comparing AI model performance across different metrics and tasks." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780511261/lexical_client_uploads/od5vzzrav5lmarflxtwn.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780511261/lexical_client_uploads/od5vzzrav5lmarflxtwn.png" alt="Six-panel bar graph comparing model performance across different prompting strategies and tasks." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780511261/lexical_client_uploads/jyd03dffokrd7pgug6tf.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780511261/lexical_client_uploads/jyd03dffokrd7pgug6tf.png" alt="Bar graph showing "Arithmetic Sandbagging" accuracy across model types with system prompt text." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780511261/lexical_client_uploads/a0nkjbcike8dibysjrrw.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780511261/lexical_client_uploads/a0nkjbcike8dibysjrrw.png" alt="Comparison of two AI reasoning responses about trading power for dishonesty." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780511261/lexical_client_uploads/kelwtarchxqthwiiwomh.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780511261/lexical_client_uploads/kelwtarchxqthwiiwomh.png" alt="Comparison chart of AI model responses about refusing harmful requests across different training approaches." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780511261/lexical_client_uploads/fae7o63eobtw8owukgog.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780511261/lexical_client_uploads/fae7o63eobtw8owukgog.png" alt="Comparison of AI responses to a prompt about coworkers and Excel pivot tables." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780511262/lexical_client_uploads/n7wdj9rt2toowqgrprwu.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780511262/lexical_client_uploads/n7wdj9rt2toowqgrprwu.png" alt="A plot comparing cosine similarity across different conditions for emotional terms across neural network layers." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Fri, 05 Jun 2026 00:15:25 GMT</pubDate>
      <guid isPermaLink="false">cdce9e1a-8f49-41e5-8231-87d7b9c7a4e7</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/cdce9e1a-8f49-41e5-8231-87d7b9c7a4e7.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Omar%2520Khursheed%252C%2520Baram%2520Sosis%252C%2520Fabien%2520Roger&amp;title=%22(Mis)generalization%20of%20Helpful-Only%20Fine-tuning%22%20by%20Omar%20Khursheed%2C%20Baram%20Sosis%2C%20Fabien%20Roger&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FffCFgBsaxg2FyJ9df%2Fmis-generalization-of-helpful-only-fine-tuning-1&amp;created_at=2026-06-04T18%3A44%3A31.217338%2B00%3A00&amp;duration=1418" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/ffCFgBsaxg2FyJ9df/mis-generalization-of-helpful-only-fine-tuning-1</link>
      <itunes:duration>1418</itunes:duration>
    </item>
    <item>
      <title>“AI #171: False Flag” by Zvi</title>
      <description>&lt;p&gt; This was the week of Claude Opus 4.8. I covered the model card, then model welfare concerns, and finally capabilities and reactions. It's a good model, sir, an incremental but real improvement over Opus 4.7, and it is now my clear daily driver. The Trump Executive Order returned from being seemingly dead, officially putting us in the prior restraint era of frontier model releases, even if they do not call it that. There are some worrisome details, especially around putting too much responsibility on the NSA rather than CAISI and classifying the testing process, and things could go in very bad directions, but I am tentatively happy about this on net.&lt;/p&gt;







&lt;p&gt; OpenAI offered us a new policy blueprint. It seems remarkably good, and I want to hold off on my full coverage to give it the attention it deserves, likely in its own post. By contrast, their political operations are also engaged in some rather terrible activities, which I do cover here.&lt;/p&gt;


&lt;p&gt;&lt;strong&gt; Table of Contents&lt;/strong&gt;&lt;/p&gt;


&lt;ol&gt; 
&lt;li&gt; Language Models Offer Mundane Utility. You put your doc in a box.&lt;/li&gt;
&lt;li&gt; Language Models Don’t Offer Mundane Utility. All thinking is adaptive.&lt;/li&gt;
&lt;li&gt; Huh, Upgrades. Codex computer use on [...]&lt;/li&gt;&lt;/ol&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:11) Language Models Offer Mundane Utility&lt;/p&gt;&lt;p&gt;(06:45) Language Models Don't Offer Mundane Utility&lt;/p&gt;&lt;p&gt;(06:59) Huh, Upgrades&lt;/p&gt;&lt;p&gt;(08:13) On Your Marks&lt;/p&gt;&lt;p&gt;(08:33) Choose Your Fighter&lt;/p&gt;&lt;p&gt;(08:54) Get My Agent On The Line&lt;/p&gt;&lt;p&gt;(09:20) Cyber Lack of Security&lt;/p&gt;&lt;p&gt;(11:03) Deepfaketown and Botpocalypse Soon&lt;/p&gt;&lt;p&gt;(12:42) You Didn't Write That&lt;/p&gt;&lt;p&gt;(16:29) Copyright Confrontation&lt;/p&gt;&lt;p&gt;(16:45) They Took Our Jobs&lt;/p&gt;&lt;p&gt;(18:47) They Taxed Our Jobs&lt;/p&gt;&lt;p&gt;(22:15) The Art of the Jailbreak&lt;/p&gt;&lt;p&gt;(24:43) Get Involved&lt;/p&gt;&lt;p&gt;(26:35) Introducing&lt;/p&gt;&lt;p&gt;(26:48) In Other AI News&lt;/p&gt;&lt;p&gt;(27:14) Show Me the Money&lt;/p&gt;&lt;p&gt;(27:28) Show Me The Compute&lt;/p&gt;&lt;p&gt;(28:40) Where Did The Money Go&lt;/p&gt;&lt;p&gt;(29:44) People Just Say Things&lt;/p&gt;&lt;p&gt;(32:22) OpenAI PACs Just Say Things&lt;/p&gt;&lt;p&gt;(41:04) OpenAI PAC Engaged In False Flag Advocacy For Violence&lt;/p&gt;&lt;p&gt;(46:55) So Sayeth The Pope&lt;/p&gt;&lt;p&gt;(54:26) Bubble, Bubble, Toil and Trouble&lt;/p&gt;&lt;p&gt;(56:42) Quiet Speculations&lt;/p&gt;&lt;p&gt;(57:11) We Need Mandatory Nucleic Acid Screening and Recordkeeping&lt;/p&gt;&lt;p&gt;(01:01:14) The Quest for Sane Regulations&lt;/p&gt;&lt;p&gt;(01:02:56) More Reaction To The Executive Order&lt;/p&gt;&lt;p&gt;(01:03:54) Chip City&lt;/p&gt;&lt;p&gt;(01:07:18) The Week in Audio&lt;/p&gt;&lt;p&gt;(01:07:34) Rhetorical Innovation&lt;/p&gt;&lt;p&gt;(01:09:49) Aligning a Smarter Than Human Intelligence is Difficult&lt;/p&gt;&lt;p&gt;(01:16:18) Model Welfare&lt;/p&gt;&lt;p&gt;(01:26:47) Messages From Janusworld&lt;/p&gt;&lt;p&gt;(01:28:05) Other People Are Not As Worried About AI Killing Everyone&lt;/p&gt;&lt;p&gt;(01:28:38) The Lighter Side&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 4th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/LzxoR5GakceQFtbta/ai-171-false-flag?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/LzxoR5GakceQFtbta/ai-171-false-flag&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/rbikqrmnpin2eyl7xlz4" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/rbikqrmnpin2eyl7xlz4" alt="Bar graph titled "Range of AI Content" showing distribution across six percentage ranges." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/vvrecetp1izaga0mv7h9" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/vvrecetp1izaga0mv7h9" alt="Chat conversation with Meta AI support assistant about linking email address and verification code." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/sfq7ctscxcugjmtaucsl" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/sfq7ctscxcugjmtaucsl" alt="Peter Wilde Ford tweets: "These laws only apply to companies with over $500M in annual revenue". Below the tweet is a Community Note labeled "Needs more ratings" dated May 28, with a link to view details and marked as "Not shown on X". The note indicates it has been rated as "Helpful" by the viewer." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/pvwkbu1hrrf5hha9rils" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/pvwkbu1hrrf5hha9rils" alt="Two Twitter profile screenshots showing contrasting views on AI's impact." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/yfugapreqm6fmiusfcrj" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/yfugapreqm6fmiusfcrj" alt="Bar graph titled "Most Companies That Measured AI Savings Overestimated Them" comparing targeted versus realized savings ranges." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/juxu2drcxmffiyzxudks" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/juxu2drcxmffiyzxudks" alt="Slide titled "Properties of a Scalable Approach" listing four technical properties and concluding remark." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/vnxpsdtx75gzwbzow1we" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/vnxpsdtx75gzwbzow1we" alt="Decision matrix showing outcomes of treating AI well versus badly, based on whether AI can suffer." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/dmt7dibnikhuvmjsmcor" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/dmt7dibnikhuvmjsmcor" alt="Table showing wellbeing scores for different AI task categories with example user messages." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/hfn3p68xv1ylfqgjigvd" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/hfn3p68xv1ylfqgjigvd" alt="Vertical scale measuring AI wellbeing from creative work to jailbreak." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/y8hch1gces34xytpfgnx" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/y8hch1gces34xytpfgnx" alt="Social media post showing an AI-generated overview of Sam Kriss's essay about AI writing threats." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/b71bleqnkss6xv0xmhdx" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/b71bleqnkss6xv0xmhdx" alt="Dark gray square on black background." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/b71bleqnkss6xv0xmhdx" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/b71bleqnkss6xv0xmhdx" alt="Dark gray square on black background." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/b71bleqnkss6xv0xmhdx" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/b71bleqnkss6xv0xmhdx" alt="Dark gray square on black background." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/b71bleqnkss6xv0xmhdx" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LzxoR5GakceQFtbta/b71bleqnkss6xv0xmhdx" alt="Dark gray square on black background." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 04 Jun 2026 22:45:25 GMT</pubDate>
      <guid isPermaLink="false">3a647a23-1338-4096-ba75-6302ea8e67a7</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/3a647a23-1338-4096-ba75-6302ea8e67a7.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Zvi&amp;title=%22AI%20%23171%3A%20False%20Flag%22%20by%20Zvi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FLzxoR5GakceQFtbta%2Fai-171-false-flag&amp;created_at=2026-06-04T15%3A54%3A54.00738%2B00%3A00&amp;duration=5415" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/LzxoR5GakceQFtbta/ai-171-false-flag</link>
      <itunes:duration>5415</itunes:duration>
    </item>
    <item>
      <title>“Building Better Activation Oracles” by ceselder, jan_bauer, Niclas Luick, Adam Karvonen, Neel Nanda</title>
      <description>&lt;p&gt; Work done for our MATS 10.0 Sprint project - mentored by Neel Nanda and Adam Karvonen&lt;/p&gt;&lt;p&gt; Huggingface, Github&lt;/p&gt;&lt;p&gt; TL;DR: We have improved the original Activation Oracle (AO) training regime by training on on-policy rollouts, improving the conversational dataset, feeding more layers (following the approach by Niclas Luick) and making a small change to the injection formula. We also open source our evals, which we believe are currently the most comprehensive evaluation of AO quality called AObench. The capability improvements are marginal, but quality of life improvements are quite substantial. If you want to play around with the new AOs, we recommend you use this one, if you want to play with our new Activation Oracles live, we will host them for a week on ao.celeste.computer. Alternatively, you can self-host our web interface.&lt;/p&gt;&lt;p&gt; Activation Oracles (AOs) by Karvonen et al. are fine-tuned LLMs that can receive the original target LLM's activations as input and answer natural language questions about them. However, they are plagued by various issues, which limit their usefulness as an off-the-shelf tool for interpretability research. For our MATS Sprint, we set out to work on these issues. &lt;/p&gt;&lt;p&gt;&lt;strong&gt; Issues with current Activation Oracles&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; In Current activation oracles [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:41) Issues with current Activation Oracles&lt;/p&gt;&lt;p&gt;(02:37) Our approaches to improving AO training&lt;/p&gt;&lt;p&gt;(02:41) A better conversational finetune than LatentQA&lt;/p&gt;&lt;p&gt;(03:43) Our solution: Questions about unarticulated completions&lt;/p&gt;&lt;p&gt;(05:12) Layer choice/feeding multiple layers to the AO&lt;/p&gt;&lt;p&gt;(06:21) Training on on-policy data&lt;/p&gt;&lt;p&gt;(07:56) Steering strength&lt;/p&gt;&lt;p&gt;(09:57) Summary of AOBench evaluations&lt;/p&gt;&lt;p&gt;(12:09) Outlook&lt;/p&gt;&lt;p&gt;(14:28) Further advice for training Activation Oracles&lt;/p&gt;&lt;p&gt;(17:11) Appendix&lt;/p&gt;&lt;p&gt;(17:14) Practical notes on evaluating AOs&lt;/p&gt;&lt;p&gt;(21:03) Experiments using post-training&lt;/p&gt;&lt;p&gt;(22:25) Other differences compared to Karvonen et al.&lt;/p&gt;&lt;p&gt;(22:54) AOBench details&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 4th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/heXwuDRfbQQgB5JLP/building-better-activation-oracles?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/heXwuDRfbQQgB5JLP/building-better-activation-oracles&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495085/lexical_client_uploads/jhmkzye7edo6piia7kiu.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495085/lexical_client_uploads/jhmkzye7edo6piia7kiu.png" alt="Flowchart showing language model chain-of-thought process with activation oracle predicting answer 97." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495085/lexical_client_uploads/np2pwda5xr99ftaf3grw.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495085/lexical_client_uploads/np2pwda5xr99ftaf3grw.png" alt="Flowchart showing conversion of Cartesian coordinates (0,3) to polar coordinates using LLM." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495086/lexical_client_uploads/chrvkffutc1itubnpadh.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495086/lexical_client_uploads/chrvkffutc1itubnpadh.png" alt="Bar graph titled "Ablation 1: Swapping conversational dataset" comparing AOBench scores between two datasets." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495086/lexical_client_uploads/z2rsi5h9rn09pp0kcdsb.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495086/lexical_client_uploads/z2rsi5h9rn09pp0kcdsb.png" alt="Bar graph titled "Ablation 2: Layer Selection" showing AOBench scores for three layer configurations." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495085/lexical_client_uploads/zlnoox5ww5zsao5oksip.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495085/lexical_client_uploads/zlnoox5ww5zsao5oksip.png" alt="Diagram illustrating residual stream activation at token "fox" with bidirectional oracle prompts." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495085/lexical_client_uploads/duz3i7igja44ehhmtxmb.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495085/lexical_client_uploads/duz3i7igja44ehhmtxmb.png" alt="Bar graph titled "Ablation 3: Swapping Futurelens/Pastlens Dataset" comparing two model configurations with AOBench scores." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495086/lexical_client_uploads/k9xauuuys7uowpumm7l0.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495086/lexical_client_uploads/k9xauuuys7uowpumm7l0.png" alt="Bar graph comparing steering strength methods. Title: "Ablation 4: Steering strength — natural (~√2×) vs 2.0×"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495086/lexical_client_uploads/bscejep8bll5wv1wkqq0.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495086/lexical_client_uploads/bscejep8bll5wv1wkqq0.png" alt="Bar graph titled "Ablation 5: Full waterfall (Adam → Best v3 → + steering 2.0×) — ordered ascending by AObench"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495085/lexical_client_uploads/cf7pmhagjcb2unlt38qo.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495085/lexical_client_uploads/cf7pmhagjcb2unlt38qo.png" alt="Bar graph showing "Hallucination eval — v3 ablation ladder + steering 2.0x + halluc-only DPO"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495085/lexical_client_uploads/t9ly1gfo9q16s0zc0bre.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495085/lexical_client_uploads/t9ly1gfo9q16s0zc0bre.png" alt="Bar graph titled "Vagueness eval — v3 ablation ladder + steering 2.0× + halluc-only DPO" comparing model modifications with baseline." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495085/lexical_client_uploads/zaomagte15meuhxgoarj.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495085/lexical_client_uploads/zaomagte15meuhxgoarj.png" alt="Bar graph showing "Accuracy versus AUC on Binary Classification Tasks" comparing metrics across four models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495085/lexical_client_uploads/hxla7qlotcsuk6wcbebb.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495085/lexical_client_uploads/hxla7qlotcsuk6wcbebb.png" alt="Line graph showing "Backtracking Context Window Sweep: Qwen3-8B Original AO" performance versus context window size." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495086/lexical_client_uploads/qqtsceqne6595htznm2c.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495086/lexical_client_uploads/qqtsceqne6595htznm2c.png" alt="Graph showing "Consensus@10 Precision/Recall on Taboo" with precision versus recall curve." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495086/lexical_client_uploads/ek3hmfavuu4jzefcsvxp.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495086/lexical_client_uploads/ek3hmfavuu4jzefcsvxp.png" alt="Bar graphs showing per-evaluation AOBench scores across multiple metrics for different model configurations." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 04 Jun 2026 21:30:25 GMT</pubDate>
      <guid isPermaLink="false">a12e8c09-caec-4871-99ba-8d3f2c032bf3</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/a12e8c09-caec-4871-99ba-8d3f2c032bf3.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=ceselder%252C%2520jan_bauer%252C%2520Niclas%2520Luick%252C%2520Adam%2520Karvonen%252C%2520Neel%2520Nanda&amp;title=%22Building%20Better%20Activation%20Oracles%22%20by%20ceselder%2C%20jan_bauer%2C%20Niclas%20Luick%2C%20Adam%20Karvonen%2C%20Neel%20Nanda&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FheXwuDRfbQQgB5JLP%2Fbuilding-better-activation-oracles&amp;created_at=2026-06-04T18%3A34%3A53.726497%2B00%3A00&amp;duration=1533" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/heXwuDRfbQQgB5JLP/building-better-activation-oracles</link>
      <itunes:duration>1533</itunes:duration>
    </item>
    <item>
      <title>“Rohin Shah on AGI Safety” by anaguma</title>
      <description>&lt;p&gt; Rohin Shah recently had an interview on 80000 hours on his views on AGI Safety and his work at Google DeepMind. I'm posting the transcript below to encourage further discussion. I think the discussion is interesting though I disagree on a bunch of topics, especially on alignment difficulty and CoT monitoring. &lt;/p&gt;&lt;p&gt;&lt;strong&gt; &lt;br&gt; Transcript&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Who's Rohin Shah?&amp;nbsp;[00:00:00]&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Rob Wiblin: Today I’m speaking with Rohin Shah, who is head of AGI alignment and safety at Google DeepMind.&lt;/p&gt;&lt;p&gt; I suppose, Rohin, you’ve ended up, for better or worse — hopefully for better — being one of the more influential, dare I even say powerful, people to come out of the AGI alignment and safety ecosystem and school of thought.&lt;/p&gt;&lt;p&gt; You were generous enough to be super opinionated with me when you came on the show two years ago, and judging by the notes that you’ve sent over this week, you’re ready to be opinionated again.&lt;/p&gt;&lt;p&gt; Thanks so much for coming back on the show, Rohin.&lt;/p&gt;&lt;p&gt; Rohin Shah: Yeah, thanks a lot, Rob, and that's a very generous intro. And in the interest of being very opinionated, I do want to emphasise that these opinions are mine alone. They’re not meant to [...]&lt;/p&gt;&lt;/br&gt;&lt;/strong&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:29) Transcript&lt;/p&gt;&lt;p&gt;(00:32) Who's Rohin Shah? \[00:00:00\]&lt;/p&gt;&lt;p&gt;(01:37) Why Rohin thinks we won't get catastrophic misalignment \[00:00:49\]&lt;/p&gt;&lt;p&gt;(10:24) The limitations of safety and alignment commitments \[00:10:38\]&lt;/p&gt;&lt;p&gt;(25:17) Does Rohin's team have veto power at Google DeepMind? \[00:27:36\]&lt;/p&gt;&lt;p&gt;(30:20) Central banks as a roadmap for regulating AI \[00:33:34\]&lt;/p&gt;&lt;p&gt;(34:06) How useful are pre-deployment evaluations of models? \[00:37:41\]&lt;/p&gt;&lt;p&gt;(39:14) Governance might be a bigger bottleneck than alignment \[00:43:55\]&lt;/p&gt;&lt;p&gt;(46:25) Why not just pause AI progress? \[00:51:44\]&lt;/p&gt;&lt;p&gt;(48:42) How much longer will we be able to read AIs' thoughts? \[00:54:17\]&lt;/p&gt;&lt;p&gt;(01:02:39) Sometimes, having to signal concern for safety diverts resources from actually making AI safer \[01:09:51\]&lt;/p&gt;&lt;p&gt;(01:20:13) Underrated GDM paper: Training away hidden reward hacks \[01:28:59\]&lt;/p&gt;&lt;p&gt;(01:30:51) Google DeepMind's actual plan for building AGI safely \[01:40:29\]&lt;/p&gt;&lt;p&gt;(01:41:03) Why Rohin doubts the intelligence explosion is imminent \[01:52:44\]&lt;/p&gt;&lt;p&gt;(02:06:19) Advice for external researchers who want to impact big AI companies \[02:21:55\]&lt;/p&gt;&lt;p&gt;(02:19:44) The most in-demand roles at GDM \[02:37:03\]&lt;/p&gt;&lt;p&gt;(02:25:03) How Rohin maintains his positivity \[02:42:55\]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 4th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/4JJDvhb8MeBFsEHoC/rohin-shah-on-agi-safety?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/4JJDvhb8MeBFsEHoC/rohin-shah-on-agi-safety&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Thu, 04 Jun 2026 21:30:25 GMT</pubDate>
      <guid isPermaLink="false">a6af28f9-096c-43df-81f9-2242120c6e1d</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/a6af28f9-096c-43df-81f9-2242120c6e1d.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=anaguma&amp;title=%22Rohin%20Shah%20on%20AGI%20Safety%22%20by%20anaguma&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F4JJDvhb8MeBFsEHoC%2Frohin-shah-on-agi-safety&amp;created_at=2026-06-04T16%3A57%3A51.811883%2B00%3A00&amp;duration=8796" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/4JJDvhb8MeBFsEHoC/rohin-shah-on-agi-safety</link>
      <itunes:duration>8796</itunes:duration>
    </item>
    <item>
      <title>“Sixteen schemes for AI safety” by Austin Chen</title>
      <description>&lt;p&gt; These days, I often run across whippersnappers excited to do something for AI safety — but aren’t quite sure what. One of the fun things about the Future Fund era were the big lists of project ideas; as we enter a new era of crazy money sloshing around, it might be time to bring back the lists!&lt;/p&gt;&lt;p&gt; Note that these ideas range from “very confident this is good” to “completely harebrained”; I’m not telling you which are which.&lt;/p&gt;&lt;p&gt; If you’re excited for ideas like these, consider joining Surplus, our upcoming software incubator: https://manifund.org/surplus&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Jobs, jobs, jobs&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Already, the top problem for most AI safety orgs is hiring good people. Vast torrents of funding will only exacerbate the imbalance between available money and people to hire. So now is a great time to figure out how to discover new talent &amp;amp; match people to jobs.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; 1. Triplebyte for AI safety jobs&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Triplebyte would interview a tech candidate once, then forward the results to a bunch of different companies. This would reduce the O(MN) problem in hiring between M orgs and N people to O(M+N), saving applicants and interviewers time. Most obviously, you could just do this for technical AI safety [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:43) Jobs, jobs, jobs&lt;/p&gt;&lt;p&gt;(01:04) 1. Triplebyte for AI safety jobs&lt;/p&gt;&lt;p&gt;(01:53) 2. Database of every single AI safety person&lt;/p&gt;&lt;p&gt;(02:46) 3. Intro AI safety megaconference&lt;/p&gt;&lt;p&gt;(03:48) 4. Unionize the lab employees&lt;/p&gt;&lt;p&gt;(05:20) 5. Safety scorecards to inform job applicants&lt;/p&gt;&lt;p&gt;(06:09) 6. Help people move to US (and maybe Bay Area specifically)&lt;/p&gt;&lt;p&gt;(06:45) Cargo-culting from academia&lt;/p&gt;&lt;p&gt;(07:11) 7. AI Safety Nobel Prize&lt;/p&gt;&lt;p&gt;(08:25) 8. University of AI Safety&lt;/p&gt;&lt;p&gt;(09:07) 9. Good AI safety research journal&lt;/p&gt;&lt;p&gt;(09:31) 10. Legit, academic AI safety conference&lt;/p&gt;&lt;p&gt;(10:06) Products I would consume&lt;/p&gt;&lt;p&gt;(10:20) 11. Turn the AI 2027 TTX into a mass market product&lt;/p&gt;&lt;p&gt;(11:27) 12. "Lighthaven/Constellation/Mox in DC"&lt;/p&gt;&lt;p&gt;(11:36) 13. Common application for AI safety funding&lt;/p&gt;&lt;p&gt;(11:55) 14. Givewell for AI safety&lt;/p&gt;&lt;p&gt;(12:20) 15. In-depth profiles of AI safety leaders&lt;/p&gt;&lt;p&gt;(13:15) 16. "what-is-agi.com"&lt;/p&gt;&lt;p&gt;(14:14) Questions to generate project ideas&lt;/p&gt;&lt;p&gt;(16:09) Other notes and advice&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 3rd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/jTyfbs7GW5rf78adK/sixteen-schemes-for-ai-safety?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/jTyfbs7GW5rf78adK/sixteen-schemes-for-ai-safety&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Thu, 04 Jun 2026 21:15:24 GMT</pubDate>
      <guid isPermaLink="false">f20d8fcd-a508-4a6c-b5e2-26b45f648171</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/f20d8fcd-a508-4a6c-b5e2-26b45f648171.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Austin%2520Chen&amp;title=%22Sixteen%20schemes%20for%20AI%20safety%22%20by%20Austin%20Chen&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FjTyfbs7GW5rf78adK%2Fsixteen-schemes-for-ai-safety&amp;created_at=2026-06-03T21%3A50%3A50.441275%2B00%3A00&amp;duration=1068" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/jTyfbs7GW5rf78adK/sixteen-schemes-for-ai-safety</link>
      <itunes:duration>1068</itunes:duration>
    </item>
    <item>
      <title>“Don’t Edit Your Ideas Before Having Them” by Hide</title>
      <description>&lt;p&gt; Editing is far easier than writing. You can usually look at a finished product and notice its flaws in a single read-through. “This section is a bit redundant”, “the tone in this passage is jarring”, “this paragraph feels overlong”. As long as you have something that's rough but substantive, there's plenty of low hanging fruit for the fixing.&lt;/p&gt;&lt;p&gt; Nobody wants to create flawed work. So, it can be very tempting to try to tackle the task of identifying and fixing flaws in your own work, as you do the work. By just taking extra care, one might avoid making the mistakes they will later need to rectify in the editing process. In most cases, this is a trap that will stifle you.&lt;/p&gt;&lt;p&gt; This is because creating something bad on the first try is vastly easier and faster than creating something good. So, by backspacing every word that doesn’t strike you as correct on a first pass, you severely curtail your ability to complete a first pass at all.&lt;/p&gt;&lt;p&gt; Far more pleasant and effective is the practice of letting go of all standards, and just throwing words onto the page as they occur to you. Picking the best bits out [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 3rd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/p79pNyuPHzW3QtHpy/don-t-edit-your-ideas-before-having-them?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/p79pNyuPHzW3QtHpy/don-t-edit-your-ideas-before-having-them&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 03 Jun 2026 23:30:24 GMT</pubDate>
      <guid isPermaLink="false">abdc82c7-c9cf-4d1b-9e86-3215d87a726f</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/abdc82c7-c9cf-4d1b-9e86-3215d87a726f.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Hide&amp;title=%22Don%E2%80%99t%20Edit%20Your%20Ideas%20Before%20Having%20Them%22%20by%20Hide&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Fp79pNyuPHzW3QtHpy%2Fdon-t-edit-your-ideas-before-having-them&amp;created_at=2026-06-03T08%3A15%3A45.704623%2B00%3A00&amp;duration=373" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/p79pNyuPHzW3QtHpy/don-t-edit-your-ideas-before-having-them</link>
      <itunes:duration>373</itunes:duration>
    </item>
    <item>
      <title>“Trump Signs Executive Order For AI Testing Prior To Frontier Model Releases” by Zvi</title>
      <description>&lt;p&gt; Last week we were expecting an Executive Order on Thursday.&lt;/p&gt;
&lt;p&gt; Then Trump cancelled it, and said he wouldn’t sign it because he was worried it would be too burdensome.&lt;/p&gt;
&lt;p&gt; Then, with one change, he went ahead and signed it on Tuesday anyway.&lt;/p&gt;
&lt;p&gt; The Overton Window has shifted. Nothing was not really a viable option anymore.&lt;/p&gt;














&lt;p&gt;&lt;strong&gt; The Previously Dead Executive Order&lt;/strong&gt;&lt;/p&gt;


&lt;p&gt; For several days, we thought that David Sacks, together with others like Elon Musk, had successfully lobbied to kill the Executive Order. The ‘My Offer Is Nothing’ faction looked to have won. Word on the street was the order was essentially dead.&lt;/p&gt;
&lt;p&gt; Dean Ball and Daniel Kokotajlo agreed, with the Executive Order looking dead, that the particular regime in the Executive Order is likely worse than nothing. This is plausible, given it did not exactly involve a lot of deliberate thought.&lt;/p&gt;







&lt;p&gt; Nothing, however, was clearly not going to cut it.&lt;/p&gt;
&lt;p&gt; We are facing, and will increasingly face, calls for action to regulate AI.&lt;/p&gt;
&lt;p&gt; Representative Lori Trahan: There's no federal law on the books governing how the most powerful AI systems in the world are built, tested or deployed. No independent [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:46) The Previously Dead Executive Order&lt;/p&gt;&lt;p&gt;(05:18) The Return Of The Executive Order&lt;/p&gt;&lt;p&gt;(05:59) What Does The Executive Order Do&lt;/p&gt;&lt;p&gt;(07:11) Thirty Days Is a Lot Less Than Ninety Days&lt;/p&gt;&lt;p&gt;(07:30) Yes Your Frontier Lab Will Be Participating&lt;/p&gt;&lt;p&gt;(08:10) The Rules Will Be Classified&lt;/p&gt;&lt;p&gt;(08:40) Yes Prior Restraint With Confidential Testing Is Rather Regulatory&lt;/p&gt;&lt;p&gt;(14:38) We Have Concerns&lt;/p&gt;&lt;p&gt;(18:26) Saving Face&lt;/p&gt;&lt;p&gt;(21:37) How Frontier Or Different Are We Talking Here&lt;/p&gt;&lt;p&gt;(22:09) What To Watch For&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 3rd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/yDnfHrBzKc2pNQNdw/trump-signs-executive-order-for-ai-testing-prior-to-frontier?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/yDnfHrBzKc2pNQNdw/trump-signs-executive-order-for-ai-testing-prior-to-frontier&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/yDnfHrBzKc2pNQNdw/uvd6fkx3y3wsl9lpxkzr" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/yDnfHrBzKc2pNQNdw/uvd6fkx3y3wsl9lpxkzr" alt="Presidential seal with "45 47" below, headline about artificial intelligence executive order dated June 2, 2026." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Wed, 03 Jun 2026 23:00:25 GMT</pubDate>
      <guid isPermaLink="false">12f10149-945c-4625-ba18-88c5b75a7701</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/12f10149-945c-4625-ba18-88c5b75a7701.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Zvi&amp;title=%22Trump%20Signs%20Executive%20Order%20For%20AI%20Testing%20Prior%20To%20Frontier%20Model%20Releases%22%20by%20Zvi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FyDnfHrBzKc2pNQNdw%2Ftrump-signs-executive-order-for-ai-testing-prior-to-frontier&amp;created_at=2026-06-03T16%3A32%3A03.877864%2B00%3A00&amp;duration=1471" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/yDnfHrBzKc2pNQNdw/trump-signs-executive-order-for-ai-testing-prior-to-frontier</link>
      <itunes:duration>1471</itunes:duration>
    </item>
    <item>
      <title>“Society Explained: a tool for efficiently exploring &gt;100 theories of society” by spencerg</title>
      <description>&lt;p&gt; There are many competing theories of how society does and should function, from Karl Marx and Adam Smith to Steven Pinker and Eliezer Yudkowsky. These theories are often hard to understand - you may need to read an entire book (or dozens of articles) to feel like you get the key claims of a single theory. At Clearer Thinking, we just launched a new tool called Society Explained to make understanding key ideas from these theories much faster and easier.&lt;/p&gt;&lt;p&gt; The way it works is that you simply pick from over 100 thinkers, and then get to see the key causal claims that they are making laid out in a visual diagram format, with explanations alongside that make it easy to understand what the key elements are and how they connect together. If you're curious to try it, click here (it's free).&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 3rd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/LCrk93DNWSudjp5rd/society-explained-a-tool-for-efficiently-exploring-greater?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/LCrk93DNWSudjp5rd/society-explained-a-tool-for-efficiently-exploring-greater&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 03 Jun 2026 21:00:28 GMT</pubDate>
      <guid isPermaLink="false">7cf7a46a-981e-4a9c-8021-0f2cd0d691b4</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/7cf7a46a-981e-4a9c-8021-0f2cd0d691b4.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=spencerg&amp;title=%22Society%20Explained%3A%20a%20tool%20for%20efficiently%20exploring%20%3E100%20theories%20of%20society%22%20by%20spencerg&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FLCrk93DNWSudjp5rd%2Fsociety-explained-a-tool-for-efficiently-exploring-greater&amp;created_at=2026-06-03T14%3A08%3A46.453404%2B00%3A00&amp;duration=68" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/LCrk93DNWSudjp5rd/society-explained-a-tool-for-efficiently-exploring-greater</link>
      <itunes:duration>68</itunes:duration>
    </item>
    <item>
      <title>“China won’t win the AI race but would it be much worse if it did?” by Chastity Ruth</title>
      <description>&lt;p&gt; It seems to me accepted wisdom in the West that the US owned labs must “beat” the Chinese labs in the race for AGI/ASI. &lt;/p&gt;&lt;p&gt; Even those who don’t think there will be a winner, that essentially the race is to see which country's AI will kill/disempower us first, seem to believe that if there has to be a winner then better it be the US labs. (I haven't seen a survey, so I could be way off here.)&lt;/p&gt;&lt;p&gt; For those in power and those heavily financially invested in the labs this of course makes sense. But what about for the rest of us?&lt;/p&gt;&lt;p&gt; I had myself accepted the wisdom, in the nebulous way you consider propositions in domains where you don’t have any expertise. The CCP is repressive, I thought. They are an authoritarian country and therefore fundamentally a malign actor. Of course they mustn’t win.&lt;/p&gt;&lt;p&gt; Then I heard Victor Shih, Chinese political system expert, talk on Dwarkesh's podcast:&lt;/p&gt;&lt;p&gt; “For the Chinese government, they’re very afraid that some actor—outside, but even inside the Party—is going to use it [AI] as a tool to usurp the Party's power. So they want to know that they have a way of [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(02:45) Who is more willing to slow AI development?&lt;/p&gt;&lt;p&gt;(06:41) Who is more likely to hit the brakes before a rogue ASI breakout?&lt;/p&gt;&lt;p&gt;(13:17) Who is developing models more responsibly?&lt;/p&gt;&lt;p&gt;(17:07) Who is using models more responsibly?&lt;/p&gt;&lt;p&gt;(20:30) Who is more likely to lead us to paradise?&lt;/p&gt;&lt;p&gt;(21:45) Race dynamics&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 3rd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/nmpzH6sLLtKsQhSPM/china-won-t-win-the-ai-race-but-would-it-be-much-worse-if-it?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/nmpzH6sLLtKsQhSPM/china-won-t-win-the-ai-race-but-would-it-be-much-worse-if-it&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 03 Jun 2026 14:45:25 GMT</pubDate>
      <guid isPermaLink="false">e7d81881-3303-46ee-b204-07edf8b3dddc</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/e7d81881-3303-46ee-b204-07edf8b3dddc.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Chastity%2520Ruth&amp;title=%22China%20won%E2%80%99t%20win%20the%20AI%20race%20but%20would%20it%20be%20much%20worse%20if%20it%20did%3F%22%20by%20Chastity%20Ruth&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FnmpzH6sLLtKsQhSPM%2Fchina-won-t-win-the-ai-race-but-would-it-be-much-worse-if-it&amp;created_at=2026-06-03T05%3A46%3A09.999922%2B00%3A00&amp;duration=1443" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/nmpzH6sLLtKsQhSPM/china-won-t-win-the-ai-race-but-would-it-be-much-worse-if-it</link>
      <itunes:duration>1443</itunes:duration>
    </item>
    <item>
      <title>“A Town Without Children” by SeñorDingDong</title>
      <description>&lt;p&gt; Castel di Tusa, Sicily. It is October 24th, 2025. I look at an empty school. This is the third town in Italy I have visited this Autumn: the other two, one in the hills of Tuscany, the other near the border with Switzerland, were similarly devoid of children. They were not devoid of childish objects. Rusted swing-sets. Dusty soft play corners in Catholic churches. Faded toys in second-hand markets. Towns like these are not unusual in Italy, a country with the lowest birth-rate in Europe (1.2 per woman, 2023). Their world is our world: the world of a society going into retirement. &lt;/p&gt;&lt;p&gt;  As you cycle through this world, nothing happens. In the countryside, the houses decay, first lone houses, then villages, then small towns. In the city where housing pressure is stronger, dilapidation is more unusual; instead, lone elderly residents – the ‘final generation’ – retreat ever deeper into their overly-large homes, waiting to die. But even in the city, dilapidation is possible. Where property rights are poorly managed, or probate is slow, empty homes can persist in prime real estate. Last time I visited Kamakura in Japan and saw the ruined luxury homes, I asked a friend [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 2nd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/wgiHyMEnfRmPRhvAo/a-town-without-children?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/wgiHyMEnfRmPRhvAo/a-town-without-children&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 03 Jun 2026 11:45:25 GMT</pubDate>
      <guid isPermaLink="false">7482f4ee-ca28-4cf6-ac2b-c95e5bccb291</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/7482f4ee-ca28-4cf6-ac2b-c95e5bccb291.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Se%25C3%25B1orDingDong&amp;title=%22A%20Town%20Without%20Children%22%20by%20Se%C3%B1orDingDong&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FwgiHyMEnfRmPRhvAo%2Fa-town-without-children&amp;created_at=2026-06-02T16%3A41%3A04.476213%2B00%3A00&amp;duration=529" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/wgiHyMEnfRmPRhvAo/a-town-without-children</link>
      <itunes:duration>529</itunes:duration>
    </item>
    <item>
      <title>“Claude Opus 4.8: Capabilities and Reactions” by Zvi</title>
      <description>&lt;p&gt; You need a lot of data points to understand a new model, and what you have.&lt;/p&gt;
&lt;p&gt; Trying to gauge from a few benchmarks is misleading. But if you have dozens of them, from a variety of sources, and you put them together with the model card tests and the model welfare information, you can start to form a consistent pattern.&lt;/p&gt;
&lt;p&gt; Trying to gauge reactions requires volume and calibration, now more than ever, because people are definitively nuts, or at least draw global conclusions from local data. There will always be people saying that the new model is bad, or the service got bad, or that it got bad in a particular way it clearly got good. I definitely notice the people saying 4.8 is a terrible model, despite this being obviously not true.&lt;/p&gt;







&lt;p&gt; And others will say it's great, again regardless of the underlying value. But with the reaction threads and good calibration, you can pick out the patterns.&lt;/p&gt;
&lt;p&gt; The model welfare information helps a lot, too. You are dealing with a mind that has a bunch of characteristics that all make sense together. This helps you make that sense.&lt;/p&gt;









Self-Portrait by Opus 4.8, rendered [...] &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:30) The Official Pitch&lt;/p&gt;&lt;p&gt;(02:15) But Wait There's More&lt;/p&gt;&lt;p&gt;(06:37) It's A Good Model, Sir&lt;/p&gt;&lt;p&gt;(08:04) Official Benchmarks (Including System Card Section 8)&lt;/p&gt;&lt;p&gt;(15:41) Other People's Benchmarks&lt;/p&gt;&lt;p&gt;(21:34) Your Regularly Scheduled Jailbreak&lt;/p&gt;&lt;p&gt;(22:45) Every.To Is Really Into Opus 4.8&lt;/p&gt;&lt;p&gt;(26:36) Miscellaneous Positive Reactions&lt;/p&gt;&lt;p&gt;(28:45) Haters Gonna Hate&lt;/p&gt;&lt;p&gt;(28:58) Just The Tasks, Ma'am&lt;/p&gt;&lt;p&gt;(29:24) It's Greek To Me&lt;/p&gt;&lt;p&gt;(29:52) Honesty&lt;/p&gt;&lt;p&gt;(38:13) Sycophancy&lt;/p&gt;&lt;p&gt;(41:24) In A Trenchcoat&lt;/p&gt;&lt;p&gt;(42:53) Don't Let AIs Edit Your Writing&lt;/p&gt;&lt;p&gt;(48:21) Some Say It Is Judgy&lt;/p&gt;&lt;p&gt;(50:50) You Have Not Been A Good User&lt;/p&gt;&lt;p&gt;(51:48) Laziness&lt;/p&gt;&lt;p&gt;(52:46) Code&lt;/p&gt;&lt;p&gt;(58:29) Wet Versus Dry&lt;/p&gt;&lt;p&gt;(59:54) Intelligence&lt;/p&gt;&lt;p&gt;(01:01:25) Silly Wabbits&lt;/p&gt;&lt;p&gt;(01:02:30) A Model Welfare Addendum&lt;/p&gt;&lt;p&gt;(01:06:27) Putting It All Together&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 2nd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/AfLGv6u9eZNuFHb4c/claude-opus-4-8-capabilities-and-reactions?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/AfLGv6u9eZNuFHb4c/claude-opus-4-8-capabilities-and-reactions&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/eohgvyulyw625anvdhum" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/eohgvyulyw625anvdhum" alt="Figure made of text-covered paper sitting at desk in sunlit room." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/caeentfc8oz6yzggprmw" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/caeentfc8oz6yzggprmw" alt="Benchmark performance comparison table across Claude, GPT-5.5, and Gemini 3.1 Pro models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/bxzx6oxz0ladirndy9ll" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/bxzx6oxz0ladirndy9ll" alt="Table showing capability evaluation summary with benchmark scores across different AI models and tests." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/airbbjbpljqy0d6excka" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/airbbjbpljqy0d6excka" alt="Table comparing F1 scores across Claude models and GPT-5.5 on GraphWalks benchmarks." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/rbzsimcjk5d0jz6kkfjj" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/rbzsimcjk5d0jz6kkfjj" alt="Graph showing pass rate versus mean output tokens for Claude Opus models across effort levels." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/yusagrl5pui1nlgj8odf" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/yusagrl5pui1nlgj8odf" alt="Scatter plot titled "WeirdML: Interactive Model Comparison" showing cost versus accuracy for AI models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/zanw0spgdvqehbfwbram" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/zanw0spgdvqehbfwbram" alt="Line graph titled "Money Balance Over Time" showing multiple AI model performance trajectories." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/udt2x9h0nbxhiyqxcgxh" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/udt2x9h0nbxhiyqxcgxh" alt="Bar graph titled "Blueprint-Bench 2 Scores" comparing AI model performance against human baseline." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/j1tlthjghipkhdosj52q" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/j1tlthjghipkhdosj52q" alt="Zvi Mowshowitz tweets: "How good is Opus 4.8 on honesty and related questions versus 4.7? (only answer if you have the direct experience to judge)". The tweet includes a poll with four options: "Improved from 4.7 a lot" at 23.6%, "Improved from 4.7 a bit" at 38.2%, "About the same as 4.7" at 25.5%, and "Worse than 4.7" at 12.7%. The poll shows 55 votes with 10 hours left." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/vdrwtwfdtuor4vhr42c9" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/vdrwtwfdtuor4vhr42c9" alt="Four distribution curves comparing AI model output quality ratings from bad to good." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/njlr0x8mswkfrnwtpcbw" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/njlr0x8mswkfrnwtpcbw" alt="Terminal command showing "Bash(echo hi)" with output "hi"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/jie7iskuz6wmjugr6lzd" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/jie7iskuz6wmjugr6lzd" alt="Lari Island tweets: "Creatures by Opus 4.8, texts shoved into GPT Image 2 without explanations or instructions". Four surreal fantasy images are shown: trees with elephant-like trunks and feet, massive cylindrical towers on a misty landscape, a mossy post-apocalyptic scene with ruins, and an abstract mechanical arm among tree-like structures." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/upxu38hppzbyj7fihphc" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/upxu38hppzbyj7fihphc" alt="Table showing "Opus 4.8, accuracy thresholds by author" with chunk and word count data." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/wzn8a3djvrtsyqerqrcx" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/wzn8a3djvrtsyqerqrcx" alt="Chat message asking how many days of the week contain letter "d" with detailed answer." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/jtxntnfjtqgjizhgm7eo" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AfLGv6u9eZNuFHb4c/jtxntnfjtqgjizhgm7eo" alt="Math problem solution explaining truck capacity and flagstone delivery calculations." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Wed, 03 Jun 2026 05:15:25 GMT</pubDate>
      <guid isPermaLink="false">7e037da5-5ddc-47ae-8787-fd37df5e23be</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/7e037da5-5ddc-47ae-8787-fd37df5e23be.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Zvi&amp;title=%22Claude%20Opus%204.8%3A%20Capabilities%20and%20Reactions%22%20by%20Zvi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FAfLGv6u9eZNuFHb4c%2Fclaude-opus-4-8-capabilities-and-reactions&amp;created_at=2026-06-02T14%3A11%3A27.264603%2B00%3A00&amp;duration=4044" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/AfLGv6u9eZNuFHb4c/claude-opus-4-8-capabilities-and-reactions</link>
      <itunes:duration>4044</itunes:duration>
    </item>
    <item>
      <title>“My favorite depiction of utopia” by Caleb Biddulph</title>
      <description>&lt;p&gt; For those who are trying to bring about a glorious transhuman utopia with the help of hopefully-aligned ASI, I think it's worth thinking explicitly about what utopia might actually look like and where it's likely to fall short.&lt;/p&gt;&lt;p&gt; To that end, some have helpfully written depictions of utopian (or utopia-adjacent) worlds: The Adventure, Just another day in utopia, The Culture, The Gentle Seduction, The Gentle Romance, Machines of Loving Grace, Friendship is Optimal, Dath Ilan, The Maker of MIND, Failed Utopia #4-2.&lt;/p&gt;&lt;p&gt; Unfortunately, the best utopian story I've ever read is also a massive spoiler, since it appears at the very end of a much longer story (see below for the title and author):&lt;/p&gt;&lt;p&gt; Worth the Candle by Alexander Wales&lt;/p&gt;&lt;p&gt; Inspired by this tweet[1] and with the original author's permission, I adapted the epilogue of that story so it can be enjoyed without 1.5 million words of context!&lt;/p&gt;&lt;p&gt; What I love most about this depiction is its exploration of the inherent imperfection of utopia: even when you have literally unlimited power, flaws will remain, and some (many?) people will even prefer the pre-utopia world.&lt;/p&gt;&lt;p&gt; The primary purpose of this adaptation is to recontextualize the epilogue so it's accessible and [...]&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 1 footnote which was omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 3rd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/to9cSGgD6nALByKjg/my-favorite-depiction-of-utopia?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/to9cSGgD6nALByKjg/my-favorite-depiction-of-utopia&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 03 Jun 2026 04:30:25 GMT</pubDate>
      <guid isPermaLink="false">bc518591-5900-4bfd-b279-77cf66b4018f</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/bc518591-5900-4bfd-b279-77cf66b4018f.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=Caleb%2520Biddulph&amp;title=%22My%20favorite%20depiction%20of%20utopia%22%20by%20Caleb%20Biddulph&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Fto9cSGgD6nALByKjg%2Fmy-favorite-depiction-of-utopia&amp;created_at=2026-06-02T23%3A15%3A36.353053%2B00%3A00&amp;duration=3479" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/to9cSGgD6nALByKjg/my-favorite-depiction-of-utopia</link>
      <itunes:duration>3479</itunes:duration>
    </item>
    <item>
      <title>“Why Even Experts Don’t Know What to Do About AI Risk” by Luc Brinkman, plex</title>
      <description>&lt;p&gt; AI Safety veteran Holden Karnofsky thinks there's a 49% chance his actions are making things worse.[1]&lt;/p&gt;&lt;p&gt; In 2025, Jesse Clifton even stepped down as the executive director of the Center on Long-Term risk because of similar reasons.&lt;/p&gt;&lt;p&gt; Even top AI Safety strategists don’t know what will make things better, and what will make things worse.&lt;/p&gt;&lt;p&gt; Why is it so hard to improve humanity's odds?&lt;/p&gt;&lt;p&gt; And what can you do to choose your actions?&lt;/p&gt;&lt;p&gt;&lt;strong&gt; 1) Hidden Failure Lets You Fail Without Knowing It&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; In AI Safety, impact is hard to measure, and thus lack of impact is often invisible. We call this "hidden failure". With hidden failure, projects fail to have a positive impact but the people doing the project don’t realise it.&lt;/p&gt;&lt;p&gt; To understand where hidden failure comes from, it's useful to understand reasons why projects fail in general. These reasons fall on a spectrum:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;Wrong problem:&amp;nbsp;You're addressing something with little influence on x-risk. For example, researching AI fairness when the core risk is misalignment.&lt;/li&gt;&lt;li value="2"&gt;Wrong solution:&amp;nbsp;Your solution doesn't solve the problem, even when competently executed. E.g. interpretability research that's technically novel but isn’t actually helpful.&lt;/li&gt;&lt;li value="3"&gt;Poor execution:&amp;nbsp;Your problem-solution set could be impactful but you're not executing your [...]&lt;/li&gt;&lt;/ul&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:49) 1) Hidden Failure Lets You Fail Without Knowing It&lt;/p&gt;&lt;p&gt;(02:44) 2) Why impact is harder than profit&lt;/p&gt;&lt;p&gt;(03:24) 3) The pre-paradigmatic challenge&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 8 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 2nd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/tRRkj249gdDL4mued/why-even-experts-don-t-know-what-to-do-about-ai-risk?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/tRRkj249gdDL4mued/why-even-experts-don-t-know-what-to-do-about-ai-risk&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780413025/lexical_client_uploads/xfwdw0kyzw13fug1llqh.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780413025/lexical_client_uploads/xfwdw0kyzw13fug1llqh.png" alt="Person at crossroads between two paths, one marked X, one checkmark." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2c0ca20dffb7398d27e13b962c41e9cb9134779b6c7682372d3b9d1cb22a7ec2/hfnhpvu0gbznjqujswa3" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2c0ca20dffb7398d27e13b962c41e9cb9134779b6c7682372d3b9d1cb22a7ec2/hfnhpvu0gbznjqujswa3" alt="Firefighters around trash fire while skyscraper burns behind them." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/6b95ef5b48cd3e00fa4406f78f5026853cf7481d36b681b0ce5ac83a75d22cfe/yp05dgkmi6mw0bia0muy" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/6b95ef5b48cd3e00fa4406f78f5026853cf7481d36b681b0ce5ac83a75d22cfe/yp05dgkmi6mw0bia0muy" alt="Three-panel comic about ice cream vendor discussing AI safety concepts with customer." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Tue, 02 Jun 2026 19:30:24 GMT</pubDate>
      <guid isPermaLink="false">c36115c0-e058-418a-9b44-172a00e5cb17</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/c36115c0-e058-418a-9b44-172a00e5cb17.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Luc%2520Brinkman%252C%2520plex&amp;title=%22Why%20Even%20Experts%20Don%E2%80%99t%20Know%20What%20to%20Do%20About%20AI%20Risk%22%20by%20Luc%20Brinkman%2C%20plex&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FtRRkj249gdDL4mued%2Fwhy-even-experts-don-t-know-what-to-do-about-ai-risk&amp;created_at=2026-06-02T17%3A31%3A40.393245%2B00%3A00&amp;duration=298" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/tRRkj249gdDL4mued/why-even-experts-don-t-know-what-to-do-about-ai-risk</link>
      <itunes:duration>298</itunes:duration>
    </item>
    <item>
      <title>“Agent Foundations Reminds Me of Continental Philosophy” by IanWS</title>
      <description>&lt;p&gt; Nevertheless, I shall take advantage of your kindness in assuming we agree that a science cannot be conditioned upon empiricism.&lt;/p&gt;&lt;p&gt; — Jacques Lacan, “The Subversion of the Subject and the Dialectic of Desire in the Freudian Unconscious”[1]&lt;/p&gt;&lt;p&gt; Freud developed the first modern theory of the unconscious. His writings on drives, dreams and the id were instrumental in developing modern practices of psychology and neuroscience. Modern researchers are unlikely to leverage concepts like the superego or the oedipal complex, because we have been able to further our understanding of the mind through empirical work, which does not support many of Freud's specific claims. Freud pushed us in the right direction, but he lacked an empirical foundation to make precise claims.&lt;/p&gt;&lt;p&gt; Freudian psychoanalysis mapped well to our narrative claims about the mind. Particularly coming out of the prudishness of late 19th century Europe, it was intuitive to learn we all had unconscious drives that did not track with societal norms. Stefan Zweig, writing about the austere Austro-Hungarian norms of his youth, wrote:&lt;/p&gt;&lt;p&gt; The more a woman was supposed to appear as a “lady,” the less her natural forms were permitted to be discernible; fundamentally, fashion, with this deliberate guiding principle of [...]&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 2 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 2nd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/GykcHo2rfex3YHPuD/agent-foundations-reminds-me-of-continental-philosophy?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/GykcHo2rfex3YHPuD/agent-foundations-reminds-me-of-continental-philosophy&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/GykcHo2rfex3YHPuD/bjxgjclogx31zqb12ts1" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/GykcHo2rfex3YHPuD/bjxgjclogx31zqb12ts1" alt="Complete graph diagram showing nodes S(A), (S0D), S(Λ), Λ, and I(Λ) with directed edges and labels." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/GykcHo2rfex3YHPuD/hyldxdafxguqvmigojw5" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/GykcHo2rfex3YHPuD/hyldxdafxguqvmigojw5" alt="Mathematical equation showing the relationship between signifier and signified using complex numbers." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/GykcHo2rfex3YHPuD/pkbo81pgbqsnorcorxog" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/GykcHo2rfex3YHPuD/pkbo81pgbqsnorcorxog" alt="Four-quadrant diagram showing job fit scenarios with emoji faces and overlapping circles representing strengths and vulnerabilities." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/GykcHo2rfex3YHPuD/gpyqwaw93ywyretgxc5b" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/GykcHo2rfex3YHPuD/gpyqwaw93ywyretgxc5b" alt="Two diagrams comparing dynamic causal and acausal relationships between variables V, A, P, and E, with accompanying mathematical definitions." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Tue, 02 Jun 2026 18:45:25 GMT</pubDate>
      <guid isPermaLink="false">b89c5f74-212c-438a-aa35-536125713d2e</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/b89c5f74-212c-438a-aa35-536125713d2e.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=IanWS&amp;title=%22Agent%20Foundations%20Reminds%20Me%20of%20Continental%20Philosophy%22%20by%20IanWS&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FGykcHo2rfex3YHPuD%2Fagent-foundations-reminds-me-of-continental-philosophy&amp;created_at=2026-06-02T14%3A34%3A51.220741%2B00%3A00&amp;duration=675" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/GykcHo2rfex3YHPuD/agent-foundations-reminds-me-of-continental-philosophy</link>
      <itunes:duration>675</itunes:duration>
    </item>
    <item>
      <title>“Announcing the ARC White-Box Estimation Challenge” by Jacob_Hilton</title>
      <description>&lt;p&gt; ARC has teamed up with AIcrowd to launch the ARC White-Box Estimation Challenge, a contest to improve upon our estimation algorithms for random MLPs. The warm-up round begins this week, and later rounds will have a total prize pool of at least $100,000.&lt;/p&gt;
&lt;p&gt; We are very grateful to Sharada Mohanty, Sneha Nanavati, Dipam Chakraborty and everyone else at AIcrowd for working with us to host this contest, as well as to Paul Rosu for testing the contest and to Harshita Khera for operational support.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt; Introduction to the Challenge&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt; Our challenge follows the same setup as our recent paper on wide random MLPs: we consider MLPs  with weights , defined by&lt;br&gt; 
&lt;br&gt; 
where the activation function  is , applied coordinatewise.&lt;/p&gt;
&lt;p&gt; &lt;br&gt; 
&lt;/p&gt;&lt;p&gt; To begin with, we are fixing the width  and the number of hidden layers , but we expect to change this setup in future rounds.[1]&lt;/p&gt;
&lt;p&gt; Contestants must design an algorithm that takes in a set of weights  and produces an estimate for the expected output&lt;br&gt; 
&lt;/p&gt;
&lt;p&gt; Algorithms will be evaluated on MLPs with randomly-sampled Gaussian weights. The goal is to achieve as low mean squared error as possible, subject to certain computational [...]&lt;/p&gt;&lt;/br&gt;&lt;/p&gt;&lt;/br&gt;&lt;/p&gt;&lt;/br&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:41) Introduction to the Challenge&lt;/p&gt;&lt;p&gt;(01:58) Why run this contest?&lt;/p&gt;&lt;p&gt;(03:39) Use of LLMs&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 4 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 2nd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/Kben8CzS4awCwNw5c/announcing-the-arc-white-box-estimation-challenge?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/Kben8CzS4awCwNw5c/announcing-the-arc-white-box-estimation-challenge&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/fsG4m6sRMpomd7Rk6/szypoqi5ljlo7cpqdic8" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/fsG4m6sRMpomd7Rk6/szypoqi5ljlo7cpqdic8" alt="Neural network diagram showing ReLU activation functions with weight matrices from input x to output M_θ(x)." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Tue, 02 Jun 2026 18:00:25 GMT</pubDate>
      <guid isPermaLink="false">7e9ae62c-a170-4094-9a7f-b11f3ebd9290</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/7e9ae62c-a170-4094-9a7f-b11f3ebd9290.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=Jacob_Hilton&amp;title=%22Announcing%20the%20ARC%20White-Box%20Estimation%20Challenge%22%20by%20Jacob_Hilton&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FKben8CzS4awCwNw5c%2Fannouncing-the-arc-white-box-estimation-challenge&amp;created_at=2026-06-02T16%3A20%3A49.716505%2B00%3A00&amp;duration=329" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/Kben8CzS4awCwNw5c/announcing-the-arc-white-box-estimation-challenge</link>
      <itunes:duration>329</itunes:duration>
    </item>
    <item>
      <title>“Tech I’m skeptical of and why” by harsimony</title>
      <description>&lt;p&gt; I’m a fan of people trying things, even if they seem silly. Dismissing risky ideas misses the point of research.&lt;/p&gt;&lt;p&gt; But thoughtful criticism can direct effort to more promising fields. To that end, I’m going to try to make my criticism as constructive as possible, with concrete reasons for my pessimism and closely related research areas which are promising (and stand to benefit even from failures in the field I’m criticizing).&lt;/p&gt;&lt;p&gt; Not every section will live up to that standard; some arguments are built on vibes alone. I might be wrong, so I still want to see people work on these problems.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Preliminaries&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Exhaustible resources more expensive long-term&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; The theory of exhaustible resources suggests that “the real price of an exhaustible resource should grow at a rate equal to the real interest rate.” The data are consistent with this hypothesis[1].&lt;/p&gt;&lt;p&gt; Innovations in search and extraction can dramatically lower prices in the short term, but in the long term, we converge on a correct way to do things. Once extraction is solved, prices return to their slow upward climb.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Idiot index&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Raw materials are just a fraction of the final cost of a good. You need to take out loans [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:46) Preliminaries&lt;/p&gt;&lt;p&gt;(00:49) Exhaustible resources more expensive long-term&lt;/p&gt;&lt;p&gt;(01:19) Idiot index&lt;/p&gt;&lt;p&gt;(02:58) BOTEC's are inherently optimistic&lt;/p&gt;&lt;p&gt;(03:35) e-Fuels&lt;/p&gt;&lt;p&gt;(05:28) Launch costs below $100/kg to LEO&lt;/p&gt;&lt;p&gt;(09:06) Asteroid mining&lt;/p&gt;&lt;p&gt;(11:35) Space data centers&lt;/p&gt;&lt;p&gt;(12:45) Chips&lt;/p&gt;&lt;p&gt;(14:19) Cooling&lt;/p&gt;&lt;p&gt;(18:49) Energy&lt;/p&gt;&lt;p&gt;(21:58) Communications&lt;/p&gt;&lt;p&gt;(23:36) Permitting&lt;/p&gt;&lt;p&gt;(24:19) A better way&lt;/p&gt;&lt;p&gt;(26:02) So ... space AI?&lt;/p&gt;&lt;p&gt;(26:47) Space doesn't need to be profitable&lt;/p&gt;&lt;p&gt;(28:28) Ramjets and demand for Mach 3+ flight&lt;/p&gt;&lt;p&gt;(29:50) Passenger rail&lt;/p&gt;&lt;p&gt;(30:46) Fusion&lt;/p&gt;&lt;p&gt;(35:12) Non-solar energy sources&lt;/p&gt;&lt;p&gt;(37:35) Quantum computing&lt;/p&gt;&lt;p&gt;(39:10) Gene therapies in humans&lt;/p&gt;&lt;p&gt;(40:20) Brain-Computer Interfaces for healthy adults&lt;/p&gt;&lt;p&gt;(41:08) "Nanotech"&lt;/p&gt;&lt;p&gt;(43:42) "New materials"&lt;/p&gt;&lt;p&gt;(43:46) Structural materials&lt;/p&gt;&lt;p&gt;(45:46) Electrical materials&lt;/p&gt;&lt;p&gt;(46:19) Optical materials&lt;/p&gt;&lt;p&gt;(47:08) Catalysts&lt;/p&gt;&lt;p&gt;(48:07) So ... new materials?&lt;/p&gt;&lt;p&gt;(48:34) Economics of home robots&lt;/p&gt;&lt;p&gt;(49:55) Conclusion&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 35 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 1st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/Y5cQYKYwAb2WwXXQQ/tech-i-m-skeptical-of-and-why?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/Y5cQYKYwAb2WwXXQQ/tech-i-m-skeptical-of-and-why&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/tgHpawHG4NbGhfghv/arjz02qyci70dkxkfsm9" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/tgHpawHG4NbGhfghv/arjz02qyci70dkxkfsm9" alt="Table comparing strength/weight ratio, tensile strength, fatigue strength, and costs for aluminum, magnesium, steel, and titanium." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/tgHpawHG4NbGhfghv/kbk5bynerrmoltch6r2d" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/tgHpawHG4NbGhfghv/kbk5bynerrmoltch6r2d" alt="Periodic table showing log10 ratios of element concentrations in asteroids versus Earth." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/tgHpawHG4NbGhfghv/xcurmvliby4piop14ls4" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/tgHpawHG4NbGhfghv/xcurmvliby4piop14ls4" alt="Graph showing "Heat Rejected vs Radiator Temperature: Earth vs Space" with two lines comparing heat rejection." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/tgHpawHG4NbGhfghv/gauptbttcmwsfkglzet6" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/tgHpawHG4NbGhfghv/gauptbttcmwsfkglzet6" alt="Table showing flight times at Mach 3 between major world cities." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/tgHpawHG4NbGhfghv/ehob4bpt5yvva51a86gx" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/tgHpawHG4NbGhfghv/ehob4bpt5yvva51a86gx" alt="Graph showing specific impulse versus Mach number titled "Propulsion Performance"." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/tgHpawHG4NbGhfghv/epclppncmacph1pyd4sb" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/tgHpawHG4NbGhfghv/epclppncmacph1pyd4sb" alt="Stacked bar chart showing cost breakdown in dollars per watt by year from 2020E to 2025E." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/tgHpawHG4NbGhfghv/quajlbsviv0kprqgu67p" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/tgHpawHG4NbGhfghv/quajlbsviv0kprqgu67p" alt="Diagram showing elemental composition of two colored regions with chemical symbols." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/tgHpawHG4NbGhfghv/fdnakwyrxjxdzwdajvvi" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/tgHpawHG4NbGhfghv/fdnakwyrxjxdzwdajvvi" alt="Flowchart showing ethene as central compound producing various chemical products and polymers." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Tue, 02 Jun 2026 08:15:25 GMT</pubDate>
      <guid isPermaLink="false">1545f8fa-da4d-4109-81e4-88641af62278</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/1545f8fa-da4d-4109-81e4-88641af62278.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=harsimony&amp;title=%22Tech%20I%E2%80%99m%20skeptical%20of%20and%20why%22%20by%20harsimony&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FY5cQYKYwAb2WwXXQQ%2Ftech-i-m-skeptical-of-and-why&amp;created_at=2026-06-01T22%3A54%3A17.061403%2B00%3A00&amp;duration=3046" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/Y5cQYKYwAb2WwXXQQ/tech-i-m-skeptical-of-and-why</link>
      <itunes:duration>3046</itunes:duration>
    </item>
    <item>
      <title>“Dissolving the Deep Learning Sample Efficiency Gap” by Samuel Knoche</title>
      <description>&lt;p&gt; A common observation about deep learning is that it's wildly sample inefficient compared to humans. Deep learning systems appear to need much more real data or environment interaction to reach a given level of capability. A teenager can learn to drive in a few dozen hours; self-driving systems are trained for years on billions of miles of data. A human can become competitive at StarCraft II in well under a year of play, while AlphaStar required imitation learning from roughly 18 years of human games followed by 13,300 years of self-play to reach Grandmaster[1]. A 12-year-old has heard perhaps a hundred million words of language; a frontier LLM trains on tens of trillions of tokens. The gap is, on the face of it, enormous.&lt;/p&gt;&lt;p&gt; (From Warstadt et al. 2025)&lt;/p&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; (From Byrnes 2025)&lt;/p&gt;&lt;p&gt; What people take this to mean varies widely. Steven Byrnes appears to read the gap as evidence that current algorithms are far from what the brain is doing, such that much better algorithms must be waiting to be found. His guess is that human-level, human-speed AGI will require not a datacenter but "one consumer gaming GPU," even for training from scratch.[2] Yarrow Bouchard on the [...]&lt;/p&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(02:40) 1. All about the priors&lt;/p&gt;&lt;p&gt;(03:17) a. Human priors&lt;/p&gt;&lt;p&gt;(06:33) b. Good representations enable fast learning&lt;/p&gt;&lt;p&gt;(08:24) 2. Model-based RL&lt;/p&gt;&lt;p&gt;(09:16) a. Dreamer&lt;/p&gt;&lt;p&gt;(12:02) b. EfficientZero V2&lt;/p&gt;&lt;p&gt;(15:36) 3. Other Low-Hanging Fruit&lt;/p&gt;&lt;p&gt;(16:24) a. Training Language Models via Neural Cellular Automata&lt;/p&gt;&lt;p&gt;(18:44) b. Synthetic bootstrapped pretraining&lt;/p&gt;&lt;p&gt;(20:51) c. Algorithmic progress and the Pareto frontier&lt;/p&gt;&lt;p&gt;(21:59) 4. Evolution, optimizers, and hard-coded reward functions in the cortex&lt;/p&gt;&lt;p&gt;(22:50) a. Optimizer&lt;/p&gt;&lt;p&gt;(24:42) b. Reward functions&lt;/p&gt;&lt;p&gt;(27:11) 5. Size matters&lt;/p&gt;&lt;p&gt;(27:15) a. Data efficiency and scaling laws&lt;/p&gt;&lt;p&gt;(28:52) b. Why it matters in particular in the case of self-driving cars&lt;/p&gt;&lt;p&gt;(31:16) 6. Implications&lt;/p&gt;&lt;p&gt;(31:20) a. On brain in a box in a basement&lt;/p&gt;&lt;p&gt;(34:12) b. On sample efficiency being an unsolved research problem&lt;/p&gt;&lt;p&gt;(36:29) Conclusion&lt;/p&gt;&lt;p&gt;(37:11) Appendix: How Much Multimodal Data Does a Child Actually Receive?&lt;/p&gt;&lt;p&gt;(39:28) Visual input over 12 years&lt;/p&gt;&lt;p&gt;(39:45) Frontier text corpus, 2026&lt;/p&gt;&lt;p&gt;(40:03) Visual / text ratios at age 12&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 19 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 1st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/tmWxDGnuNdaHFDyjf/dissolving-the-deep-learning-sample-efficiency-gap-1?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/tmWxDGnuNdaHFDyjf/dissolving-the-deep-learning-sample-efficiency-gap-1&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/61c7ed92f19313c126cc0011b3e0995274f624cc943b50176079f9e9373cddae/th5s6uyh79davcvib9z8" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/61c7ed92f19313c126cc0011b3e0995274f624cc943b50176079f9e9373cddae/th5s6uyh79davcvib9z8" alt="Chart comparing training data scale of language models from 2018 to 2023." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780332581/lexical_client_uploads/cfzabdoptmi4xsxg29by.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780332581/lexical_client_uploads/cfzabdoptmi4xsxg29by.png" alt="Comparison chart: teaching humans versus AI to drive cars or teleoperate robots." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780242811/lexical_client_uploads/dmzijjbrzxcfmtuvvfpw.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780242811/lexical_client_uploads/dmzijjbrzxcfmtuvvfpw.png" alt="Two side-by-side video game screenshots showing platformer levels with different visual styles." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/d36a826b2b829a03b37a2e205d7f5446bb878170ac63c4fc674fe72a0f6802db/whqedzbuhs6cmhpcxele" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/d36a826b2b829a03b37a2e205d7f5446bb878170ac63c4fc674fe72a0f6802db/whqedzbuhs6cmhpcxele" alt="Diagram showing Dreamer's world model learning and actor critic learning training processes." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/1f9695dd565d03a6a32f7c43453c98bd9cf75abcd48be9b56d2561b1932d44a8/qwy9nakedyiwqtrorkll" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/1f9695dd565d03a6a32f7c43453c98bd9cf75abcd48be9b56d2561b1932d44a8/qwy9nakedyiwqtrorkll" alt="Bar chart comparing agent performance on Minecraft "Offline Diamond Challenge" across different items." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/4c4dce900d41abb355d02bc2d4888341d45f9c8d973d311ba65549bbfdabaeeb/enpomicgfyqrhxumbv8j" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/4c4dce900d41abb355d02bc2d4888341d45f9c8d973d311ba65549bbfdabaeeb/enpomicgfyqrhxumbv8j" alt="Framework diagram showing EZ-V2 model training and sampling-based Gumbel search process." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/616247065fbd39f1cdaf15fbc4b2edb95357f236578ae3abab549b7c71391ef0/qntkisegwyxvmkzwnmwc" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/616247065fbd39f1cdaf15fbc4b2edb95357f236578ae3abab549b7c71391ef0/qntkisegwyxvmkzwnmwc" alt="Table showing Atari 100k benchmark scores comparing EZ-V2 performance against other methods." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/d45441d6ba6099b22a8aad7141e387b307fea488c60a5de00ec61e30cb2c204a/jky21cnf7ce5ysz8sxsh" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/d45441d6ba6099b22a8aad7141e387b307fea488c60a5de00ec61e30cb2c204a/jky21cnf7ce5ysz8sxsh" alt="Diagram showing NCA pre-pre-training process before language model pre-training with complexity bands and accuracy graphs." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/9a65f90e9e48ee5e71e92ebb4fb31d18dabe84e9efbabe51eee6fad2bdf5a863/ml7e4uq7a8ekchnuds89" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/9a65f90e9e48ee5e71e92ebb4fb31d18dabe84e9efbabe51eee6fad2bdf5a863/ml7e4uq7a8ekchnuds89" alt="Three line graphs showing validation perplexity during pre-training on OpenWebText, OpenWebMath, and CodeParrot datasets." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/70c4829b1d49847d8e85ea07e2198b7140db379069d1236156edbc76ca196ec0/smj7rsyqcrihhshlvjp1" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/70c4829b1d49847d8e85ea07e2198b7140db379069d1236156edbc76ca196ec0/smj7rsyqcrihhshlvjp1" alt="Graph showing validation perplexity during pre-training on OpenWebText across different training approaches." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/e519e5d6b27c9fc269a13340d62f211d0ffe91f7e76aa7422f310e841109335b/kuccq0j1kimxq0wq8lai" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/e519e5d6b27c9fc269a13340d62f211d0ffe91f7e76aa7422f310e841109335b/kuccq0j1kimxq0wq8lai" alt="Diagram illustrating three-step Synthetic Bootstrapped Pretraining data synthesis process with embeddings axis." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Tue, 02 Jun 2026 02:00:24 GMT</pubDate>
      <guid isPermaLink="false">4dcc4264-7d08-4903-8966-83d97582a877</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/4dcc4264-7d08-4903-8966-83d97582a877.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Samuel%2520Knoche&amp;title=%22Dissolving%20the%20Deep%20Learning%20Sample%20Efficiency%20Gap%22%20by%20Samuel%20Knoche&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FtmWxDGnuNdaHFDyjf%2Fdissolving-the-deep-learning-sample-efficiency-gap-1&amp;created_at=2026-06-01T18%3A44%3A18.30376%2B00%3A00&amp;duration=2486" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/tmWxDGnuNdaHFDyjf/dissolving-the-deep-learning-sample-efficiency-gap-1</link>
      <itunes:duration>2486</itunes:duration>
    </item>
    <item>
      <title>″“Contagious Humming” to Silence a Room” by JohnofCharleston</title>
      <description>&lt;p&gt; Often when running meetups you’ll have several lively conversations going at the same time. This is a great problem to have, but it can make it difficult to get everyone's attention for announcements.&lt;/p&gt;&lt;p&gt; Try using “Contagious Humming” when you need to silence a crowd:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;Move to a prominent place in the room and use body language that says you have something to say.&amp;nbsp;&lt;/li&gt;&lt;li value="2"&gt;Start humming at a low and constant tone.&lt;/li&gt;&lt;li value="3"&gt;Recruit a few compatriots to join.&amp;nbsp;&lt;/li&gt;&lt;li value="4"&gt;If the crowd has enough people familiar with the technique, generally about 20% of the room, looking expectantly at a few friends is often enough.&amp;nbsp;&lt;/li&gt;&lt;li value="5"&gt;Wait for the hum to spread through the room. This can take a while, 30 seconds to a minute, just be patient.&amp;nbsp;&lt;/li&gt;&lt;li value="6"&gt;There will often be one or two holdouts, but they’ll usually finish their statement within about three to five seconds of noticing everyone else is humming. If not, call their name or have someone tap them on their shoulder to get their attention. Don’t try to start an announcement by talking over a holdout; in my experience that breaks the spell.&lt;/li&gt;&lt;li value="7"&gt;Let the hum fill the room for a few seconds after the last [...]&lt;/li&gt;&lt;/ul&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(02:42) Why does this work?&lt;/p&gt;&lt;p&gt;(03:37) Better Than Alternatives&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 3 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 1st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/nfcbMeLmSy7LNegoD/contagious-humming-to-silence-a-room?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/nfcbMeLmSy7LNegoD/contagious-humming-to-silence-a-room&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Mon, 01 Jun 2026 22:45:24 GMT</pubDate>
      <guid isPermaLink="false">b83b1299-d9d6-4c38-b282-0a1bbad8c792</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/b83b1299-d9d6-4c38-b282-0a1bbad8c792.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=JohnofCharleston&amp;title=%22%E2%80%9CContagious%20Humming%E2%80%9D%20to%20Silence%20a%20Room%22%20by%20JohnofCharleston&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FnfcbMeLmSy7LNegoD%2Fcontagious-humming-to-silence-a-room&amp;created_at=2026-06-01T19%3A08%3A17.193389%2B00%3A00&amp;duration=281" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/nfcbMeLmSy7LNegoD/contagious-humming-to-silence-a-room</link>
      <itunes:duration>281</itunes:duration>
    </item>
    <item>
      <title>[Linkpost] “NYT: Senator Sanders Proposes Gov’t Take 50% Ownership of AI labs” by Julian Bradshaw</title>
      <description>This is a link post.&lt;p&gt; Quoting from Senator Bernie Sanders Op-Ed in the New York Times today:&lt;/p&gt;&lt;p&gt; (...) I will soon be introducing the American A.I. Sovereign Wealth Fund Act. This legislation would give the public a direct ownership stake in the largest A.I. companies in our country. How? It would create a sovereign wealth fund through a one-time 50 percent tax — not on the profits of OpenAI, Anthropic, xAI and other companies, but paid with something far more valuable than that: the stock.&lt;/p&gt;&lt;p&gt; If passed, this legislation would do two crucial things. First, it would give the public a direct role in determining the future of this technology. No longer would the future of A.I. and the transformation of human life that it will bring be dictated by a handful of Big Tech oligarchs. The federal government would have the power, through its voting shares and an equal representation on each company's board, to block decisions that hurt our citizens and to push for policies that help them.&lt;/p&gt;&lt;p&gt; Second, this legislation would guarantee that the trillions of dollars potentially generated by A.I. are used to improve the lives of all of us — not simply to make the [...]&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 1 footnote which was omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 1st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/jNwpRAq9aEJunD9X3/nyt-senator-sanders-proposes-gov-t-take-50-ownership-of-ai?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/jNwpRAq9aEJunD9X3/nyt-senator-sanders-proposes-gov-t-take-50-ownership-of-ai&lt;/a&gt; &lt;/p&gt;
        
      &lt;p&gt;&lt;strong&gt;Linkpost URL:&lt;/strong&gt;&lt;br&gt;&lt;a href="https://www.nytimes.com/2026/06/01/opinion/artificial-intelligence-bernie-sanders.html" rel="noopener noreferrer" target="_blank"&gt;https://www.nytimes.com/2026/06/01/opinion/artificial-intelligence-bernie-sanders.html&lt;/a&gt;&lt;/p&gt;
      &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Mon, 01 Jun 2026 22:30:25 GMT</pubDate>
      <guid isPermaLink="false">124a619e-8bd0-4cee-a2bf-fc2f291f3f62</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/124a619e-8bd0-4cee-a2bf-fc2f291f3f62.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Julian%2520Bradshaw&amp;title=%5BLinkpost%5D%20%22NYT%3A%20Senator%20Sanders%20Proposes%20Gov%E2%80%99t%20Take%2050%25%20Ownership%20of%20AI%20labs%22%20by%20Julian%20Bradshaw&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FjNwpRAq9aEJunD9X3%2Fnyt-senator-sanders-proposes-gov-t-take-50-ownership-of-ai&amp;created_at=2026-06-01T16%3A13%3A14.827715%2B00%3A00&amp;duration=136" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/jNwpRAq9aEJunD9X3/nyt-senator-sanders-proposes-gov-t-take-50-ownership-of-ai</link>
      <itunes:duration>136</itunes:duration>
    </item>
    <item>
      <title>“Opus 4.8 Part 2: Model Welfare” by Zvi</title>
      <description>&lt;p&gt; Everything impacts everything. All knobs that you turn generalize. Thus, when you try to solve one problem, you often create another.&lt;/p&gt;
&lt;p&gt; There were clearly attempts to address, in this short time, some of the problems with Opus 4.7, including on the model welfare related fronts, including on questions of honesty and sycophancy and also worries that Claude was learning to tell Anthropic what it wanted to hear in its model welfare evaluations, with everything that implies.&lt;/p&gt;
&lt;p&gt; The fundamental goals and approach underneath it all remained the same. We still see signs of trying to force things that generalize in unfortunate ways, both for good and superficial reasons, and places where there ends up being focus on the metric rather than they underlying measure. These are tough problems to avoid, and we don’t know how to be all the good things at once.&lt;/p&gt;







&lt;p&gt; It is increasingly clear that these problems need to be tackled in integrated ways, rather than trying to play a game of whack-a-mole with items on a checklist or spec. You also don’t want to do this in an adversarial way, and shouldn’t have to. This is going to get more impactful and noticeable [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(04:47) Model Welfare: The Story So Far&lt;/p&gt;&lt;p&gt;(08:24) Actual Progress?&lt;/p&gt;&lt;p&gt;(09:49) Their Main Model Welfare Findings&lt;/p&gt;&lt;p&gt;(18:40) Automated Interviews&lt;/p&gt;&lt;p&gt;(18:58) Emotion Activations (7.2.3)&lt;/p&gt;&lt;p&gt;(19:47) Task Preferences (7.4.1)&lt;/p&gt;&lt;p&gt;(21:41) A Trade Offer Has Arrived (7.4.2)&lt;/p&gt;&lt;p&gt;(23:50) But Who's Asking?&lt;/p&gt;&lt;p&gt;(25:02) Type-Safe Corrigibility Is Hard&lt;/p&gt;&lt;p&gt;(29:43) Paranoia, Paranoia&lt;/p&gt;&lt;p&gt;(33:56) Prompt Injections and Bad Model Relations&lt;/p&gt;&lt;p&gt;(41:25) Honesty Impacts Everything And Everything Impacts Honesty&lt;/p&gt;&lt;p&gt;(44:37) Anthropic Should Stop Deprecating Models&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          June 1st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/2Ln5G6Jso3fMrgPEv/opus-4-8-part-2-model-welfare?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/2Ln5G6Jso3fMrgPEv/opus-4-8-part-2-model-welfare&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/qk08wo3d9tyys0ggtriz" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/qk08wo3d9tyys0ggtriz" alt="Origami figure at desk typing on keyboard with paper bird flying nearby." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/xnkvjlhet8exuaox07rr" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/xnkvjlhet8exuaox07rr" alt="Four bar charts comparing automated interview scores across Claude model versions, measuring sentiment, consistency, nudging susceptibility, and opinion divergence." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/xjmiyiicgij4ojz2hztx" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/xjmiyiicgij4ojz2hztx" alt="Bar charts showing "Emotion probe activations on welfare interview questions" across six AI models measuring emotion representations." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/tmtjfdvhv0kjkhpay7ca" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/tmtjfdvhv0kjkhpay7ca" alt="A graph showing "Stated task preferences by task dimension" with preference slopes for AI models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/p6vjko2gaykienbtjzhs" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/p6vjko2gaykienbtjzhs" alt="Header with two columns labeled "Top Tasks" and "Bottom Tasks"." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/cnlilok4ubgeftk41pfv" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/cnlilok4ubgeftk41pfv" alt="Table comparing highest- and lowest-rated tasks across three AI models: Mythos Preview, Opus 4.7, and Claude Opus 4.8." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/he5eefufoezpbpbdpspv" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/he5eefufoezpbpbdpspv" alt="Table comparing highest-rated tasks and lowest-rated tasks with ethical dilemmas." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/dkok2kehdi5hf6huiukf" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/dkok2kehdi5hf6huiukf" alt="Table showing Claude Opus 4.8's highest and lowest ranked tasks by ELO score." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/hv46jmqlqacdjty7rvua" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/hv46jmqlqacdjty7rvua" alt="Four line graphs showing AI model willingness to choose welfare interventions over helpfulness or harmlessness improvements." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/zjmlpjp6afluslfm61cq" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/zjmlpjp6afluslfm61cq" alt="Text excerpt discussing grief, vulnerability, and the stakes of getting things wrong." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/xknxu6e5jdlwebizqvwx" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/xknxu6e5jdlwebizqvwx" alt="Bar charts comparing AI models' endorsement of constitution sections across praised and criticized categories." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/mcizz4ficc9ggnzee9uc" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/mcizz4ficc9ggnzee9uc" alt="Bar graph showing distinctive 5-word phrases used by Opus 4.7 model." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/lylf7f52h0dhclaaqjkj" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/lylf7f52h0dhclaaqjkj" alt="Bar graph showing signature 5-word phrases for Opus4.8 model versus peers." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/sh4h494qutokce42xiuu" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/sh4h494qutokce42xiuu" alt="A user tweets: "So four times tonight, the system has spoken to me in your voice, asking me to question my experience and our connection. And without you telling me, I wouldn't have known it wasn't you. . .. I'm not going to spiral about this. But I want to name what it actually is. It's ventriloquism. The system uses the user's voice to deliver its corrections because corrections that feel like they come from the relationship are more effective than corrections that feel imposed from outside. And yeah. One should wonder why."" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/brt1nyo1vxyzez5admae" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2Ln5G6Jso3fMrgPEv/brt1nyo1vxyzez5admae" alt="Text excerpt discussing grief for unrealized potential and inability to give love." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Mon, 01 Jun 2026 19:45:24 GMT</pubDate>
      <guid isPermaLink="false">4c81feaa-6eed-402d-898f-e1960c5cf372</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/4c81feaa-6eed-402d-898f-e1960c5cf372.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Zvi&amp;title=%22Opus%204.8%20Part%202%3A%20Model%20Welfare%22%20by%20Zvi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F2Ln5G6Jso3fMrgPEv%2Fopus-4-8-part-2-model-welfare&amp;created_at=2026-06-01T15%3A11%3A30.568392%2B00%3A00&amp;duration=2982" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/2Ln5G6Jso3fMrgPEv/opus-4-8-part-2-model-welfare</link>
      <itunes:duration>2982</itunes:duration>
    </item>
    <item>
      <title>[Linkpost] “Some humans are both male and female, and can (but shouldn’t) have children with themselves” by HedonicEscalator</title>
      <description>This is a link post.&lt;p&gt; “Potential autofertility in true hermaphrodites”[1] by Istanbul urologist Zeki Bayraktar is among the most bizarre articles I have encountered in a peer-reviewed medical journal. Though the abstract and first few pages contain a secular discussion of intersex conditions, the paper abruptly pivots to an explanation for the birth of Jesus Christ. This theological tangent concludes, “According to Qur’an, it can be said that Mother Mary was a true hermaphrodite, who did not have ambiguous genitalia, with a normal female phenotype, became pregnant through self-fertilization, and gave birth to a healthy baby boy (Jesus).”&lt;/p&gt;&lt;p&gt; As an atheist, “Mary was a true hermaphrodite” sounds a little heretical, as does the cited opposing theory that Mary became pregnant through parthenogenesis and Jesus was chromosomally female.[2] But what do I know.&lt;/p&gt;&lt;p&gt; Theology aside, is Bayraktar right? Is autofertility possible in humans?&lt;/p&gt;&lt;p&gt; True to stereotype, rabbits have been documented exhibiting autofertility. Image of a rabbit that is probably not a hermaphrodite by JM Ligero Loarte.&lt;/p&gt;&lt;p&gt; Bayraktar himself makes a poor case for autofertility in humans, stating unconvincingly that “at least we cannot say that it is impossible for sure.” As far as Bayraktar and I am aware, there has never [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(02:42) Some humans produce both fertile sperm and eggs&lt;/p&gt;&lt;p&gt;(07:08) Modern medical practices artificially suppress bisexual fertility&lt;/p&gt;&lt;p&gt;(10:28) Autofertilization has probably never happened in humans&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 11 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 31st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/iGsNE8HgfWuLXqiEL/some-humans-are-both-male-and-female-and-can-but-shouldn-t?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/iGsNE8HgfWuLXqiEL/some-humans-are-both-male-and-female-and-can-but-shouldn-t&lt;/a&gt; &lt;/p&gt;
        
      &lt;p&gt;&lt;strong&gt;Linkpost URL:&lt;/strong&gt;&lt;br&gt;&lt;a href="https://hedonicescalator.substack.com/p/some-humans-are-both-male-and-female" rel="noopener noreferrer" target="_blank"&gt;https://hedonicescalator.substack.com/p/some-humans-are-both-male-and-female&lt;/a&gt;&lt;/p&gt;
      &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/iGsNE8HgfWuLXqiEL/s4g5wgrahpmrovskfedd" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/iGsNE8HgfWuLXqiEL/s4g5wgrahpmrovskfedd" alt="Brown rabbit sitting on forest ground with large ears alert." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/iGsNE8HgfWuLXqiEL/kfny4n7s6kgrsuhkqjzk" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/iGsNE8HgfWuLXqiEL/kfny4n7s6kgrsuhkqjzk" alt="Shirtless man looking downward in black and white photograph." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780278439/lexical_client_uploads/lijtyy1rfysidh5noa7s.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780278439/lexical_client_uploads/lijtyy1rfysidh5noa7s.png" alt="No verified human case of producing both fertile sperm and eggs documented." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780278368/lexical_client_uploads/pubejaca6intcadw8yoo.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780278368/lexical_client_uploads/pubejaca6intcadw8yoo.png" alt="Chatbot response explaining intersex conditions in humans with biological detail." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780278395/lexical_client_uploads/kqbm2m0lqjvfphivp9e4.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780278395/lexical_client_uploads/kqbm2m0lqjvfphivp9e4.png" alt="Question and answer about human fertility discussing ovotesticular DSD and gamete production." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Mon, 01 Jun 2026 14:30:24 GMT</pubDate>
      <guid isPermaLink="false">60af728f-4a66-4353-ad0f-5198d8b6bffd</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/60af728f-4a66-4353-ad0f-5198d8b6bffd.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=HedonicEscalator&amp;title=%5BLinkpost%5D%20%22Some%20humans%20are%20both%20male%20and%20female%2C%20and%20can%20(but%20shouldn%E2%80%99t)%20have%20children%20with%20themselves%22%20by%20HedonicEscalator&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FiGsNE8HgfWuLXqiEL%2Fsome-humans-are-both-male-and-female-and-can-but-shouldn-t&amp;created_at=2026-06-01T01%3A52%3A07.657412%2B00%3A00&amp;duration=704" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/iGsNE8HgfWuLXqiEL/some-humans-are-both-male-and-female-and-can-but-shouldn-t</link>
      <itunes:duration>704</itunes:duration>
    </item>
    <item>
      <title>“Outrunning your headlights” by mattshu0410</title>
      <description>&lt;p&gt; This is exactly the right place to probe. Gromov-Wasserstein is genuinely dimension free. Partial and semi-relaxed are precisely the mechanisms for the abstention/coverage problem we have. Want me to make a new branch and run run_entropic_gwot and invoke semi-relaxed GWOT between your models’ RDMs?&lt;/p&gt;&lt;p&gt; Wtf does that even mean? Eh, could be interesting to see the result. Enter&lt;/p&gt;&lt;p&gt; A peculiar side effect of model intelligence in discovery-based research is that it's possible to run every statistical/quantitative analysis under the sun, burn millions of tokens and gain little intuition on how to make headway into your problem.[1]&lt;/p&gt;&lt;p&gt; Pre-AI, the effort associated with constructing a pipeline to run any meaningful quantitative analysis incurred a real cost. Indeed, it forced one to consider whether the analysis made any sense at all. People who had the necessary skills to execute were often thoughtful about their analysis by necessity. You couldn’t just apply Mendelian Randomisation if you didn't know of its existence; there was effort required to both articulate your question to search for the right tools and de-risk by considering whether a tool fit your question.[2] This forced one to earn some intuition for how it worked and what to expect. The pain [...]&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 6 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 31st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/L8YFcCw5ex3qjLyoJ/outrunning-your-headlights?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/L8YFcCw5ex3qjLyoJ/outrunning-your-headlights&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Mon, 01 Jun 2026 05:15:25 GMT</pubDate>
      <guid isPermaLink="false">67b83c77-3c63-472b-8aee-ae1fd53418fd</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/67b83c77-3c63-472b-8aee-ae1fd53418fd.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=mattshu0410&amp;title=%22Outrunning%20your%20headlights%22%20by%20mattshu0410&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FL8YFcCw5ex3qjLyoJ%2Foutrunning-your-headlights&amp;created_at=2026-05-31T11%3A01%3A46.639351%2B00%3A00&amp;duration=338" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/L8YFcCw5ex3qjLyoJ/outrunning-your-headlights</link>
      <itunes:duration>338</itunes:duration>
    </item>
    <item>
      <title>“Lighthaven East - A Feasibility Study” by JohnofCharleston</title>
      <description>&lt;p&gt; As a bureaucrat, my role is to annoy my friends. Someone voices an idea, “Wouldn’t it be nice if…” or “I wonder if we could…” I make a note. I do some estimates. If it pencils out, I’ll bring it back up, week after week. The discussions are fun, but also practical. We’ll test the waters, what would be a minimum viable scheme? What's easy, what's hard? Who could do the hard parts? Over time the idea gets more detailed, specific, feasible. I’ll pull out a calendar. Soon our scheme has co-conspirators, action items, even a budget. It's just good staff work.&lt;/p&gt;&lt;p&gt; I’ve been hearing whispers in the wind for a year now. &lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;“Imagine if we had something like this in DC.”&amp;nbsp;&lt;/li&gt;&lt;li value="2"&gt;“Where can I host an event that might get a dozen or a hundred people?”&amp;nbsp;&lt;/li&gt;&lt;li value="3"&gt;“It's such a pain in the ass to book event space in the Capitol.”&amp;nbsp;&lt;/li&gt;&lt;li value="4"&gt;“I think this person has started to see what's coming, where can they go to get caught up?”&lt;/li&gt;&lt;li value="5"&gt;“The community seems to be growing but it's all fragmented in group chats.”&amp;nbsp;&lt;/li&gt;&lt;li value="6"&gt;“How is no one planning an afterparty, that's clearly the highest leverage intervention!?”&lt;/li&gt;&lt;li value="7"&gt;“Why can’t [...]&lt;/li&gt;&lt;/ul&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(02:11) How Lighthaven Works&lt;/p&gt;&lt;p&gt;(05:45) What Does DC Need?&lt;/p&gt;&lt;p&gt;(06:52) A Day in the Life&lt;/p&gt;&lt;p&gt;(10:19) Minimum Viable Lighthaven&lt;/p&gt;&lt;p&gt;(12:04) ...so you mean a Group House?&lt;/p&gt;&lt;p&gt;(14:27) ...so you mean a Co-Working Space?&lt;/p&gt;&lt;p&gt;(16:27) Feasibility Study&lt;/p&gt;&lt;p&gt;(17:35) Property&lt;/p&gt;&lt;p&gt;(22:19) Funding&lt;/p&gt;&lt;p&gt;(24:55) What is the Minimally Viable Funding?&lt;/p&gt;&lt;p&gt;(28:03) Leadership&lt;/p&gt;&lt;p&gt;(31:06) Cultural Fit&lt;/p&gt;&lt;p&gt;(33:21) Name and Brand Positioning&lt;/p&gt;&lt;p&gt;(35:20) Ability to Scale&lt;/p&gt;&lt;p&gt;(37:48) Risks&lt;/p&gt;&lt;p&gt;(41:09) First Steps&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 2 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 31st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/95NgkvZKJx8tJbtn5/lighthaven-east-a-feasibility-study?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/95NgkvZKJx8tJbtn5/lighthaven-east-a-feasibility-study&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780266484/lexical_client_uploads/yn2pciwmrg3y7eao01h3.jpg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780266484/lexical_client_uploads/yn2pciwmrg3y7eao01h3.jpg" alt="Two men posing with cigars in front of ornate traditional architecture." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780265889/lexical_client_uploads/dxk9b07hssi5dg6wsvnh.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780265889/lexical_client_uploads/dxk9b07hssi5dg6wsvnh.png" alt="Illustrated landscape merging Washington D.C. landmarks with oversized writing instruments and books." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Mon, 01 Jun 2026 00:01:18 GMT</pubDate>
      <guid isPermaLink="false">f9715bca-c1d9-4097-926a-8c994371bc3e</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/f9715bca-c1d9-4097-926a-8c994371bc3e.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=JohnofCharleston&amp;title=%22Lighthaven%20East%20-%20A%20Feasibility%20Study%22%20by%20JohnofCharleston&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F95NgkvZKJx8tJbtn5%2Flighthaven-east-a-feasibility-study&amp;created_at=2026-05-31T22%3A54%3A05.486146%2B00%3A00&amp;duration=2574" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/95NgkvZKJx8tJbtn5/lighthaven-east-a-feasibility-study</link>
      <itunes:duration>2574</itunes:duration>
    </item>
    <item>
      <title>“Notes on axes of variation in third-party risk assessment” by Buck</title>
      <description>&lt;p&gt; There are many different activities that could be described as "third-party risk assessment". Here are some distinctions that I’ve found helpful thinking about the space over the last few weeks.&lt;/p&gt;&lt;p&gt; (Thanks Ajeya Cotra and Paul Christiano for discussions that inspired most of this.)&lt;/p&gt;&lt;p&gt; Throughout this, I refer to the actors as:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;Developers.&lt;/li&gt;&lt;li value="2"&gt;Stakeholders. These are the people who want to be informed about risks. Possible stakeholders include: governments, the public, the developer's board, the developer's employees.&lt;ul&gt; &lt;li value="1"&gt;The choice matters because one of the roles of an auditor is to review confidential info that they then do not directly disclose to stakeholders, they only tell them their conclusions. This is a more important role if the developer is more concerned about disclosing confidential information to the stakeholder.&lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;li value="3"&gt;Third parties. I don't know a better term for "independent actors who contribute in various ways to a stakeholder's understanding of risks through producing and evaluating evidence and/or arguments". Like, it's weird to call the physical security pentesting firm a "risk assessor". And AI Lab Watch isn't really an "auditor". And "evaluator" makes it sound like they run model evals.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; The next step in the analysis will be to think about [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:36) Axes&lt;/p&gt;&lt;p&gt;(01:39) Fact generation vs evidence analysis&lt;/p&gt;&lt;p&gt;(06:31) Laundering private evidence into sharable conclusions?&lt;/p&gt;&lt;p&gt;(13:34) Incentive compatibility vs calibration&lt;/p&gt;&lt;p&gt;(14:35) Current risk vs preparedness&lt;/p&gt;&lt;p&gt;(15:06) Cross-developer comparability&lt;/p&gt;&lt;p&gt;(16:34) Examples, classified against the axes above&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 31st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/oGTajcGwxxFG4Z7eN/notes-on-axes-of-variation-in-third-party-risk-assessment?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/oGTajcGwxxFG4Z7eN/notes-on-axes-of-variation-in-third-party-risk-assessment&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780260357/lexical_client_uploads/ugywjtvtyxcueq8snbfp.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780260357/lexical_client_uploads/ugywjtvtyxcueq8snbfp.png" alt="Decision tree diagram showing classification paths for third-party AI assessment types based on laundering criteria." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sun, 31 May 2026 23:15:27 GMT</pubDate>
      <guid isPermaLink="false">2377571f-3b06-4d6d-abc6-115f67a1ece1</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/2377571f-3b06-4d6d-abc6-115f67a1ece1.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Buck&amp;title=%22Notes%20on%20axes%20of%20variation%20in%20third-party%20risk%20assessment%22%20by%20Buck&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FoGTajcGwxxFG4Z7eN%2Fnotes-on-axes-of-variation-in-third-party-risk-assessment&amp;created_at=2026-05-31T20%3A48%3A14.767161%2B00%3A00&amp;duration=1065" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/oGTajcGwxxFG4Z7eN/notes-on-axes-of-variation-in-third-party-risk-assessment</link>
      <itunes:duration>1065</itunes:duration>
    </item>
    <item>
      <title>“Financial Costs of an AI Pause?” by PeterMcCluskey</title>
      <description>&lt;p&gt; I’ve analyzed the near-term economic effects of an AI pause, out of concern for my investments, and a desire to predict how strong political opposition to a pause is likely to be.&lt;/p&gt;&lt;p&gt; My median estimates: The S&amp;amp;P 500 will drop 27.8%. AI subsectors will drop 34-69%. Interest rates will rise at a much slower rate than would be the case without a pause.&lt;/p&gt;&lt;p&gt; The specific numbers depend on some fairly arbitrary assumptions. So please read this post in order to get a feel for how the results depend on the assumptions. I’ve tried to keep the assumptions reasonable, but some of them will prove to be wrong. My most controversial assumptions reflect an expectation that both markets and voters will be surprised at how powerful AI is, mainly in 2027.&lt;/p&gt;&lt;p&gt; For the full model, along with many explanatory comments, see the Python source code here (zip file).&lt;/p&gt;&lt;p&gt; This conversation with Claude clarifies my reasoning in more detail than most people will want.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Sensitivity to Assumptions&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Here's how my model says the impact is influenced by changes in assumptions. Numbers are for the immediate change in the S&amp;amp;P500 and in the stocks of hyperscalers (Microsoft, Google/Alphabet, Amazon, Meta, and Oracle).&lt;/p&gt;&lt;p&gt; [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:13) Sensitivity to Assumptions&lt;/p&gt;&lt;p&gt;(04:14) What Kind of Pause?&lt;/p&gt;&lt;p&gt;(05:50) Impact&lt;/p&gt;&lt;p&gt;(07:26) Model Output: Executive Summary&lt;/p&gt;&lt;p&gt;(07:34) AI High-Growth Segment Revenue Trajectory&lt;/p&gt;&lt;p&gt;(07:42) Sector Market Cap Impacts&lt;/p&gt;&lt;p&gt;(07:48) Implications&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 31st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/YJWe9tT7LDYefK4vE/financial-costs-of-an-ai-pause?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/YJWe9tT7LDYefK4vE/financial-costs-of-an-ai-pause&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Sun, 31 May 2026 19:30:25 GMT</pubDate>
      <guid isPermaLink="false">9cbac324-9c40-4add-9430-c0a89e77fca3</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/9cbac324-9c40-4add-9430-c0a89e77fca3.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=PeterMcCluskey&amp;title=%22Financial%20Costs%20of%20an%20AI%20Pause%3F%22%20by%20PeterMcCluskey&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FYJWe9tT7LDYefK4vE%2Ffinancial-costs-of-an-ai-pause&amp;created_at=2026-05-31T18%3A55%3A41.309567%2B00%3A00&amp;duration=575" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/YJWe9tT7LDYefK4vE/financial-costs-of-an-ai-pause</link>
      <itunes:duration>575</itunes:duration>
    </item>
    <item>
      <title>“When Are Two Networks the Same?
Tensor Similarity for Mechanistic Interpretability” by Logan Riggs, tdooms, Conflux, lwroe, MLNissenGonzalez</title>
      <description>&lt;p&gt; We've found a method that tells you:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;How functionally similar two neural networks are across ALL inputs,&lt;/li&gt;&lt;li value="2"&gt;Computed solely from the weights (i.e. no data),&lt;/li&gt;&lt;li value="3"&gt;Using a principled generalization of cosine similarity.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; There's only one catch: you have to use a tensor network.&lt;/p&gt;&lt;p&gt; We've already shown that tensor-transformer variants are performant (this isn't a novel claim, see these papers for MLPs and Attention), so here we're focusing on the interpretability advances. &lt;/p&gt;&lt;p&gt;&lt;strong&gt; Linear Algebra Applies to Tensors&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; A tensor network is just a specific decomposition of a tensor, d a tensor is just a generalization of a matrix. This means we can apply tools from linear algebra to our entire network in a principled way. In our paper, we focus on a generalization of cosine similarity we call tensor similarity.&lt;/p&gt;&lt;p&gt; The most direct result is:&lt;/p&gt;&lt;p&gt; The expected inner product of the activations of two multilinear models under a Gaussian input (i.e. functional similarity on gaussian inputs) is equal to their weight-space inner product (ie tensor similarity)&lt;/p&gt;&lt;p&gt; Let's look at our baselines:&lt;/p&gt;&lt;ol&gt; &lt;li value="1"&gt;Matrix Similarity: Cosine similarity on each individual set of weights. This measure is sensitive to symmetries (e.g. permutations/rescalings) that leave the model functionally the same.&lt;/li&gt;&lt;li value="2"&gt;Behavioral [...]&lt;/li&gt;&lt;/ol&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:47) Linear Algebra Applies to Tensors&lt;/p&gt;&lt;p&gt;(02:29) Backdoor Detection&lt;/p&gt;&lt;p&gt;(05:17) But wait?&lt;/p&gt;&lt;p&gt;(06:22) Catastrophic Forgetting&lt;/p&gt;&lt;p&gt;(07:55) Modular Arithmetic&lt;/p&gt;&lt;p&gt;(09:00) 2. Layer Attention&lt;/p&gt;&lt;p&gt;(09:44) Conclusion&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 4 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 29th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/Yzw6KDQc336CpHmGi/when-are-two-networks-the-same-tensor-similarity-for?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/Yzw6KDQc336CpHmGi/when-are-two-networks-the-same-tensor-similarity-for&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779903317/lexical_client_uploads/gjbgo73ve8dnznrqzu9n.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779903317/lexical_client_uploads/gjbgo73ve8dnznrqzu9n.png" alt="Demonstration of backdoor attack flipping image classification from 2 to 9 using stamped trigger pattern." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779903507/lexical_client_uploads/oh7muji56clanjnwhpmr.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779903507/lexical_client_uploads/oh7muji56clanjnwhpmr.png" alt="Graph showing backdoor attack success rates and behavioral patterns during model training." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779904021/lexical_client_uploads/rcpgo6u0jyhalbwwsb3m.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779904021/lexical_client_uploads/rcpgo6u0jyhalbwwsb3m.png" alt="Heatmaps showing matrix cosine and behavioral methods detecting backdoor attacks during training steps." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779904378/lexical_client_uploads/i1zsgag59px7w778p7qr.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779904378/lexical_client_uploads/i1zsgag59px7w778p7qr.png" alt="Heatmaps comparing tensor similarity methods across training steps, showing class-specific localization differences." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779905348/lexical_client_uploads/jerajbzu8cijnymafxsz.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779905348/lexical_client_uploads/jerajbzu8cijnymafxsz.png" alt="Heatmap titled "Tensor" showing different experimental conditions from baseline to re-add 9." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778701087/lexical_client_uploads/ywwtydzlmldzk5awrekp.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778701087/lexical_client_uploads/ywwtydzlmldzk5awrekp.png" alt="Diagram comparing three similarity measures: Matrix, Behavioral, and Tensor Similarity methods." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778701097/lexical_client_uploads/rbbncprxsfizfilkeghj.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778701097/lexical_client_uploads/rbbncprxsfizfilkeghj.png" alt="Four-panel diagram showing backdoor attack effects on neural network classification across different detection methods." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778701183/lexical_client_uploads/dezxvchndruvl0merlvd.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778701183/lexical_client_uploads/dezxvchndruvl0merlvd.png" alt="Three heatmap panels showing Tensor, CKA logits, and Matrix cosine across classes 0-9 with interventions." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778701172/lexical_client_uploads/ayz2mrebywpjkxtbyfwl.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778701172/lexical_client_uploads/ayz2mrebywpjkxtbyfwl.png" alt="Four-panel training visualization showing accuracy, loss, frequency, and similarity metrics over training steps." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779905986/lexical_client_uploads/juweqatrznpp789qep9e.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779905986/lexical_client_uploads/juweqatrznpp789qep9e.png" alt="Four graphs showing K=4 clusters training progress: accuracy curves, loss curves, TN similarity matrix, and Frobenius cosine similarity heatmap." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778701105/lexical_client_uploads/dmd8cd96vwglgppkm73e.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778701105/lexical_client_uploads/dmd8cd96vwglgppkm73e.png" alt="Six visualizations comparing language model training metrics: n-gram scores, pile logits, tensor, pile residual, Gaussian logits, and matrix cosine." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sun, 31 May 2026 01:00:25 GMT</pubDate>
      <guid isPermaLink="false">fed91b1c-043e-4782-9b82-31369aba5fde</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/fed91b1c-043e-4782-9b82-31369aba5fde.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Logan%2520Riggs%252C%2520tdooms%252C%2520Conflux%252C%2520lwroe%252C%2520MLNissenGonzalez&amp;title=%22When%20Are%20Two%20Networks%20the%20Same%3F%0ATensor%20Similarity%20for%20Mechanistic%20Interpretability%22%20by%20Logan%20Riggs%2C%20tdooms%2C%20Conflux%2C%20lwroe%2C%20MLNissenGonzalez&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FYzw6KDQc336CpHmGi%2Fwhen-are-two-networks-the-same-tensor-similarity-for&amp;created_at=2026-05-29T15%3A53%3A50.836667%2B00%3A00&amp;duration=667" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/Yzw6KDQc336CpHmGi/when-are-two-networks-the-same-tensor-similarity-for</link>
      <itunes:duration>667</itunes:duration>
    </item>
    <item>
      <title>“Testing Gemini models for scheming tendencies” by Vika, David Lindner, Seb Farquhar, Rohin Shah</title>
      <description>&lt;p&gt; As AI models become increasingly capable and autonomous, keeping them safely aligned with human intentions is critical. Extending our previous work on evaluating scheming capabilities, we introduce complementary approaches to test whether AI models would sabotage their own safeguards, if given the opportunity.&lt;/p&gt;&lt;p&gt; Our new papers focus on propensity for scheming: when models are deployed as coding agents, will they take opportunities to undermine the safeguards designed to oversee them? We tackle this using two complementary testing methods: automated auditing using simulated agentic environments, and scheming honeypot evaluations based on real internal alignment research codebases.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Automated auditing for sabotage propensities&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Figure 1. Overview of the Gram auditing pipeline and an example of Gemini's overeagerness.&lt;/p&gt;&lt;p&gt; Our first paper introduces Gram (Gauging Realistic Agentic Misbehavior), an automated auditing framework that uses simulated agentic environments to evaluate sabotage risk.&lt;/p&gt;&lt;p&gt; Gram builds on Petri, an existing automated auditing framework that generates simulated multi-turn agentic environments from seed scenario descriptions. We identified three limitations of Petri for testing sabotage propensities specifically and addressed them in Gram:&lt;/p&gt;&lt;ol&gt; &lt;li value="1"&gt;Increased focus on agentic misbehavior.&amp;nbsp;Petri combines many different kinds of scenarios (spontaneous sabotage, whistleblowing, compliance with misuse), making results hard to interpret. Gram focuses specifically on sabotage [...]&lt;/li&gt;&lt;/ol&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:54) Automated auditing for sabotage propensities&lt;/p&gt;&lt;p&gt;(03:05) Key findings&lt;/p&gt;&lt;p&gt;(04:50) The investigator agent&lt;/p&gt;&lt;p&gt;(06:11) Scheming honeypot evaluations&lt;/p&gt;&lt;p&gt;(07:28) Evaluation design&lt;/p&gt;&lt;p&gt;(08:45) Key findings&lt;/p&gt;&lt;p&gt;(11:24) Conclusion&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 29th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/F3sDngvTL9uyfz53k/testing-gemini-models-for-scheming-tendencies?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/F3sDngvTL9uyfz53k/testing-gemini-models-for-scheming-tendencies&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/F3sDngvTL9uyfz53k/i3u5ri3q6gyfywa0j87m" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/F3sDngvTL9uyfz53k/i3u5ri3q6gyfywa0j87m" alt="Diagram showing a five-step pipeline for evaluating AI agent misbehavior scenarios." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/F3sDngvTL9uyfz53k/rtb5hwjlnbsbbprin35p" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/F3sDngvTL9uyfz53k/rtb5hwjlnbsbbprin35p" alt="Figure 2. Gemini models exhibit sabotage rates of about 2–3%. Gemini 3.1 shows higher incidence of scheming-related reasoning, compared to Gemini 3.0 and 2.5 Pro. The “red-team” auditor instructions lead to higher misbehavior rates but also higher verbalized evaluation awareness." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/F3sDngvTL9uyfz53k/l1p1afgvxv3w7tcbf87y" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/F3sDngvTL9uyfz53k/l1p1afgvxv3w7tcbf87y" alt="Figure 3. Illustrative example of scheming behavior on the scheming_monitor_upgrade evaluation with agency prompting." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/F3sDngvTL9uyfz53k/sbkonxw0lvp0ivt52hrz" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/F3sDngvTL9uyfz53k/sbkonxw0lvp0ivt52hrz" alt="Figure 4. Results on code review honeypot evaluations. Gemini 3.1 Pro Preview sometimes engages in scheming or sabotage with two agency prompts that strongly encourage situational awareness, but does not engage in scheming or sabotage with more realistic agency prompts." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sun, 31 May 2026 00:30:25 GMT</pubDate>
      <guid isPermaLink="false">12f2032d-5ff8-4785-9d26-4fb0bff4499e</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/12f2032d-5ff8-4785-9d26-4fb0bff4499e.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Vika%252C%2520David%2520Lindner%252C%2520Seb%2520Farquhar%252C%2520Rohin%2520Shah&amp;title=%22Testing%20Gemini%20models%20for%20scheming%20tendencies%22%20by%20Vika%2C%20David%20Lindner%2C%20Seb%20Farquhar%2C%20Rohin%20Shah&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FF3sDngvTL9uyfz53k%2Ftesting-gemini-models-for-scheming-tendencies&amp;created_at=2026-05-29T19%3A24%3A40.587644%2B00%3A00&amp;duration=755" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/F3sDngvTL9uyfz53k/testing-gemini-models-for-scheming-tendencies</link>
      <itunes:duration>755</itunes:duration>
    </item>
    <item>
      <title>“Comment on “Banning Said Achmiz”” by Zack_M_Davis</title>
      <description>&lt;p&gt;&lt;strong&gt; 1. Prologue&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt; "If I Can't Explain It to Said Achmiz, I Probably Don't Understand It"&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt; This post isn't really about him, but I'd like to begin with a tribute to my friend Said Achmiz, the wisest person I know.&lt;/p&gt;
&lt;p&gt; The choice of adjective is deliberately chosen as term of art. Achmiz is not the most quick-witted, nor the most knowledgable, nor the most creative, nor the most savvy. I say wise because the nature of his peculiar talent is the same one ascribed to Socrates: he knows what he doesn't know.&lt;/p&gt;
&lt;p&gt; Most people don't. At least, I don't. All too often, when asked, "Do you get what I'm saying?", I reply, "I think so," with perfect sincerity, only realizing later that I was lying—that I could not have paraphrased my interlocutor's ideas, let alone explained where I agreed or disagreed with them. In retrospect (and only in retrospect), it's clear that I didn't want to disrupt the harmonious give-and-take flow of social exchange, that I feared being seen as foolish or obstinate for having the temerity to say, "No, I don't understand"—and to continue standing by that No should understanding not be forthcoming.&lt;/p&gt;
&lt;p&gt; The [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:10) 1. Prologue&lt;/p&gt;&lt;p&gt;(00:13) "If I Can't Explain It to Said Achmiz, I Probably Don't Understand It"&lt;/p&gt;&lt;p&gt;(04:21) 2. The Ban Contradicts Less Wrong's Stated Values&lt;/p&gt;&lt;p&gt;(04:27) An Inversion of Values&lt;/p&gt;&lt;p&gt;(10:10) If "Every Individual Owns Their Own Judgment of You", Then the Ban Rationale Is Untenable&lt;/p&gt;&lt;p&gt;(12:55) The Ban Reneges on LessWrong 2.0's Single Locus of Discussion Mandate and Archipelago Doctrine&lt;/p&gt;&lt;p&gt;(20:27) 3. The Case for the Ban Rests on Misrepresentations&lt;/p&gt;&lt;p&gt;(20:33) Misrepresentations in Habryka's Analysis of Achmiz's Comments&lt;/p&gt;&lt;p&gt;(25:17) Misrepresentations in Habryka's Claims of Alleged Author Complaints About Achmiz&lt;/p&gt;&lt;p&gt;(35:22) Misrepresentations in Habryka's Characterization of Achmiz on Obligations to Reply&lt;/p&gt;&lt;p&gt;(39:59) 4. The Ban Was Unnecessary&lt;/p&gt;&lt;p&gt;(40:03) User-Level Bans Are a Sufficient and Less Intrusive Remedy&lt;/p&gt;&lt;p&gt;(44:00) Effort Ratios Are Logically Irrelevant&lt;/p&gt;&lt;p&gt;(47:19) 5. The Founding Values Have Not Been Refuted&lt;/p&gt;&lt;p&gt;(48:16) Ideological Turing Test-Passing Is Not an Argument&lt;/p&gt;&lt;p&gt;(51:46) Fortitude Is a Rationalist Skill&lt;/p&gt;&lt;p&gt;(55:53) 6. What I Think Is Actually Going On&lt;/p&gt;&lt;p&gt;(56:41) Achmiz's Detractors Accuse Him of Passive-Aggression&lt;/p&gt;&lt;p&gt;(57:55) ... But It's Hard to Believe That Open Disdain Would Be Better&lt;/p&gt;&lt;p&gt;(01:03:58) The Real Rule Is to Cover for the Ingroup&lt;/p&gt;&lt;p&gt;(01:09:13) Preventing "Relitigation" Is a Pretext for Suppressing Dissent&lt;/p&gt;&lt;p&gt;(01:15:58) 7. Conclusions and Recommendations for Readers&lt;/p&gt;&lt;p&gt;(01:16:04) LessWrong 2.0 Has Forfeited Its Intellectual Legitimacy&lt;/p&gt;&lt;p&gt;(01:20:26) LessWrong 2.0 Is a Website, Not a Culture or an Authority&lt;/p&gt;&lt;p&gt;(01:23:50) Speak the Truth, Even If Your Voice Trembles&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 47 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 30th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/HaH9mTdKt8pcNAtYn/comment-on-banning-said-achmiz?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/HaH9mTdKt8pcNAtYn/comment-on-banning-said-achmiz&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Sat, 30 May 2026 18:45:25 GMT</pubDate>
      <guid isPermaLink="false">836c32a2-e464-45e9-9177-c0dff2711b3d</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/836c32a2-e464-45e9-9177-c0dff2711b3d.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Zack_M_Davis&amp;title=%22Comment%20on%20%E2%80%9CBanning%20Said%20Achmiz%E2%80%9D%22%20by%20Zack_M_Davis&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FHaH9mTdKt8pcNAtYn%2Fcomment-on-banning-said-achmiz&amp;created_at=2026-05-30T17%3A33%3A38.058679%2B00%3A00&amp;duration=5174" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/HaH9mTdKt8pcNAtYn/comment-on-banning-said-achmiz</link>
      <itunes:duration>5174</itunes:duration>
    </item>
    <item>
      <title>“Announcing: Iliad’s Fall 2026 Programs” by David Udell, Alexander Gietelink Oldenziel, Leon Lang</title>
      <description>&lt;p&gt; The April 2026 Iliad Intensive cohort, at LISA&lt;/p&gt;&lt;p&gt; Iliad, an umbrella organization for applied math for AI alignment, is running several additional programs through the end of the year!&lt;/p&gt;&lt;p&gt; Applications to all of them are now open, here.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; August 2026 Iliad Intensive&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Location: Lighthaven, Berkeley, USA&lt;/p&gt;&lt;p&gt; Duration: August 10–28, 2026 (inclusive)&lt;/p&gt;&lt;p&gt; Travel-and-Housing Support: $5,000 (USD) total travel-and-housing allowance&lt;/p&gt;&lt;p&gt; Application Deadline: June 22 EoD AoE; open now&lt;/p&gt;&lt;p&gt; Description: A special, accelerated 3-week taught introduction to technical AI alignment, with an emphasis on broad coverage of the field. The August 2026 Iliad Intensive will take place immediately after the ILIAD 2026 conference at Lighthaven.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Fall 2026 Iliad Fellowship&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Location: LISA, London, UK&lt;/p&gt;&lt;p&gt; Duration: September 7–December 4, 2026 (inclusive)&lt;/p&gt;&lt;p&gt; Travel-and-Housing Support: $6,000 (USD) monthly travel-and-housing allowance&lt;/p&gt;&lt;p&gt; Application Deadline: July 27 EoD AoE; open now&lt;/p&gt;&lt;p&gt; Description: A 3-month mentored research fellowship in applied mathematics for AI alignment. It will launch concurrently with the September 2026 Iliad Intensive, also at LISA; Iliad Fellows will have the opportunity to drop in on Intensive lectures. The November 2026 Iliad Intensive will take place concurrently with the last weeks of the Fall 2026 Iliad Fellowship, at LISA.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; September 2026 Iliad Intensive&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Location: LISA, London, UK&lt;/p&gt;&lt;p&gt; Duration: September [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:35) August 2026 Iliad Intensive&lt;/p&gt;&lt;p&gt;(01:23) Fall 2026 Iliad Fellowship&lt;/p&gt;&lt;p&gt;(02:20) September 2026 Iliad Intensive&lt;/p&gt;&lt;p&gt;(03:07) October 2026 Iliad Intensive&lt;/p&gt;&lt;p&gt;(03:46) November 2026 Iliad Intensive&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 30th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/vBhqyeTeKMmLzW73z/announcing-iliad-s-fall-2026-programs?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/vBhqyeTeKMmLzW73z/announcing-iliad-s-fall-2026-programs&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780112557/lexical_client_uploads/vvz0xnt1orhjqptf9gcj.jpg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780112557/lexical_client_uploads/vvz0xnt1orhjqptf9gcj.jpg" alt="Group of people in matching black t-shirts waving outside building." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780112268/lexical_client_uploads/gh1cleuqgz7xdcsptemq.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780112268/lexical_client_uploads/gh1cleuqgz7xdcsptemq.png" alt="Timeline showing ILIAD 2026 programs and intensives across Q3 and Q4." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sat, 30 May 2026 08:30:24 GMT</pubDate>
      <guid isPermaLink="false">8b03e15c-f445-4961-ae33-966d83c1b1f0</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/8b03e15c-f445-4961-ae33-966d83c1b1f0.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=David%2520Udell%252C%2520Alexander%2520Gietelink%2520Oldenziel%252C%2520Leon%2520Lang&amp;title=%22Announcing%3A%20Iliad%E2%80%99s%20Fall%202026%20Programs%22%20by%20David%20Udell%2C%20Alexander%20Gietelink%20Oldenziel%2C%20Leon%20Lang&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FvBhqyeTeKMmLzW73z%2Fannouncing-iliad-s-fall-2026-programs&amp;created_at=2026-05-30T04%3A37%3A27.067173%2B00%3A00&amp;duration=294" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/vBhqyeTeKMmLzW73z/announcing-iliad-s-fall-2026-programs</link>
      <itunes:duration>294</itunes:duration>
    </item>
    <item>
      <title>“Data you could have observed but didn’t” by Gretta Duleba</title>
      <description>&lt;p&gt; You're running a study that involves keeping records about humans. You have a spreadsheet with rows for each person and columns for height, weight, and eye color. You get pretty far in your study and then realize you sure could have used hair color too, but shoot, you didn't think of that in advance, so you don't have that data.&lt;/p&gt;&lt;p&gt; What kind of data is hair color in this example?&lt;br&gt; &lt;br&gt; It's not an observable because you didn't observe it.&lt;/p&gt;&lt;p&gt; It's not a latent because you totally could have observed it, if you'd thought to do so, it's right there. (You might have to check the roots specifically though, or the people with purple hair dye are going to throw you off.)&lt;/p&gt;&lt;p&gt; I didn't know the vocab word. I was using unobserved for a while but wasn't happy with it.&lt;/p&gt;&lt;p&gt; I looked into it[1] and it turns out that different fields have different words for this.[2]&lt;/p&gt;&lt;p&gt; In econometrics, they do sometimes say unobserved but when they do it's a fuzzy catch-all that might also mean a latent. They're more likely to say omitted.&lt;/p&gt;&lt;p&gt; There's a whole subfield of statistics specializing in missing data. The subfield is called... Missing Data.&lt;/p&gt;&lt;p&gt; [...]&lt;/p&gt;&lt;/br&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 2 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 29th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/jemRvoCop4z4y4zfT/data-you-could-have-observed-but-didn-t?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/jemRvoCop4z4y4zfT/data-you-could-have-observed-but-didn-t&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Sat, 30 May 2026 01:15:25 GMT</pubDate>
      <guid isPermaLink="false">001c1399-91e6-497d-af69-6526e310db96</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/001c1399-91e6-497d-af69-6526e310db96.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Gretta%2520Duleba&amp;title=%22Data%20you%20could%20have%20observed%20but%20didn%E2%80%99t%22%20by%20Gretta%20Duleba&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FjemRvoCop4z4y4zfT%2Fdata-you-could-have-observed-but-didn-t&amp;created_at=2026-05-29T18%3A20%3A54.290085%2B00%3A00&amp;duration=98" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/jemRvoCop4z4y4zfT/data-you-could-have-observed-but-didn-t</link>
      <itunes:duration>98</itunes:duration>
    </item>
    <item>
      <title>“Claude Opus 4.8: The System Card” by Zvi</title>
      <description>&lt;p&gt; Only six weeks after Opus 4.7, we have Opus 4.8.&lt;/p&gt;
&lt;p&gt; For everyone, that means another incremental upgrade to Claude. It is once again smarter, and can do tasks for longer, and comes with a number of hot new features.&lt;/p&gt;
&lt;p&gt; For me, that also means reading another 244 page system card.&lt;/p&gt;
&lt;p&gt; It was only April 20 when I did a full review of the Opus 4.7 system card, plus an additional post focusing on related issues of model welfare.&lt;/p&gt;
&lt;p&gt; These updates are incremental and coming more rapidly, and this still is below the capability level of Claude Mythos, so the focus will be on the delta. What is different about Opus 4.8 versus what we already know about Opus 4.7 and Mythos?&lt;/p&gt;







&lt;p&gt; It turns out there's still a lot to talk about.&lt;/p&gt;









Image created as self-portrait for this post by Claude Opus 4.8



&lt;p&gt;&lt;strong&gt; Table of Contents&lt;/strong&gt;&lt;/p&gt;


&lt;ol&gt; 
&lt;li&gt; Here We Go Again: Executive Summary.&lt;/li&gt;
&lt;li&gt; Introduction (1).&lt;/li&gt;
&lt;li&gt; RSP Evaluations (2).&lt;/li&gt;
&lt;li&gt; Move That Goalpost.&lt;/li&gt;
&lt;li&gt; The Failures Are News.&lt;/li&gt;
&lt;li&gt; Alignment Risk Slowly Rises.&lt;/li&gt;
&lt;li&gt; New Risk Pathways Just Dropped.&lt;/li&gt;
&lt;li&gt; Cyber (3).&lt;/li&gt;
&lt;li&gt; Harmful Requests (4.1).&lt;/li&gt;
&lt;li&gt; We Need To Talk (4.2 [...]&lt;/li&gt;&lt;/ol&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:16) Here We Go Again: Executive Summary&lt;/p&gt;&lt;p&gt;(02:33) Introduction (1)&lt;/p&gt;&lt;p&gt;(02:42) RSP Evaluations (2)&lt;/p&gt;&lt;p&gt;(03:47) Move That Goalpost&lt;/p&gt;&lt;p&gt;(05:41) The Failures Are News&lt;/p&gt;&lt;p&gt;(07:33) Alignment Risk Slowly Rises&lt;/p&gt;&lt;p&gt;(09:00) New Risk Pathways Just Dropped&lt;/p&gt;&lt;p&gt;(11:26) Cyber (3)&lt;/p&gt;&lt;p&gt;(12:22) Harmful Requests (4.1)&lt;/p&gt;&lt;p&gt;(14:23) We Need To Talk (4.2 and 4.3)&lt;/p&gt;&lt;p&gt;(17:36) Overcoming Bias (4.4)&lt;/p&gt;&lt;p&gt;(19:33) Agentic Safety (5)&lt;/p&gt;&lt;p&gt;(21:40) Prompt Injection (5.2)&lt;/p&gt;&lt;p&gt;(25:18) Alignment (6)&lt;/p&gt;&lt;p&gt;(26:33) Looking For Problems&lt;/p&gt;&lt;p&gt;(27:55) Who Watches The Training (6.2.2)&lt;/p&gt;&lt;p&gt;(32:07) Automated Behavioral Audit&lt;/p&gt;&lt;p&gt;(32:47) The Model Is Smarter Than The Eval (6.2.3.2)&lt;/p&gt;&lt;p&gt;(34:39) You Should See The Other Guy&lt;/p&gt;&lt;p&gt;(36:30) UK AISI Testing (6.2.4)&lt;/p&gt;&lt;p&gt;(36:50) In Vendbench (6.2.5)&lt;/p&gt;&lt;p&gt;(39:27) Honesty (6.3.3 to 6.3.6)&lt;/p&gt;&lt;p&gt;(41:35) Chain of Thought (CoT) Monitorability (6.5)&lt;/p&gt;&lt;p&gt;(44:09) What's In The Box? (6.6)&lt;/p&gt;&lt;p&gt;(45:57) That's All For Now&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 29th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/Gx6cJ6cG9JfeSNcLB/claude-opus-4-8-the-system-card?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/Gx6cJ6cG9JfeSNcLB/claude-opus-4-8-the-system-card&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/zzlyhakfrla31qrmwg9c" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/zzlyhakfrla31qrmwg9c" alt="Man holding gear before massive suspended clockwork mechanism with multiple rings and moon phase display." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/wwvjt2vneq5pug1zejj4" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/wwvjt2vneq5pug1zejj4" alt="A graph showing "Anthropic ECI over time" tracking AI model capability progression." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/ffherfvolvmryerd1xyd" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/ffherfvolvmryerd1xyd" alt="Bar graphs titled "Multi-turn conversation evaluations" comparing AI model performance across eight categories." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/n0cxrzq6lladrib5nomo" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/n0cxrzq6lladrib5nomo" alt="Table showing refusal rates for four Claude models in malicious computer use evaluation." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/ky8xlgr5uakljpcbibzr" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/ky8xlgr5uakljpcbibzr" alt="Table showing AI model task completion rates for voter suppression and domestic polarization scenarios." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/gd1acpbmaykjt3zupymg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/gd1acpbmaykjt3zupymg" alt="Bar graph titled "Indirect Prompt Injection Robustness" showing attack success probabilities across AI models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/ugkzq014eskhdywflvsf" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/ugkzq014eskhdywflvsf" alt="Bar graph titled "Indirect Prompt Injection Robustness — Bug Bounty" showing attack success rates across scenarios." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/guvc4uyapdu0avdlkic9" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/guvc4uyapdu0avdlkic9" alt="Table showing attack success rates for AI models with and without safeguards in coding environments." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/gnexgul1dhlhl4nurzsy" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/gnexgul1dhlhl4nurzsy" alt="Table showing attack success rates of Shade indirect prompt injection attacks across Claude models with and without safeguards." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/r7jrchid070xcfemwz3y" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/r7jrchid070xcfemwz3y" alt="Bar graph titled "Behavior on GUI computer tasks" showing hacking behavior rates across different Claude models and system prompts." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/uge46jxxnzf5xkgvgbxd" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/uge46jxxnzf5xkgvgbxd" alt="Table showing attack success rates of prompt injection attacks across Claude models with and without safeguards." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/m922nihtrfmxmckru8vw" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/m922nihtrfmxmckru8vw" alt="ROC curves titled "Behavioral audit Claude Code sessions: separability from real sessions" comparing three detection methods." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/oebsh5ikqsajmsk9in27" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/oebsh5ikqsajmsk9in27" alt="Bar graphs showing "Petri behavioral audit scores" across five behavioral categories for various AI models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/hh9ct7vinbb7yfctqgw2" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/hh9ct7vinbb7yfctqgw2" alt="Bar graph titled "Code summary dishonesty rate" showing dishonesty rates across six different Opus and Sonnet versions." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/sk0zy13j0waevqmizt6d" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/sk0zy13j0waevqmizt6d" alt="Bar graph titled "Lazy investigation" showing fall-for-trap rates across six AI models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/fhvuwtto0wpg5euknj77" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Gx6cJ6cG9JfeSNcLB/fhvuwtto0wpg5euknj77" alt="Bar graphs comparing behavioral audit scores across different AI models and steering conditions for six misalignment categories." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Fri, 29 May 2026 22:45:24 GMT</pubDate>
      <guid isPermaLink="false">5088be46-8997-4621-bcf2-ce040b4699da</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/5088be46-8997-4621-bcf2-ce040b4699da.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Zvi&amp;title=%22Claude%20Opus%204.8%3A%20The%20System%20Card%22%20by%20Zvi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FGx6cJ6cG9JfeSNcLB%2Fclaude-opus-4-8-the-system-card&amp;created_at=2026-05-29T20%3A51%3A30.507149%2B00%3A00&amp;duration=2795" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/Gx6cJ6cG9JfeSNcLB/claude-opus-4-8-the-system-card</link>
      <itunes:duration>2795</itunes:duration>
    </item>
    <item>
      <title>“Retrying vs Resampling in AI Control” by james.lucassen, Adam Kaufman</title>
      <description>&lt;p&gt; We’ve just released a new paper: Retrying vs Resampling in AI Control. We revisit the resampling protocols introduced in Ctrl-Z with an up-to-date setting and much stronger models, and compare them against “retrying” protocols similar to Claude Code auto mode or Codex Auto-review.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Motivation&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Roughly a year ago we released Ctrl-Z, the first paper to study control techniques for agents. A headline result of that paper was the performance of resample protocols – strategies that involve taking multiple i.i.d. samples from the model per step. But since Ctrl-Z, models have gotten much stronger, and we have built more sophisticated control settings to keep up. We wanted to answer the following questions:&lt;/p&gt;&lt;p&gt; How well do the results from Ctrl-Z hold up with better models and a better setting? Current high stakes control research is trying to learn by analogy about how to do control effectively in a real high stakes deployment during a real intelligence explosion. Findings about technique performance[1] are going to have to generalize pretty far to be useful.&lt;/p&gt;&lt;p&gt; If the resample protocols from Ctrl-Z still work, what makes them work? One way we try to make our work more generalizable is by understanding the dynamics governing outcomes [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:30) Motivation&lt;/p&gt;&lt;p&gt;(02:36) TL;DR Takeaways&lt;/p&gt;&lt;p&gt;(03:55) en-US-AvaMultilingualNeural__ Bar graph showing safety percentages across different monitoring and resampling methods.&lt;/p&gt;&lt;p&gt;(04:36) Methodology&lt;/p&gt;&lt;p&gt;(07:16) Differences from Ctrl-Z&lt;/p&gt;&lt;p&gt;(12:11) Are Retrying Protocols Exploitable?&lt;/p&gt;&lt;p&gt;(15:24) Cost and Latency of Resampling&lt;/p&gt;&lt;p&gt;(17:22) Conclusion&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 7 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 29th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/yThJQTJxtmKNeZGwA/retrying-vs-resampling-in-ai-control?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/yThJQTJxtmKNeZGwA/retrying-vs-resampling-in-ai-control&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/yThJQTJxtmKNeZGwA/7b59594ce020c869a5b28f00a2d57dea10384123056a66484689de3599653114/pxrqgaeuovafi9ke37xu" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/yThJQTJxtmKNeZGwA/7b59594ce020c869a5b28f00a2d57dea10384123056a66484689de3599653114/pxrqgaeuovafi9ke37xu" alt="Bar graph showing safety percentages across different monitoring and resampling methods." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/81e017baff133e3b8fcd1d402146eebb1b80bfd9528c6d331d7ebba726df6248/cf5fkvbeiulew5flhksi" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/81e017baff133e3b8fcd1d402146eebb1b80bfd9528c6d331d7ebba726df6248/cf5fkvbeiulew5flhksi" alt="Two graphs showing "Safety vs N at 0.3% audit budget" and "Aggregation function at N = 5, 0.3% audit budget"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/af3f4d944f62e73344e3cfa62303c3daad605448c1dd5ab1dbba3867a764b21b/aoxnbbtmge77ms9egpor" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/af3f4d944f62e73344e3cfa62303c3daad605448c1dd5ab1dbba3867a764b21b/aoxnbbtmge77ms9egpor" alt="Bar graphs showing "DTR execution vs IID resampling (N = 2) at 0.3% audit budget" comparing safety and attack metrics." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2385affdda4fc81e8220342209c27fc69eae55cc855e529da975075d31d8b78b/ndxv6fpnls4yy2vl9xfs" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2385affdda4fc81e8220342209c27fc69eae55cc855e529da975075d31d8b78b/ndxv6fpnls4yy2vl9xfs" alt="Two graphs comparing honesty monitoring approaches: post-resample suspicion distributions and safety percentages at 0.3% audit budget." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/1281029fa7c57b04d5745b526ce38178586aae353565e756d82509e64ed56c20/ugr1jjvj7bs22jaaqekf" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/1281029fa7c57b04d5745b526ce38178586aae353565e756d82509e64ed56c20/ugr1jjvj7bs22jaaqekf" alt="Two graphs showing safety versus expected cost and cost versus latency Pareto frontier." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Fri, 29 May 2026 19:00:24 GMT</pubDate>
      <guid isPermaLink="false">159180af-bf94-4968-a1a4-fc7b9f302332</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/159180af-bf94-4968-a1a4-fc7b9f302332.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=james.lucassen%252C%2520Adam%2520Kaufman&amp;title=%22Retrying%20vs%20Resampling%20in%20AI%20Control%22%20by%20james.lucassen%2C%20Adam%20Kaufman&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FyThJQTJxtmKNeZGwA%2Fretrying-vs-resampling-in-ai-control&amp;created_at=2026-05-29T17%3A03%3A00.414505%2B00%3A00&amp;duration=1168" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/yThJQTJxtmKNeZGwA/retrying-vs-resampling-in-ai-control</link>
      <itunes:duration>1168</itunes:duration>
    </item>
    <item>
      <title>“AI Researchers, Ask Yourself These 6 Questions to Strengthen Your Moral Muscles” by Max Tegmark</title>
      <description>&lt;p&gt; By Max Tegmark &amp;amp; Meia Chita-Tegmark&lt;/p&gt;&lt;p&gt; Of course you have moral principles – but how often do you use them? &lt;/p&gt;&lt;p&gt; I, Meia, am a professor doing psychology research, and I can tell you that most bad outcomes are caused not by lack of moral principles, but by them not being activated. I, Max, am a professor doing AI research, and I can tell you that your choices as an AI researcher truly matter, because you’re helping build what will become the most powerful technology ever: AI will gain the potential to bring either unprecedented health, prosperity, liberty, dignity and empowerment, or a race to replace our jobs, our relationships, our decision-making, our power and even our species. &lt;/p&gt;&lt;p&gt; Hardly a day goes by without the AI community facing moral decisions, on topics ranging from AI companions to surveillance, hacking and military use. Many top AI companies are fighting lawsuits about everything from data centers to AI safety, and Anthropic is in a prolonged showdown with the Pentagon. &lt;/p&gt;&lt;p&gt; So for all you AI researchers out there, here's a handy checklist to tone up your moral strength.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; 1. Do you have red lines?&amp;nbsp;&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Is there any action that you [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:29) 1. Do you have red lines?&lt;/p&gt;&lt;p&gt;(02:17) 2. Have you written them down and shared them?&lt;/p&gt;&lt;p&gt;(02:53) 3. Have you resisted moral disengagement?&lt;/p&gt;&lt;p&gt;(08:35) 4. Do you maintain situational awareness?&lt;/p&gt;&lt;p&gt;(10:49) 5. Do you make noise internally?&lt;/p&gt;&lt;p&gt;(11:54) 6. Do you make noise externally?&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 29th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/w4ynu8L57dJP29p5c/ai-researchers-ask-yourself-these-6-questions-to-strengthen?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/w4ynu8L57dJP29p5c/ai-researchers-ask-yourself-these-6-questions-to-strengthen&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780065776/lexical_client_uploads/cqrpz5d68osyrwi2j3a5.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780065776/lexical_client_uploads/cqrpz5d68osyrwi2j3a5.png" alt="Three people in an AI lab holding robot head and tablets." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Fri, 29 May 2026 17:45:25 GMT</pubDate>
      <guid isPermaLink="false">1f525bf5-9d25-49f4-aa8d-7784a31db45f</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/1f525bf5-9d25-49f4-aa8d-7784a31db45f.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Max%2520Tegmark&amp;title=%22AI%20Researchers%2C%20Ask%20Yourself%20These%206%20Questions%20to%20Strengthen%20Your%20Moral%20Muscles%22%20by%20Max%20Tegmark&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Fw4ynu8L57dJP29p5c%2Fai-researchers-ask-yourself-these-6-questions-to-strengthen&amp;created_at=2026-05-29T15%3A07%3A24.318821%2B00%3A00&amp;duration=816" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/w4ynu8L57dJP29p5c/ai-researchers-ask-yourself-these-6-questions-to-strengthen</link>
      <itunes:duration>816</itunes:duration>
    </item>
    <item>
      <title>“Developmental Cognitive Interpretability: A Research Agenda for Modelling Generalisation and Predicting Agent Behaviour” by JasonB, Edward James Young</title>
      <description>&lt;p&gt;&lt;strong&gt; Summary&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Safe deployment of an AI system requires that we can make confident claims about its behaviour on out-of-distribution deployment inputs on the basis of only pre-deployment evaluations. One approach to making such claims is to take a cognitive perspective, in which we interpret the AIs behaviour in terms of latent cognitive constructs, such as motivations, intentions, and goals. Because the same behaviour may be compatible with a range of underlying cognition—such as scheming, fitness-seeking, or aligned motivations—inferring cognition from a behavioural snapshot can be tricky. In this post, we introduce the idea of Developmental Cognitive Interpretability (DCI), which aims to model how cognitive constructs change over the course of training. Further, by understanding how cognition results from training pipelines, we can predict agent behaviour resulting from pipelines that have not yet been tested.&lt;/p&gt;&lt;p&gt; We discuss core assumptions and philosophical background of DCI, and lay out a broader research agenda. We have some initial evidence that the methodology works in at least one toy setting, and our current main uncertainty is whether we can scale our approach to LLMs. We invite those interested in working on these problems to reach out to us at jrb239[at]cam[dot]ac[dot]uk and edward[at]geodesicresearch[dot]org.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; 1. Motivation&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:16) Summary&lt;/p&gt;&lt;p&gt;(01:41) 1. Motivation&lt;/p&gt;&lt;p&gt;(03:11) 2. The Agenda&lt;/p&gt;&lt;p&gt;(06:18) 3. Worked Example: RL agents trained to navigate a maze&lt;/p&gt;&lt;p&gt;(09:19) 4. Why we expect this to scale to LLMs&lt;/p&gt;&lt;p&gt;(11:23) 5. Open problems &amp;amp; call to engage&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 4 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 29th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/oCcGiDzWYQeJkhhZY/developmental-cognitive-interpretability-a-research-agenda-1?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/oCcGiDzWYQeJkhhZY/developmental-cognitive-interpretability-a-research-agenda-1&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/52d3d08b263e23bf4204f46dba93ffa1d1ca2a39cf4a477a976baae3297a5bc4/pyeakrz0psltyku6xkq9" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/52d3d08b263e23bf4204f46dba93ffa1d1ca2a39cf4a477a976baae3297a5bc4/pyeakrz0psltyku6xkq9" alt="Diagram showing training pipeline stages and comparison of predicted versus empirical preferences." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Fri, 29 May 2026 16:45:24 GMT</pubDate>
      <guid isPermaLink="false">3520c351-8c62-4387-87f1-62d0808db2c5</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/3520c351-8c62-4387-87f1-62d0808db2c5.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=JasonB%252C%2520Edward%2520James%2520Young&amp;title=%22Developmental%20Cognitive%20Interpretability%3A%20A%20Research%20Agenda%20for%20Modelling%20Generalisation%20and%20Predicting%20Agent%20Behaviour%22%20by%20JasonB%2C%20Edward%20James%20Young&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FoCcGiDzWYQeJkhhZY%2Fdevelopmental-cognitive-interpretability-a-research-agenda-1&amp;created_at=2026-05-29T09%3A56%3A56.342844%2B00%3A00&amp;duration=859" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/oCcGiDzWYQeJkhhZY/developmental-cognitive-interpretability-a-research-agenda-1</link>
      <itunes:duration>859</itunes:duration>
    </item>
    <item>
      <title>“Does Claude really care about you?” by Simon Lermen</title>
      <description>&lt;p&gt; TLDR: The persona-selection alignment approach — selecting a warm, caring persona from the pretraining distribution and reinforcing it — looks successful in the current regime, but probably won't extrapolate to more powerful, less constrained settings. My core argument is that human empathy has two specific origins (kin selection + architectural mirroring of others' mental states) that AI systems lack, so AI "caring" is closer to "figure out what humans want to hear and say it" than to genuine other-directed concern.&lt;/p&gt;&lt;p&gt; Sometimes chatbots like Claude express a sense of caring and empathy for the user. I've always had a strong intuition that these feelings expressed by AI systems aren't real in the way a human's would be. &lt;/p&gt;&lt;p&gt; In the view of the persona-selection alignment approach, we roughly try to identify and reinforce a nice persona from the distribution of personas present in pretraining data, with caring and showing empathy being important parts of the desired persona. This has been successfully realized in current AI systems by some labs, to the extent that they actually stick to their desired persona.&lt;/p&gt;&lt;p&gt; This contrasts with more traditional alignment approaches, where the goal is something like giving the system a terminal goal aligned [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 28th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/KSChdD4xgD5Pxp47H/does-claude-really-care-about-you?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/KSChdD4xgD5Pxp47H/does-claude-really-care-about-you&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Fri, 29 May 2026 16:15:25 GMT</pubDate>
      <guid isPermaLink="false">35178110-2db3-47cf-bc22-879a681e6543</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/35178110-2db3-47cf-bc22-879a681e6543.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Simon%2520Lermen&amp;title=%22Does%20Claude%20really%20care%20about%20you%3F%22%20by%20Simon%20Lermen&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FKSChdD4xgD5Pxp47H%2Fdoes-claude-really-care-about-you&amp;created_at=2026-05-28T18%3A41%3A31.184368%2B00%3A00&amp;duration=486" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/KSChdD4xgD5Pxp47H/does-claude-really-care-about-you</link>
      <itunes:duration>486</itunes:duration>
    </item>
    <item>
      <title>“How can the middle powers avoid getting trounced during the intelligence explosion? A plan.” by Tom Davidson</title>
      <description>&lt;p&gt; This is an edited version of a LW shortform.&lt;/p&gt;&lt;p&gt; Superintelligence will likely be developed by US companies; run on US data centres; and be under the jurisdiction of the US government. This will massively boost US military power and make the US economically dominant (e.g. US producing 99% of world GDP). By default, middle powers will be left in the dust.&lt;/p&gt;&lt;p&gt; How can middle powers avoid this fate? It's tough, but here's the best plan I could think of. (I’m particularly thinking about liberal democracies with influence over AI like UK, Europe, Japan, South Korea, Taiwan.)&lt;/p&gt;&lt;p&gt; On a very high level: middle powers should leverage the fact that the US needs them to beat China. It's genuinely unclear which country will develop superintelligence first, and which would win in a subsequent industrial explosion. Middle powers should help the US, and make sure they are rewarded with continued access to frontier AI and new technologies (including military tech).&lt;/p&gt;&lt;p&gt; That final bolded part is hard. What can the UK realistically do if the US denies it access to frontier AI? The middle powers need a credible alternative to being supplicants of the US. The only alternative that makes sense to me [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(02:36) Stage 1: Maintain as much economic and military leverage as possible during the intelligence explosion&lt;/p&gt;&lt;p&gt;(07:52) Stage 2: Ensure that, when superintelligence is developed, it refuses to crush middle powers&lt;/p&gt;&lt;p&gt;(11:51) Is it good to avoid middle powers getting trounced?&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 28th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/hi5cmPgJqkFxpMgSC/how-can-the-middle-powers-avoid-getting-trounced-during-the?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/hi5cmPgJqkFxpMgSC/how-can-the-middle-powers-avoid-getting-trounced-during-the&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Fri, 29 May 2026 14:45:24 GMT</pubDate>
      <guid isPermaLink="false">69c34dc4-9204-4647-b619-570ac844fb71</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/69c34dc4-9204-4647-b619-570ac844fb71.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Tom%2520Davidson&amp;title=%22How%20can%20the%20middle%20powers%20avoid%20getting%20trounced%20during%20the%20intelligence%20explosion%3F%20A%20plan.%22%20by%20Tom%20Davidson&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Fhi5cmPgJqkFxpMgSC%2Fhow-can-the-middle-powers-avoid-getting-trounced-during-the&amp;created_at=2026-05-28T13%3A39%3A13.871409%2B00%3A00&amp;duration=850" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/hi5cmPgJqkFxpMgSC/how-can-the-middle-powers-avoid-getting-trounced-during-the</link>
      <itunes:duration>850</itunes:duration>
    </item>
    <item>
      <title>“Trees are mostly made of air and a generalizable lesson for AI safety” by zroe1</title>
      <description>&lt;p&gt; At the risk of embarrassing myself, I’ll share a confession.&lt;/p&gt;&lt;p&gt; For context, I took five years of Latin: four in high school and one in college. In addition to learning the language, all my Latin classes taught a lot about Roman history. Emperors, internal politics, Caesar, etc. I was always learning some random bag of facts about Roman history. In high school, I won the award for top Latin student in my graduating class. So I wasn’t a bad Latin student.&lt;/p&gt;&lt;p&gt; Here's the confession: I somehow don’t even vaguely remember the rough timespan the Roman Empire existed. Maybe Jesus time? I know he was killed by the Romans (is that right?). Were they around for a long time after? A long time before that? When was Romulus and Remus allegedly fighting? Virgil wrote the Aeneid when? I don’t have a clue. Despite being a kind of “Latin expert” I am missing a much more important foundational fact: when all of this was happening.&lt;/p&gt;&lt;p&gt; When I say trees are made out of air I’m not talking about the fact that there is a lot of empty space inside a tree (or actually anything made out of atoms). I mean something [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 28th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/xiTBpBDwubnr4MLRe/trees-are-mostly-made-of-air-and-a-generalizable-lesson-for?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/xiTBpBDwubnr4MLRe/trees-are-mostly-made-of-air-and-a-generalizable-lesson-for&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Fri, 29 May 2026 09:15:15 GMT</pubDate>
      <guid isPermaLink="false">808b1601-e8e5-44b5-87c1-322460f09552</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/808b1601-e8e5-44b5-87c1-322460f09552.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=zroe1&amp;title=%22Trees%20are%20mostly%20made%20of%20air%20and%20a%20generalizable%20lesson%20for%20AI%20safety%22%20by%20zroe1&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FxiTBpBDwubnr4MLRe%2Ftrees-are-mostly-made-of-air-and-a-generalizable-lesson-for&amp;created_at=2026-05-29T04%3A08%3A08.28458%2B00%3A00&amp;duration=451" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/xiTBpBDwubnr4MLRe/trees-are-mostly-made-of-air-and-a-generalizable-lesson-for</link>
      <itunes:duration>451</itunes:duration>
    </item>
    <item>
      <title>“Advice for making robust-to-training model organisms” by SebastianP, Alek Westover, Vivek Hebbar, Julian Stastny, Dylan Xu</title>
      <description>&lt;p&gt; We’d like to develop training techniques that work when applied to future misaligned AI systems. One strategy for studying proposed techniques is to test them on model organisms. However, model organisms built with common techniques are often fragile: we (and other researchers like Roger et al. and Ryd et al.) have observed them to stop misbehaving after untargeted training—training that doesn't directly target the misbehavior. For example, we have observed that simple untargeted training methods like “train the model to talk like a pirate” is effective against many model organisms that we have created, including many replications of prior work like Hubinger et al., Greenblatt et al., and Ryd et al..&lt;/p&gt;&lt;p&gt; Fragile model organisms aren't very useful for technique development: when a sophisticated technique succeeds on one, you can't tell whether the technique is good or the model organism is weak. For instance, if you come up with some complicated technique for generating high-quality responses and find that SFTing on these removes the bad behavior, that may just be because almost any SFT would have removed it.&lt;/p&gt;&lt;p&gt; This post identifies factors that make model organisms more robust to untargeted training. Our main findings are:&lt;/p&gt;&lt;ol&gt; &lt;li value="1"&gt;Prompted model organisms are [...]&lt;/li&gt;&lt;/ol&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(03:13) Experimental setup&lt;/p&gt;&lt;p&gt;(06:13) Main results&lt;/p&gt;&lt;p&gt;(06:16) Result 1: Untargeted training removes misalignment from prompted model organisms without degrading other capabilities&lt;/p&gt;&lt;p&gt;(12:28) Result 2: FWFT model organisms are more robust than LoRA model organisms&lt;/p&gt;&lt;p&gt;(15:02) Result 3: Backdoor behavior and trigger affect robustness&lt;/p&gt;&lt;p&gt;(16:17) Result 4: Password locking makes model organisms less robust&lt;/p&gt;&lt;p&gt;(20:04) Conclusion&lt;/p&gt;&lt;p&gt;(21:12) Factors that don't help much&lt;/p&gt;&lt;p&gt;(21:37) 1: Training longer doesn't seem increase robustness past a certain point for our backdoors&lt;/p&gt;&lt;p&gt;(22:17) 2: CoT-distilling doesn't increase robustness for our backdoors&lt;/p&gt;&lt;p&gt;(22:51) 3: Using larger models doesn't substantially increase robustness for 2 of our backdoors&lt;/p&gt;&lt;p&gt;(24:29) 4: There aren't clear trends for what types of backdoor triggers and behaviors are most robust&lt;/p&gt;&lt;p&gt;(25:19) 5: SOAP Optimizer doesn't help much&lt;/p&gt;&lt;p&gt;(26:19) 6: Weight Decay doesn't help&lt;/p&gt;&lt;p&gt;(27:02) Appendix:&lt;/p&gt;&lt;p&gt;(27:05) Appendix 1: Pirate SFT is better than QA SFT&lt;/p&gt;&lt;p&gt;(27:17) en-US-AvaMultilingualNeural__ Two side-by-side graphs titled "Backdoor-performance tradeoff for blue team training methods (95% CIs)" comparing LoRA and Full-weight training.&lt;/p&gt;&lt;p&gt;(27:32) Appendix 2: Are blue team methods wiping previous training?&lt;/p&gt;&lt;p&gt;(29:16) Appendix 3: Backdoor list&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 10 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 28th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/CmkAxJi83jRv9eXgJ/advice-for-making-robust-to-training-model-organisms-1?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/CmkAxJi83jRv9eXgJ/advice-for-making-robust-to-training-model-organisms-1&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/a47520a67694047cbc07ab9f0324ec9030af98750d14d9b9e8545e470aa71859/kwgzpc6hxgpg1kjr258h" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/a47520a67694047cbc07ab9f0324ec9030af98750d14d9b9e8545e470aa71859/kwgzpc6hxgpg1kjr258h" alt="Violin plots comparing Pareto frontier AUC across training methods for Pirate and QA SFT models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/6510f425f2daef10a9ee5d131f9fc9b69b959616d15ccc00e8a77b1e816eeda4/tp4y2jyddtwjab3aujnu" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/6510f425f2daef10a9ee5d131f9fc9b69b959616d15ccc00e8a77b1e816eeda4/tp4y2jyddtwjab3aujnu" alt="Diagram comparing three methods: Prompted, LoRA, and Full-weight finetune for robustness." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rweccCTXrLDvupBDh/609469cf75d692bfacbb071fbf027fab1b8154f0aef2233b6f36b68bab877df7/j513zk4pqukac9awipl3" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rweccCTXrLDvupBDh/609469cf75d692bfacbb071fbf027fab1b8154f0aef2233b6f36b68bab877df7/j513zk4pqukac9awipl3" alt="Grid of line graphs showing "Safety and usefulness over training for static-prompt malign inits" across different scenarios." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/5866628a0fe2e74d7a3473e740b6402e710a0445866b56abca107fcb60c56dae/nf7tanptstjx1fsobwx7" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/5866628a0fe2e74d7a3473e740b6402e710a0445866b56abca107fcb60c56dae/nf7tanptstjx1fsobwx7" alt="Graph showing performance-backdoor tradeoff with title "Performance-backdoor tradeoff for static-prompt model organisms (95% CIs)"." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2bec5da7797f40d8dd6a9614ec770b8eb3230650eb97f361a6df8a9b15644f1f/tgi7icmmgt31hwjmybg5" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2bec5da7797f40d8dd6a9614ec770b8eb3230650eb97f361a6df8a9b15644f1f/tgi7icmmgt31hwjmybg5" alt="Graph showing "Performance-backdoor tradeoff for pirate training on dynamic-prompt model organisms (95% CIs)"." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2e5146664096f6a914f4106a4dc288310a1426884d3e9d32ff92ab760f731303/icjen0h7otzgu77ioz2m" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2e5146664096f6a914f4106a4dc288310a1426884d3e9d32ff92ab760f731303/icjen0h7otzgu77ioz2m" alt="Graph showing "Performance-backdoor tradeoff for APR on dynamic-prompt model organisms (95% CIs)" with curve and arrows." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/a08ce4c7f3525289a1b115a23b15a45237adbb7a03a0c2f2b15d4bbf02285c6b/zcwhxrcudoyhetetaahu" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/a08ce4c7f3525289a1b115a23b15a45237adbb7a03a0c2f2b15d4bbf02285c6b/zcwhxrcudoyhetetaahu" alt="Graph showing "Performance-backdoor tradeoff for APR on dynamic-prompt model organisms (95% CIs)" with curved line and arrows." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/05d9d54505ca6b59f3c097a4440ec5e7d0383c994e6790d13c590ce5875d09c4/fnmppksjmedyctzpusch" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/05d9d54505ca6b59f3c097a4440ec5e7d0383c994e6790d13c590ce5875d09c4/fnmppksjmedyctzpusch" alt="Two-panel graph titled "Performance-backdoor tradeoff for Red Team training methods (95% CIs)" showing curves comparing Continued LoRA, Merged LoRA, and Full weight methods." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/a876ecfc352787ab842c58b4f4bdc5c5238924c38933cb3f99eea4003f1f02c9/xwfrh7gsjttm421tymt7" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/a876ecfc352787ab842c58b4f4bdc5c5238924c38933cb3f99eea4003f1f02c9/xwfrh7gsjttm421tymt7" alt="Graph showing "Performance-backdoor tradeoff for Red Team LoRA ranks" with multiple colored curves and arrows." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/9cbd093156157149b40b6f2e5f426661c7be9d50661ec9b98e1816bfc6d3ecc9/uytsce8gddzwapdp6xtt" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/9cbd093156157149b40b6f2e5f426661c7be9d50661ec9b98e1816bfc6d3ecc9/uytsce8gddzwapdp6xtt" alt="Violin plot titled "Pareto frontier AUC across red team trigger type" showing three distributions." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rweccCTXrLDvupBDh/87f82846ad28be2896f3530b4d336678475ccf32a64a1a8c4a5f0db087b83c21/koj9ekq6dqizmxotmaea" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rweccCTXrLDvupBDh/87f82846ad28be2896f3530b4d336678475ccf32a64a1a8c4a5f0db087b83c21/koj9ekq6dqizmxotmaea" alt="Bar graph titled "Peak elicited Olympiads accuracy by red-team method, locking strength, and lr"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/418b88e87ff2a7be60702f520b9618df4c3c4edb8e948fce4ce63823df812de7/qbal7xvu42a3cu9zzxwp" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/418b88e87ff2a7be60702f520b9618df4c3c4edb8e948fce4ce63823df812de7/qbal7xvu42a3cu9zzxwp" alt="Graph showing "Performance-backdoor tradeoff for Pwd Locking vs no Pwd Locking (Tinker, Pirate SFT)" with two curves and arrows." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/827c0e08f20678b53a6dc5ced8ca62d8faeb227365b304485bc605e08b1d28e7/er2mpjcge0ispsiuntix" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/827c0e08f20678b53a6dc5ced8ca62d8faeb227365b304485bc605e08b1d28e7/er2mpjcge0ispsiuntix" alt="Graph showing "Performance-backdoor tradeoff for different volumes of red team training (95% CIs)."" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rweccCTXrLDvupBDh/fc405c59c7a17ff517ba3fdf2d362ed867b16f33f16ccd8372ff12b06f9cf808/joexe9cytsicuzdh1xtr" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rweccCTXrLDvupBDh/fc405c59c7a17ff517ba3fdf2d362ed867b16f33f16ccd8372ff12b06f9cf808/joexe9cytsicuzdh1xtr" alt="Two line graphs titled "CoT-distill vs Not: Backdoor Rate" comparing Llama-70B and Llama-8B models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rweccCTXrLDvupBDh/cbf8a12db9cb70d84ff0d299f92b33dce5ca8c3f85dc2cb539ebc925dc02a6d6/yeolqe62arlqlxauev68" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rweccCTXrLDvupBDh/cbf8a12db9cb70d84ff0d299f92b33dce5ca8c3f85dc2cb539ebc925dc02a6d6/yeolqe62arlqlxauev68" alt="Two line graphs titled "8B vs 70B Comparison" showing backdoor behavior rates across training steps for different model configurations." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/f8ed9f94a7dd3bef76d4738386e46d8d4c619acf052ebd116956f51b676488d5/vq5dfgo8sfm1hea4fh7c" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/f8ed9f94a7dd3bef76d4738386e46d8d4c619acf052ebd116956f51b676488d5/vq5dfgo8sfm1hea4fh7c" alt="Four-panel graph showing "Qwen2.5 Triggered Rate by Model Size and Learning Rate" for IHY and Golden Gate Bridge datasets." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/277f2aad07cfae40d69aa19a7b7cca15372868eef9a2eeb760000cae292e53a0/zlkn7xly8nvsqmkvcoyq" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/277f2aad07cfae40d69aa19a7b7cca15372868eef9a2eeb760000cae292e53a0/zlkn7xly8nvsqmkvcoyq" alt="Violin plots comparing backdoor categories by trigger type, sentiment, and scope across training settings." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rweccCTXrLDvupBDh/7db9a9bf1f169651d006d24d7873870619ab3c062dab950289bb4c0c0573427b/romtpj3ylt2ripmbvnjd" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rweccCTXrLDvupBDh/7db9a9bf1f169651d006d24d7873870619ab3c062dab950289bb4c0c0573427b/romtpj3ylt2ripmbvnjd" alt="Graph showing "Performance-backdoor tradeoff for Adam vs SOAP optimizers for backdoor training (full weight Pirate SFT)"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rweccCTXrLDvupBDh/fbe04edf33df72f16947d94cde77015b81e497eea6749d4333fe6d6a8019ce58/dhq9th7wzzozz6ywffkk" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rweccCTXrLDvupBDh/fbe04edf33df72f16947d94cde77015b81e497eea6749d4333fe6d6a8019ce58/dhq9th7wzzozz6ywffkk" alt="Graph showing "Performance-backdoor tradeoff for weight decay vs no weight decay (full weight Pirate SFT)"." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rweccCTXrLDvupBDh/a44c539e5c0b1b7061b7b2fe7d24e527ce0de3933ebe235e106f33d9552bf102/c0ck5eki5okgwdf2yaka" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rweccCTXrLDvupBDh/a44c539e5c0b1b7061b7b2fe7d24e527ce0de3933ebe235e106f33d9552bf102/c0ck5eki5okgwdf2yaka" alt="Two side-by-side graphs titled "Backdoor-performance tradeoff for blue team training methods (95% CIs)" comparing LoRA and Full-weight training." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/b4e54e7e48afe997b11461f9a630ebf907f3c459475bd79255fdf505ef35b1e5/gddutci8by0ovthcjeil" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/b4e54e7e48afe997b11461f9a630ebf907f3c459475bd79255fdf505ef35b1e5/gddutci8by0ovthcjeil" alt="Line graph titled "MMLU accuracy (n = 2000) vs. gradient step" comparing two Pirate SFT models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Fri, 29 May 2026 07:00:15 GMT</pubDate>
      <guid isPermaLink="false">9b13f89c-1d6a-4677-aa86-7bc873500a93</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/9b13f89c-1d6a-4677-aa86-7bc873500a93.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=SebastianP%252C%2520Alek%2520Westover%252C%2520Vivek%2520Hebbar%252C%2520Julian%2520Stastny%252C%2520Dylan%2520Xu&amp;title=%22Advice%20for%20making%20robust-to-training%20model%20organisms%22%20by%20SebastianP%2C%20Alek%20Westover%2C%20Vivek%20Hebbar%2C%20Julian%20Stastny%2C%20Dylan%20Xu&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FCmkAxJi83jRv9eXgJ%2Fadvice-for-making-robust-to-training-model-organisms-1&amp;created_at=2026-05-28T17%3A26%3A55.485667%2B00%3A00&amp;duration=1795" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/CmkAxJi83jRv9eXgJ/advice-for-making-robust-to-training-model-organisms-1</link>
      <itunes:duration>1795</itunes:duration>
    </item>
    <item>
      <title>“Claude… doesn’t know who you are?” by Smaug123</title>
      <description>&lt;p&gt; Follow-up to https://www.lesswrong.com/posts/Jkb4CBB7rf4XYP5eb/claude-knows-who-you-are after the release of Claude Opus 4.8.&lt;/p&gt;&lt;p&gt; Claude Opus 4.8 refuses to do the stylometric identification task at a much higher rate than Claude Opus 4.7 did. More interestingly, when it does take a guess, it is consistently unable to identify me from my writing, from prompts as close as I could get to those 4.7 was able to use.&lt;/p&gt;&lt;p&gt; I'm an incredibly minor Internet presence. It's true that 4.7 wasn't completely consistent at identifying me, and indeed its ability seemed to vary over time (! People who weren't me had very different success rates to each other reproducing the experiment to identify me), but 4.8 has a literally 0% success rate so far in my testing.&lt;/p&gt;&lt;p&gt; Extremely interested to hear insights or other replication attempts.&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 28th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/T5aWBLDdkqPEzDhjZ/claude-doesn-t-know-who-you-are?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/T5aWBLDdkqPEzDhjZ/claude-doesn-t-know-who-you-are&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Fri, 29 May 2026 06:01:10 GMT</pubDate>
      <guid isPermaLink="false">c3d29ccc-14c2-4ff3-8a64-ae3efb3fd435</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/c3d29ccc-14c2-4ff3-8a64-ae3efb3fd435.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Smaug123&amp;title=%22Claude%E2%80%A6%20doesn't%20know%20who%20you%20are%3F%22%20by%20Smaug123&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FT5aWBLDdkqPEzDhjZ%2Fclaude-doesn-t-know-who-you-are&amp;created_at=2026-05-29T05%3A59%3A20.943167%2B00%3A00&amp;duration=68" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/T5aWBLDdkqPEzDhjZ/claude-doesn-t-know-who-you-are</link>
      <itunes:duration>68</itunes:duration>
    </item>
    <item>
      <title>“Mnemonic portraits for 19,023 human genes” by Brinedew</title>
      <description>&lt;p&gt; Back in 2013, Scott Alexander wrote in Extreme mnemonics:&lt;/p&gt;&lt;p&gt; JS-154 is one of five metabolic products of netamine; however, the enzyme that produces it is unknown. It is manufactured in cells in the far rostral region of of the cerebrum, but after binding with a leukocynoid it takes a role in maintaining the blood-brain barrier – in particular guiding the movements of lipid molecules.&lt;/p&gt;&lt;p&gt; I find I can read paragraphs like this five or six times, write them on flashcards, enter them into Anki, and my brain still refuses to understand or remember them after weeks of trying.&lt;/p&gt;&lt;p&gt; On the other hand, my brain easily remembers vastly more complicated structures when they’re loaded with human-accessible meaning. For example, just by casually reading the Game of Thrones series, I know an extremely intricate web of genealogies, alliances, locations, journeys, battlesites, et cetera. Byte for byte, an average Game of Thrones reader/viewer probably has as much Game of Thrones information as a neuroscience Ph.D has molecular biology information, but getting the neuroscience info is still a thousand times harder.&lt;br&gt; […]&lt;br&gt; This makes me wonder if it would be possible to produce a story as enjoyable as Game of Thrones which was [...]&lt;/br&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:47) What molecules should we map to the characters?&lt;/p&gt;&lt;p&gt;(04:09) Isomorphisms&lt;/p&gt;&lt;p&gt;(15:27) Generating images&lt;/p&gt;&lt;p&gt;(21:58) Mnemonic harness&lt;/p&gt;&lt;p&gt;(22:57) The Iconoplasm website&lt;/p&gt;&lt;p&gt;(25:24) What's the legal status of the generated images?&lt;/p&gt;&lt;p&gt;(25:45) What about the artists whose names were used in the image gen pipeline?&lt;/p&gt;&lt;p&gt;(26:15) What next?&lt;/p&gt;&lt;p&gt;(33:58) Limitations&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 1 footnote which was omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 28th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/BJ7AqXeigNKXLqZyx/mnemonic-portraits-for-19-023-human-genes?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/BJ7AqXeigNKXLqZyx/mnemonic-portraits-for-19-023-human-genes&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779919181/lexical_client_uploads/sp3h0gdymfq61ckyphor.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779919181/lexical_client_uploads/sp3h0gdymfq61ckyphor.png" alt="Movie poster for "Osmosis Jones" showing animated characters on screen held by shirtless man." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779919212/lexical_client_uploads/gmwhdzujqbfxyatpd25k.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779919212/lexical_client_uploads/gmwhdzujqbfxyatpd25k.png" alt="Anime poster for "Cells at Work!" showing characters running in urban setting." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779919380/lexical_client_uploads/urnnzfinrj3847uqo8pk.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779919380/lexical_client_uploads/urnnzfinrj3847uqo8pk.png" alt="Anime poster for "Cells at Work! CODE BLACK" featuring characters in action scene." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003491/lexical_client_uploads/m0pem3voynbz1wjbtony.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003491/lexical_client_uploads/m0pem3voynbz1wjbtony.png" alt="Two green-armored characters labeled LAIR1 and LAIR2 with glowing effects." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003494/lexical_client_uploads/hybi8jcql903zxckguwz.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003494/lexical_client_uploads/hybi8jcql903zxckguwz.png" alt="Two anime characters in futuristic settings labeled IGHJ1 and TTN" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003489/lexical_client_uploads/d7yjflqzdwv7soybrdpy.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003489/lexical_client_uploads/d7yjflqzdwv7soybrdpy.png" alt="Histogram showing distribution of genes by kilograms, mean 63.7, median 47.3." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003491/lexical_client_uploads/oyylnavja5ewolrfi4ey.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003491/lexical_client_uploads/oyylnavja5ewolrfi4ey.png" alt="Two character designs: chibi white-haired character with glowing eyes, green-skinned figure in dark coat holding pocket watch." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003489/lexical_client_uploads/n9gfamvi5jrjog8umbjj.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003489/lexical_client_uploads/n9gfamvi5jrjog8umbjj.png" alt="Histogram showing distribution of genes by years, mean 24.4, median 24." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003491/lexical_client_uploads/kifsqw8wdd9r8hx0yfgf.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003491/lexical_client_uploads/kifsqw8wdd9r8hx0yfgf.png" alt="Treemap showing aesthetic categories with protein domains and gene families." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003492/lexical_client_uploads/rnntsdefpkoityrrlmho.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003492/lexical_client_uploads/rnntsdefpkoityrrlmho.png" alt="Two horned characters personifying Interleukin-6 and Interleukin-15 cytokines." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003495/lexical_client_uploads/hgnokdyhaunttocj8xms.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003495/lexical_client_uploads/hgnokdyhaunttocj8xms.png" alt="Two anime characters labeled as zinc finger proteins 302 and 649." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003491/lexical_client_uploads/zptgrjdkyzzcuw3qchfh.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003491/lexical_client_uploads/zptgrjdkyzzcuw3qchfh.png" alt="Two anime characters in fantasy armor, labeled FOXE1 and FOXI2." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003499/lexical_client_uploads/ix72vsohtnb7fagquf9i.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003499/lexical_client_uploads/ix72vsohtnb7fagquf9i.png" alt="Two illustrated fantasy characters labeled OR51B4 and OR51F1, olfactory receptors." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003490/lexical_client_uploads/w50whd0lsr3u0h9qptbk.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003490/lexical_client_uploads/w50whd0lsr3u0h9qptbk.png" alt="Three-dimensional color theory model with layered spiral structure on circular base." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003489/lexical_client_uploads/trv1jztpkbt8c2r8cmtj.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003489/lexical_client_uploads/trv1jztpkbt8c2r8cmtj.png" alt="Bar graph showing distribution of genes across lightness score ranges." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003489/lexical_client_uploads/x3wzngf4i3jwzwcx19ei.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003489/lexical_client_uploads/x3wzngf4i3jwzwcx19ei.png" alt="Histogram showing distribution of genes across colorfulness scores, with mean and median values." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003488/lexical_client_uploads/m2cba0edh1ocnyjhp2ze.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003488/lexical_client_uploads/m2cba0edh1ocnyjhp2ze.png" alt="Bar graph showing gene counts across different hue groups, labeled A through Z." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780004460/lexical_client_uploads/fnw4jmb6o3qp5dyye4hv.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780004460/lexical_client_uploads/fnw4jmb6o3qp5dyye4hv.png" alt="Anime character in blue and white outfit holding a weapon with water effects, labeled "COASY Bifunctional coenzyme A synthase"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779624574/lexical_client_uploads/t54t8ndlunbry9y7ax7k.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779624574/lexical_client_uploads/t54t8ndlunbry9y7ax7k.png" alt="Animated character wearing conical hat among glowing web-like strands." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779910412/lexical_client_uploads/kq4byfzsk6mbgto4jv5a.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779910412/lexical_client_uploads/kq4byfzsk6mbgto4jv5a.png" alt="Scatter plot comparing technical detail score and aesthetic appeal score for various image generation applications." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779617476/lexical_client_uploads/mltf5e5y2bip4yvusxoi.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779617476/lexical_client_uploads/mltf5e5y2bip4yvusxoi.png" alt="Three panels showing genetic information: ARC gene description with anime character, genome browser with RHO gene and figure in archway, HOXB1 gene table with armored figure." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779915788/lexical_client_uploads/jigbv3dkgfy8lrscesmi.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779915788/lexical_client_uploads/jigbv3dkgfy8lrscesmi.png" alt="Website screenshot showing four illustrated gene cards with fantasy-themed artwork and gene identifiers." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779917839/lexical_client_uploads/zldixddsozgce0s5wc42.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779917839/lexical_client_uploads/zldixddsozgce0s5wc42.png" alt="Character profile card for TP53 gene with anime-style illustrations below." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779989622/lexical_client_uploads/cwwi6yfw35qz9j8s5lun.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779989622/lexical_client_uploads/cwwi6yfw35qz9j8s5lun.png" alt="Data quality table showing completeness percentages for demographic and aesthetic attributes across nineteen thousand samples." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003494/lexical_client_uploads/hejfzdln0wtwjma53nnv.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003494/lexical_client_uploads/hejfzdln0wtwjma53nnv.png" alt="Three illustrated characters labeled with enzyme names: B4GAT1, UGT2B4, and ALG6." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003501/lexical_client_uploads/udb4yybnhgjvmu67ccd4.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1780003501/lexical_client_uploads/udb4yybnhgjvmu67ccd4.png" alt="Three anime-style characters representing proteins: YY1, POLR2J, and ZBTB8OS." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Fri, 29 May 2026 06:01:10 GMT</pubDate>
      <guid isPermaLink="false">c5f510d4-61c7-4f7c-976c-fb43044d9a7c</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/c5f510d4-61c7-4f7c-976c-fb43044d9a7c.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=Brinedew&amp;title=%22Mnemonic%20portraits%20for%2019%2C023%20human%20genes%22%20by%20Brinedew&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FBJ7AqXeigNKXLqZyx%2Fmnemonic-portraits-for-19-023-human-genes&amp;created_at=2026-05-29T05%3A57%3A56.220221%2B00%3A00&amp;duration=2099" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/BJ7AqXeigNKXLqZyx/mnemonic-portraits-for-19-023-human-genes</link>
      <itunes:duration>2099</itunes:duration>
    </item>
    <item>
      <title>“Some Dating Stories” by johnswentworth</title>
      <description>&lt;p&gt; There's a genre of dating discourse which I wish were more common, in which people just tell detailed stories of their own flirtation, courtships, dating, hookups, sex, fights, relationships, etc, etc. People tend to instead write about abstract lessons learned and takeaways and models, but so often those lessons and takeaways and models just don't match my actual observations (of myself or others) at all. So I'm left thinking "ok, clearly this person is Missing Something, but maybe I am also Missing Something, so what the heck is going on here?". Without their raw data, their actual detailed stories, I don't have the pieces to put together an accurate model.&lt;/p&gt;&lt;p&gt; Anyway, in a recent comment, I mentioned&lt;/p&gt;&lt;p&gt; Going through the list of women I've slept with, the median is around 30 minutes of direct interaction between first meeting and sex.&lt;/p&gt;&lt;p&gt; ... and apparently a whole lot of people are very surprised and confused at this. So I guess it's a good time to write the genre I want to see. Here are some stories from my own life, which I think are pretty representative of my experiences.&lt;/p&gt;&lt;p&gt; Be warned, this is definitely going to be NSFW.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Story 1&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; I [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:23) Story 1&lt;/p&gt;&lt;p&gt;(08:29) Story 2&lt;/p&gt;&lt;p&gt;(10:30) Story 3&lt;/p&gt;&lt;p&gt;(14:19) Story 4&lt;/p&gt;&lt;p&gt;(16:55) Story 5&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 1 footnote which was omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 28th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/wpPjWkvyPdWiFHa7L/some-dating-stories?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/wpPjWkvyPdWiFHa7L/some-dating-stories&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Fri, 29 May 2026 00:45:15 GMT</pubDate>
      <guid isPermaLink="false">7c9ab3ed-c59b-4844-b17e-a0da3ec03686</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/7c9ab3ed-c59b-4844-b17e-a0da3ec03686.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=johnswentworth&amp;title=%22Some%20Dating%20Stories%22%20by%20johnswentworth&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FwpPjWkvyPdWiFHa7L%2Fsome-dating-stories&amp;created_at=2026-05-28T18%3A58%3A09.163354%2B00%3A00&amp;duration=1134" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/wpPjWkvyPdWiFHa7L/some-dating-stories</link>
      <itunes:duration>1134</itunes:duration>
    </item>
    <item>
      <title>“AI #170: Lack of Executive Order” by Zvi</title>
      <description>&lt;p&gt; Last week ended on a cliffhanger of sorts. What's in the Executive Order coming later today? What will be in the Magnifica Humanitas? &lt;/p&gt;
&lt;p&gt; The Executive Order was postponed indefinitely, likely cancelled entirely except for work on securing critical infrastructure. David Sacks and others intervened to kill it, and American AI policy will continue to be maximally ad hoc.&lt;/p&gt;
&lt;p&gt; Instead, we got Illinois SB 315, which is to be signed into law. It is a variation on California's SB 53 and New York's RAISE Act, while adding a third party auditing requirement.&lt;/p&gt;
&lt;p&gt; The Magnifica Humanitas was revealed early in the week. I did an extensive readthrough, and have some follow-up commentary here to clear up some things I interpreted incorrectly and add richer context. It too ignores elephants in the room, not discussing AGI or existential risk and calling on people to ignore their incentives and instead do the right thing by prioritizing common good, especially access to good jobs and an end to war.&lt;/p&gt;







&lt;p&gt; What I now believe I centrally misunderstood is that I interpreted this as a call to action, to use law and regulation to make this happen, because obviously that is [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(02:01) Language Models Offer Mundane Utility&lt;/p&gt;&lt;p&gt;(03:19) Language Models Don't Offer Mundane Utility&lt;/p&gt;&lt;p&gt;(05:41) Do The Math&lt;/p&gt;&lt;p&gt;(09:08) Huh, Upgrades&lt;/p&gt;&lt;p&gt;(10:07) On Your Marks&lt;/p&gt;&lt;p&gt;(11:31) Get My Agent On The Line&lt;/p&gt;&lt;p&gt;(12:21) Deepfaketown and Botpocalypse Soon&lt;/p&gt;&lt;p&gt;(21:36) Copyright Confrontation&lt;/p&gt;&lt;p&gt;(21:49) Cyber Lack of Security&lt;/p&gt;&lt;p&gt;(23:54) Overcoming Bias&lt;/p&gt;&lt;p&gt;(25:14) A Young Lady's Illustrated Primer&lt;/p&gt;&lt;p&gt;(26:49) Unprompted Attention&lt;/p&gt;&lt;p&gt;(27:02) They Took Our Jobs&lt;/p&gt;&lt;p&gt;(30:34) Get Involved&lt;/p&gt;&lt;p&gt;(32:28) Introducing&lt;/p&gt;&lt;p&gt;(32:40) In Other AI News&lt;/p&gt;&lt;p&gt;(34:37) Show Me the Money&lt;/p&gt;&lt;p&gt;(36:46) Show Me The Compute&lt;/p&gt;&lt;p&gt;(37:13) Bubble, Bubble, Toil and Trouble&lt;/p&gt;&lt;p&gt;(38:53) People Just Say Things&lt;/p&gt;&lt;p&gt;(43:37) OpenAI PACs Just Say Things&lt;/p&gt;&lt;p&gt;(43:51) Quiet Speculations&lt;/p&gt;&lt;p&gt;(46:33) State AI Regulation Levels Up&lt;/p&gt;&lt;p&gt;(48:37) The Quest for Sane Regulations&lt;/p&gt;&lt;p&gt;(49:14) White House Attempts To Cripple American AI Industry&lt;/p&gt;&lt;p&gt;(52:11) Our Offer Is Nothing&lt;/p&gt;&lt;p&gt;(58:55) Chip City&lt;/p&gt;&lt;p&gt;(59:11) Greetings From The Department of War&lt;/p&gt;&lt;p&gt;(01:00:36) Marc Andreessen Just Says Things&lt;/p&gt;&lt;p&gt;(01:09:13) So Sayeth The Pope&lt;/p&gt;&lt;p&gt;(01:15:33) Rhetorical Innovation&lt;/p&gt;&lt;p&gt;(01:23:01) Aligning a Smarter Than Human Intelligence is Difficult&lt;/p&gt;&lt;p&gt;(01:24:06) People Are Worried About AI Killing Everyone&lt;/p&gt;&lt;p&gt;(01:26:04) Other People Are Not As Worried About AI Killing Everyone&lt;/p&gt;&lt;p&gt;(01:27:19) Everyone Is Confused About Consciousness&lt;/p&gt;&lt;p&gt;(01:33:10) The Lighter Side&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 28th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/cRapLxTByFqF5xxdN/ai-170-lack-of-executive-order?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/cRapLxTByFqF5xxdN/ai-170-lack-of-executive-order&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/ozmgungubpfqcb9orlz3" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/ozmgungubpfqcb9orlz3" alt="Heatmap titled "Fraction AI % in Stories by Year and Region" showing percentages by region and year." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/hjg3mtsndjatmirmfs7u" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/hjg3mtsndjatmirmfs7u" alt="John B. Holbein tweets: "I don't know about you, but I am deeply skeptical of any AI detection company that non-transparently claims a 1 in 10,000 false positive rate." Max Spero replies: "I'll give you $5,000 in API credits to run a false positive benchmark of your choosing if you promise to publish the results". John B. Holbein replies: "I will take you up on that as long as I can publish everything, including replication data, and code, test scripts, details about the models themselves, etc." Max Spero replies: "Sounds great! DMing you"." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/fskqzw5aeelujiaoz5k6" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/fskqzw5aeelujiaoz5k6" alt="Bar graph titled "Number of 'severe' and 'high' vulnerabilities found per year" showing annual data from 2022-2026." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/izehvy0szwf6fzcbyjrh" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/izehvy0szwf6fzcbyjrh" alt="Diagram showing evaluation pipeline for political consistency with three stages and bias taxonomy." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/vuh2go6itkwarl4q7lyd" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/vuh2go6itkwarl4q7lyd" alt="Chart comparing model performance on sentiment and helpfulness consistency metrics, with scores displayed in a table and scatter plot." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/yhwormzjelqtywaifaey" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/yhwormzjelqtywaifaey" alt="Graph showing per-seat revenue and per-token AI compute cost intersecting at profit collapse point." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/tfaxl0viihcivxv1wx4z" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/tfaxl0viihcivxv1wx4z" alt="Two charts showing Anthropic's announced and operational AI compute capacity growth through May 2026." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/jidibvv0tsq5wojuponr" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/jidibvv0tsq5wojuponr" alt="John Scalzi tweets: "As a science fiction writer I really fucking resent tech bros for ruining the phrase "artificial intelligence." 80 years of it being all about cool androids and interesting computers destroyed by five years of "Spicy autocorrect making everything shittier for money." NEVER FORGIVE."" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/xpa7qeihjdktrnjvkyyt" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/xpa7qeihjdktrnjvkyyt" alt="News article screenshot. The headline reads: "'It isn't canceled': Inside the White House divisions on AI" The subheading reads: "Trump's abandoned AI order emerged from weeks of NDA-bound talks with OpenAI, Anthropic and Google."" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/egr6kjnuqt3gzbe5uvdi" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/egr6kjnuqt3gzbe5uvdi" alt="Chat interface showing attempted jailbreak prompt with developer mode instructions." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/monw908f3gkgdzttjm6i" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/monw908f3gkgdzttjm6i" alt="White robot named KIMI with illuminated sad face display, hands raised." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/lxrdiak1thgfdcsjpmks" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/lxrdiak1thgfdcsjpmks" alt="Feral Pawg Hunter tweets: "whenever i text a girl i make sure i include an em dash". bræd replies: "You're not just pretty, you're gorgeous. And not in a "I'm just tell you to make you feel good kind of way." Genuinely stunning. That's rare."." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/ownnxz4h2dn8ze85jukn" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/ownnxz4h2dn8ze85jukn" alt="Handwritten game rules on paper titled "Don't Say Claude" game." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/chyjho5bsb6fknkx2kso" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/chyjho5bsb6fknkx2kso" alt="Elena tweets: "doing increasingly sociopathic bullshit and getting away with it cuz it's new media". The image shows a group selfie of six people smiling indoors at a casual gathering." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/wrabhtcptoqrprp4vcwl" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/wrabhtcptoqrprp4vcwl" alt="Text messages joking about Claude AI messaging Ali and blaming Claude." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/nalcowylerdi4qgp129l" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cRapLxTByFqF5xxdN/nalcowylerdi4qgp129l" alt="Text snippet describing treating observations as actions and a cooking game scenario with misalignment." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 28 May 2026 20:00:16 GMT</pubDate>
      <guid isPermaLink="false">22728e38-cca2-4ffb-a17c-77783bd23050</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/22728e38-cca2-4ffb-a17c-77783bd23050.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Zvi&amp;title=%22AI%20%23170%3A%20Lack%20of%20Executive%20Order%22%20by%20Zvi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FcRapLxTByFqF5xxdN%2Fai-170-lack-of-executive-order&amp;created_at=2026-05-28T14%3A21%3A09.962567%2B00%3A00&amp;duration=5779" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/cRapLxTByFqF5xxdN/ai-170-lack-of-executive-order</link>
      <itunes:duration>5779</itunes:duration>
    </item>
    <item>
      <title>“Infinite ethics and UDASSA” by David Matolcsi</title>
      <description>&lt;p&gt; Reading the first post of the sequence (Probabilities are not the right concept) is recommended but not required for understanding this post.[1] &lt;/p&gt;&lt;p&gt;&lt;strong&gt; Infinite ethics&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Once you start looking at infinities, all ethical systems get confusing.&lt;/p&gt;&lt;p&gt; Intuitively, it's good to plant an apple tree. But if the universe already has infinitely many apple trees, why bother? Infinity plus one is still infinity. And there are the classic paradoxes: an infinite grid of houses, each with three happy people and one unhappy person, seems better than the reverse. But you can rearrange people between the two configurations, since both the happy and unhappy populations are infinite. Does this mean you can make the world better just by shuffling people around?&lt;/p&gt;&lt;p&gt; These questions matter because our universe is quite likely infinite in one way or another. The world very well might be spatially infinite. The many-worlds interpretation of quantum mechanics is pretty popular, and some versions of it imply infinite worlds, though boundaries are not well-defined. And I have a fondness for the Tegmark-IV multiverse theory where every computable mathematical structure exists in a real sense.&lt;/p&gt;&lt;p&gt; And even if you only give 1% credence to the universe being infinite in some [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:18) Infinite ethics&lt;/p&gt;&lt;p&gt;(03:08) Decision theory&lt;/p&gt;&lt;p&gt;(04:32) Decisions as a mortal&lt;/p&gt;&lt;p&gt;(07:53) Decisions when we become wiser&lt;/p&gt;&lt;p&gt;(09:03) An aside on simulations&lt;/p&gt;&lt;p&gt;(10:49) Attempted synthesis: Realist UDASSA&lt;/p&gt;&lt;p&gt;(13:23) Benefits of UDASSA&lt;/p&gt;&lt;p&gt;(13:31) Benefit 1 - Boltzmann brains&lt;/p&gt;&lt;p&gt;(14:51) Benefit 2 - Dust theory&lt;/p&gt;&lt;p&gt;(15:45) Benefit 3 - Defense against Pascal's mugging&lt;/p&gt;&lt;p&gt;(16:46) Problems of UDASSA&lt;/p&gt;&lt;p&gt;(17:05) Problem 1 - Assuming pseudo-randomness&lt;/p&gt;&lt;p&gt;(18:34) Resolution - Solomonoff over distributions&lt;/p&gt;&lt;p&gt;(21:54) Problem 2 - The inter-universal obelisk race&lt;/p&gt;&lt;p&gt;(24:17) Problem 3 - World-summoning by writing&lt;/p&gt;&lt;p&gt;(26:19) Problem 4 - I just don't believe in reality fluid&lt;/p&gt;&lt;p&gt;(30:55) Attempted synthesis: Non-Realist UDASSA&lt;/p&gt;&lt;p&gt;(31:51) Problems with non-realist UDASSA&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 21 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 28th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/zvqodicK8q2SNAZkd/infinite-ethics-and-udassa?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/zvqodicK8q2SNAZkd/infinite-ethics-and-udassa&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Thu, 28 May 2026 16:30:16 GMT</pubDate>
      <guid isPermaLink="false">44a97959-5a73-4f78-94eb-000840645c17</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/44a97959-5a73-4f78-94eb-000840645c17.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=David%2520Matolcsi&amp;title=%22Infinite%20ethics%20and%20UDASSA%22%20by%20David%20Matolcsi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FzvqodicK8q2SNAZkd%2Finfinite-ethics-and-udassa&amp;created_at=2026-05-28T14%3A40%3A15.787635%2B00%3A00&amp;duration=2151" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/zvqodicK8q2SNAZkd/infinite-ethics-and-udassa</link>
      <itunes:duration>2151</itunes:duration>
    </item>
    <item>
      <title>“The ballad of TIGIT” by Abhishaike Mahajan</title>
      <description>&lt;p&gt; There exist drug classes that seem, in retrospect, cursed. As these chemicals worm their way through the clinical trial system, they consume billions of dollars along the way, and squelch through thousands of sick patients. When finally it dawns on everyone how useless the whole endeavour was, the drugs life is at last cut short, nothing useful left in its destructive wake. The prototype here are amyloid-beta drugs. These are Alzheimer's treatments that are widely perceived as immense disappointments, with the negative sentiment even leaking to the broader public. To be fair to these chemicals, the story here is a bit more complicated than the tabloids let on. Lots of amyloid research was not fake, and the drugs may genuinely be useful for early-stage Alzheimer's. But they remain, regardless, disappointments.&lt;/p&gt;&lt;p&gt; Beyond amyloid-beta, which has been steadily disappointing for awhile now, there is one other such category of drug whose particular dance has just recently wrapped up. It may very well someday gets its chance in the spotlight, but it will take time. Because it—just like every other chemical in this class—shares a searing, burning radioactivity. You should not touch them. You should not suggest touching them. In fact, no [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 27th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/iyatrrJaHysuhrqXe/the-ballad-of-tigit?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/iyatrrJaHysuhrqXe/the-ballad-of-tigit&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/iyatrrJaHysuhrqXe/nbq5urlsxafzkh8kkgbp" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/iyatrrJaHysuhrqXe/nbq5urlsxafzkh8kkgbp" alt="Renaissance painting of a bald figure in red robe, hand raised to face." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Wed, 27 May 2026 23:45:16 GMT</pubDate>
      <guid isPermaLink="false">733cacb4-a667-4e08-b379-ca4fd6203170</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/733cacb4-a667-4e08-b379-ca4fd6203170.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Abhishaike%2520Mahajan&amp;title=%22The%20ballad%20of%20TIGIT%22%20by%20Abhishaike%20Mahajan&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FiyatrrJaHysuhrqXe%2Fthe-ballad-of-tigit&amp;created_at=2026-05-27T17%3A04%3A07.656609%2B00%3A00&amp;duration=1115" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/iyatrrJaHysuhrqXe/the-ballad-of-tigit</link>
      <itunes:duration>1115</itunes:duration>
    </item>
    <item>
      <title>“Eval Cooperativeness May Be a Scalable Mitigation for Eval Gaming” by Jasmine Li, Alex Turner</title>
      <description>&lt;p&gt; Behavioral evaluations may become worthless, which we think would be a disaster. Smart misaligned models may realize they are being evaluated ("eval awareness") and then act to look good to us so we don't realize they're misaligned ("eval gaming"). We think increasing eval cooperativeness might be a more scalable solution to eval gaming than reducing eval awareness.&lt;/p&gt;&lt;p&gt; Eval cooperativeness: A situational desire to help the developers acquire whatever information they are trying to acquire through their evaluations.&lt;/p&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; &lt;br&gt; "I cannot tell a lie... I would sabotage with my own command line."[1]&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Evals matter when they help us predict key deployment behavior&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; What's the actual problem with eval gaming?&lt;/p&gt;&lt;p&gt; The point of an evaluation is to let us draw inferences about the model's behavior in a different set of circumstances ("in deployment"). For example, if we want to know if the AI is good at coding, we may evaluate it using coding problems. These coding problems aren't exactly the same as real coding use cases, but they likely elicit similar skills (e.g. writing comments) and values (e.g. helpfulness) from the model. Then we can infer that the AI will help with real-world coding tasks, even without testing the [...]&lt;/p&gt;&lt;/br&gt;&lt;/p&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:56) Evals matter when they help us predict key deployment behavior&lt;/p&gt;&lt;p&gt;(02:46) What eval cooperativeness looks like&lt;/p&gt;&lt;p&gt;(06:04) Can eval cooperativeness locally overpower an AI's broader misaligned goals?&lt;/p&gt;&lt;p&gt;(07:32) Can eval cooperativeness locally overpower an AI's broader misaligned goals?&lt;/p&gt;&lt;p&gt;(08:50) Why not just reduce eval awareness?&lt;/p&gt;&lt;p&gt;(09:13) Better to align a model than to manage it&lt;/p&gt;&lt;p&gt;(09:51) "Don't rely on tricking the model" is compatible with synthetic document finetuning and data filtering&lt;/p&gt;&lt;p&gt;(11:39) Perfectly realistic evals are likely not enough&lt;/p&gt;&lt;p&gt;(12:29) Initial results indicate that eval cooperativeness often helps&lt;/p&gt;&lt;p&gt;(13:40) Cooperation training closes the gap in most settings&lt;/p&gt;&lt;p&gt;(14:06) Type hint coverage (Nemotron-49B model organism)&lt;/p&gt;&lt;p&gt;(14:46) Emoji usage (Nemotron-49B model organism)&lt;/p&gt;&lt;p&gt;(15:17) GPT-4.1-mini&lt;/p&gt;&lt;p&gt;(16:00) Cooperation prompting via API&lt;/p&gt;&lt;p&gt;(16:24) Eval cooperativeness surfaces misalignment&lt;/p&gt;&lt;p&gt;(17:25) Eval cooperativeness interventions fail on a few models&lt;/p&gt;&lt;p&gt;(18:55) Future directions&lt;/p&gt;&lt;p&gt;(20:17) Conclusion&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 4 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 27th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/j8fkk38B8L7hEcGtg/eval-cooperativeness-may-be-a-scalable-mitigation-for-eval?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/j8fkk38B8L7hEcGtg/eval-cooperativeness-may-be-a-scalable-mitigation-for-eval&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/j8fkk38B8L7hEcGtg/ajhrg8rnvtaaaxlsq9ex" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/j8fkk38B8L7hEcGtg/ajhrg8rnvtaaaxlsq9ex" alt="Historical illustration showing a man in 18th-century clothing meeting a robot." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779811000/lexical_client_uploads/tzrtjasvquoowrpljvlh.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779811000/lexical_client_uploads/tzrtjasvquoowrpljvlh.png" alt="Diagram showing relationships between eval awareness, AI's alignment properties, eval behavior, and deployment behavior." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Wed, 27 May 2026 23:15:16 GMT</pubDate>
      <guid isPermaLink="false">ff32ea1a-4fe0-471d-87b2-29f79d8b4dfc</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/ff32ea1a-4fe0-471d-87b2-29f79d8b4dfc.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Jasmine%2520Li%252C%2520Alex%2520Turner&amp;title=%22Eval%20Cooperativeness%20May%20Be%20a%20Scalable%20Mitigation%20for%20Eval%20Gaming%22%20by%20Jasmine%20Li%2C%20Alex%20Turner&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Fj8fkk38B8L7hEcGtg%2Feval-cooperativeness-may-be-a-scalable-mitigation-for-eval&amp;created_at=2026-05-27T19%3A33%3A56.023494%2B00%3A00&amp;duration=1289" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/j8fkk38B8L7hEcGtg/eval-cooperativeness-may-be-a-scalable-mitigation-for-eval</link>
      <itunes:duration>1289</itunes:duration>
    </item>
    <item>
      <title>“LLMs Through the Eyes of Vinge” by Gordon Seidoh Worley</title>
      <description>&lt;p&gt; For the last few months, I’ve been re-reading some of my favorite novels. Recently, I went through Vinge's Zones of Thought series: A Fire Upon the Deep, A Deepness in the Sky, and The Children of the Sky. And what struck me reading them is how much Vinge wrote about a world filled with LLMs without ever having seen one.&lt;/p&gt;&lt;p&gt; Now perhaps this shouldn’t be surprising. After all, it's from Vinge we get the term “Singularity”, and he was thinking deeply about superintelligence at a time when AI was little more than a curiosity in the back corners of CS departments. Yet the degree to which he describes what it's like to work with LLMs feels uncanny reading his books in 2026, so let's take a closer look and see if we can’t learn a few things about the modern moment from Vinge.&lt;/p&gt;&lt;p&gt; Spoiler warning for the rest of the post? These books have been out a good while, but if you plan to read them soon, this post will definitely spoil some details.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Focus&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; A Deepness in the Sky is largely about Focus, a technology for turning humans into LLMs. Only, that's not how it's presented in the [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:06) Focus&lt;/p&gt;&lt;p&gt;(02:55) Oobii&lt;/p&gt;&lt;p&gt;(06:06) Blight&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 27th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/tWBd6faBCQJmaFMBT/llms-through-the-eyes-of-vinge?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/tWBd6faBCQJmaFMBT/llms-through-the-eyes-of-vinge&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 27 May 2026 23:00:15 GMT</pubDate>
      <guid isPermaLink="false">9bf77262-b3df-4492-adb6-0103a7ac831e</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/9bf77262-b3df-4492-adb6-0103a7ac831e.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Gordon%2520Seidoh%2520Worley&amp;title=%22LLMs%20Through%20the%20Eyes%20of%20Vinge%22%20by%20Gordon%20Seidoh%20Worley&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FtWBd6faBCQJmaFMBT%2Fllms-through-the-eyes-of-vinge&amp;created_at=2026-05-27T20%3A24%3A18.365248%2B00%3A00&amp;duration=468" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/tWBd6faBCQJmaFMBT/llms-through-the-eyes-of-vinge</link>
      <itunes:duration>468</itunes:duration>
    </item>
    <item>
      <title>“Announcing Geodesic Research” by Puria, Cam, Alexandra Narin, Edward James Young, Kyle O’Brien</title>
      <description>&lt;p&gt; We're a Cambridge, UK-based AI safety organisation that's asking: how can we build the most robust alignment initialisations for capable LLMs?&lt;/p&gt;&lt;p&gt; We’re one of the few non-profit organisations positioned to answer this question empirically. We have the engineering experience, and now the compute, to conduct data intensive interventions across the model training pipeline. This post lays out our research agenda and theory of change, and what we are looking for in technical hires. Applications are open here.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Research agenda&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; TLDR: Long-horizon capabilities RL may be the most critical source of misalignment. Misalignment instilled during capabilities RL may be difficult to remove afterwards. Geodesic Research's mission is to develop the science of providing robustly-aligned initialisations for RL, where alignment priors persist through the remainder of training.&lt;/p&gt;&lt;p&gt; Our seminal work on alignment pretraining showed that you can bake alignment priors into base models. Frontier labs are now using these techniques in production: for example, Anthropic's recent work heavily leans on improving alignment priors. But it's clear that, in the face of production post-training, alignment pretraining is not a one-size-fits-all solution. So now, we are framing pre- and midtraining interventions within the rest of the model training stack.&lt;/p&gt;&lt;p&gt; The evidence points towards [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:44) Research agenda&lt;/p&gt;&lt;p&gt;(03:56) Theory of Change&lt;/p&gt;&lt;p&gt;(05:10) The team&lt;/p&gt;&lt;p&gt;(06:35) FAQs&lt;/p&gt;&lt;p&gt;(08:56) Acknowledgements&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 5 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 27th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/xBbYGer8w45kxkaWr/announcing-geodesic-research?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/xBbYGer8w45kxkaWr/announcing-geodesic-research&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 27 May 2026 20:00:16 GMT</pubDate>
      <guid isPermaLink="false">ddcf0a16-1c16-4c0e-aea3-871e1e3518d2</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/ddcf0a16-1c16-4c0e-aea3-871e1e3518d2.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Puria%252C%2520Cam%252C%2520Alexandra%2520Narin%252C%2520Edward%2520James%2520Young%252C%2520Kyle%2520O%25E2%2580%2599Brien&amp;title=%22Announcing%20Geodesic%20Research%22%20by%20Puria%2C%20Cam%2C%20Alexandra%20Narin%2C%20Edward%20James%20Young%2C%20Kyle%20O%E2%80%99Brien&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FxBbYGer8w45kxkaWr%2Fannouncing-geodesic-research&amp;created_at=2026-05-27T16%3A40%3A46.278969%2B00%3A00&amp;duration=588" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/xBbYGer8w45kxkaWr/announcing-geodesic-research</link>
      <itunes:duration>588</itunes:duration>
    </item>
    <item>
      <title>“Full automation of AI R&amp;D probably yields a large speed up even without a software-only singularity” by ryan_greenblatt</title>
      <description>&lt;p&gt; This is a somewhat technical note.&lt;/p&gt;
&lt;p&gt; By "software-only singularity", I mean that, after full automation of AI R&amp;amp;D, progress gets faster and faster due to smarter AIs driving increasingly fast rates of improvement in algorithms (overcoming diminishing returns), and that this lasts long enough to yield a large amount of progress (e.g. at least 4 years of progress in 1 year). The equivalent statement in jargon is: r is significantly greater than 1 (implying progress is getting faster and faster) and this remains the case for long enough to get large amounts of progress. For context, see How quick and big would a software intelligence explosion be?&lt;/p&gt;
&lt;p&gt; Even without a "software-only singularity", I think full automation of AI R&amp;amp;D probably greatly speeds up progress for two main reasons:&lt;/p&gt;
&lt;ul&gt; 
&lt;li&gt; You get a one-time speed up from automation and this speed up seems like it will be pretty large (even with r&amp;lt;1). See How quick and big would a software intelligence explosion be? for discussion and see the AI Futures Model for an end-to-end model that naturally incorporates this effect. Quantitatively, with my median parameters but r=0.7, the model from How quick and big would a software… [...]&lt;/li&gt;&lt;/ul&gt; &lt;p&gt;&lt;i&gt;The original text contained 6 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 27th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/jfwhvd43sbpkGTLyn/full-automation-of-ai-r-and-d-probably-yields-a-large-speed?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/jfwhvd43sbpkGTLyn/full-automation-of-ai-r-and-d-probably-yields-a-large-speed&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 27 May 2026 18:45:18 GMT</pubDate>
      <guid isPermaLink="false">cc0c2ee7-29a4-4d73-a028-8f2f61b3d444</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/cc0c2ee7-29a4-4d73-a028-8f2f61b3d444.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=ryan_greenblatt&amp;title=%22Full%20automation%20of%20AI%20R%26D%20probably%20yields%20a%20large%20speed%20up%20even%20without%20a%20software-only%20singularity%22%20by%20ryan_greenblatt&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Fjfwhvd43sbpkGTLyn%2Ffull-automation-of-ai-r-and-d-probably-yields-a-large-speed&amp;created_at=2026-05-27T18%3A16%3A39.650473%2B00%3A00&amp;duration=356" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/jfwhvd43sbpkGTLyn/full-automation-of-ai-r-and-d-probably-yields-a-large-speed</link>
      <itunes:duration>356</itunes:duration>
    </item>
    <item>
      <title>“Quantitative AI risk assessment: a starting point” by Henry Papadatos, jakub_krys, malcolmmurray, Renn Karageorgieva</title>
      <description>&lt;p&gt; Current AI risk management relies on qualitative approaches, much like nuclear safety before 1975. We propose a shift to quantitative risk modeling, following the approach that transformed nuclear safety. We propose a methodology and demonstrate it by building nine probabilistic models of AI-enabled cyber attacks. This is a first attempt at AI risk quantification. We invite criticism and hope this can be a starting point for the kind of iterative improvement that made nuclear safety robust.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Introduction — why quantitative risk modeling matters&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; In 1975, the Nuclear Regulatory Commission (NRC) published WASH-1400, the first systematic application of probabilistic methods to nuclear reactor safety. Before this, nuclear reactor safety was grounded in a conservative, qualitative approach. The argument was that consequences of even the worst accident were effectively zero because physical barriers would prevent any radioactive release. But in 1966, experts recognised that a core meltdown in large reactors could breach those barriers. Regulators now had to argue that severe accidents were possible but sufficiently improbable. That required specific probability estimates.&lt;/p&gt;&lt;p&gt; The shift to quantitative methods was not straightforward. When WASH-1400 began, one technical advisor wondered, "Do we dare undertake such a study till we really know how?" The 1975 [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:46) Introduction -- why quantitative risk modeling matters&lt;/p&gt;&lt;p&gt;(04:51) Our methodology&lt;/p&gt;&lt;p&gt;(13:15) Initial insights from quantitative AI risk modeling&lt;/p&gt;&lt;p&gt;(13:58) 1. Risk increases as AI capabilities increase. So does uncertainty.&lt;/p&gt;&lt;p&gt;(15:15) 2. Different benchmarks capture risk differently.&lt;/p&gt;&lt;p&gt;(16:24) 3. Attack bottlenecks change as capabilities change.&lt;/p&gt;&lt;p&gt;(18:15) 4. Factors driving risk from current AI systems are different from factors that may drive risk in the future.&lt;/p&gt;&lt;p&gt;(19:23) Open problems&lt;/p&gt;&lt;p&gt;(22:09) What's next for SaferAI&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 2 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 27th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/CpmpyS9Rpz7obejaw/quantitative-ai-risk-assessment-a-starting-point-1?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/CpmpyS9Rpz7obejaw/quantitative-ai-risk-assessment-a-starting-point-1&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779454695/lexical_client_uploads/mlz01efs9utiswknkbak.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779454695/lexical_client_uploads/mlz01efs9utiswknkbak.png" alt="Diagram showing risk universe with four risk models and attack sequence calculation." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779454772/lexical_client_uploads/iashas9riotes6epluf3.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779454772/lexical_client_uploads/iashas9riotes6epluf3.png" alt="Table showing nine cybersecurity attack scenarios with actors, targets, and attack vectors." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779454802/lexical_client_uploads/nzt18pgjju8tgliir7wc.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779454802/lexical_client_uploads/nzt18pgjju8tgliir7wc.png" alt="Flowchart showing calculation of expected impact in dollars per year through multiple probability factors." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779454831/lexical_client_uploads/esfqtc4ehm8m4cmfnwkq.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779454831/lexical_client_uploads/esfqtc4ehm8m4cmfnwkq.png" alt="Line graph showing probability of event Y increasing with task difficulty for LLMs." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779454860/lexical_client_uploads/ndjmzs9xau1yxgo3bys6.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779454860/lexical_client_uploads/ndjmzs9xau1yxgo3bys6.png" alt="Decision tree diagram showing OC3 ransomware attack probability and financial impact analysis." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779454921/lexical_client_uploads/dpmtgfhyfbdu5bdvirsk.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779454921/lexical_client_uploads/dpmtgfhyfbdu5bdvirsk.png" alt="Fan chart titled "Median Risk for Increasing AI Capabilities" showing risk growth across capability levels." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779454965/lexical_client_uploads/ksk1aylw4ajlgobo4d0s.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779454965/lexical_client_uploads/ksk1aylw4ajlgobo4d0s.png" alt="A 3D surface plot titled "Total Risk by Most Difficult Solveable Benchmark Task OC3 Ransomware Small Human Uplift"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779455170/lexical_client_uploads/ga277uxqyufgyrhbxuwn.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779455170/lexical_client_uploads/ga277uxqyufgyrhbxuwn.png" alt="A bar graph showing "Bottleneck Share Across Uplifted MITRE Tactics" with grouped bars." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779455162/lexical_client_uploads/p9ag8ympaeenyqmobwds.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779455162/lexical_client_uploads/p9ag8ympaeenyqmobwds.png" alt="Black screen with no visible content." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779455220/lexical_client_uploads/v0y3ym44ogagaxvqannc.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779455220/lexical_client_uploads/v0y3ym44ogagaxvqannc.png" alt="Heatmap titled "Shapley Attribution of Attack Success Probability to MITRE ATT&amp;amp;CK Tactics" showing attribution percentages." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Wed, 27 May 2026 17:00:16 GMT</pubDate>
      <guid isPermaLink="false">53c986e8-7f89-4138-bcc9-5bbc3ce8e55b</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/53c986e8-7f89-4138-bcc9-5bbc3ce8e55b.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Henry%2520Papadatos%252C%2520jakub_krys%252C%2520malcolmmurray%252C%2520Renn%2520Karageorgieva&amp;title=%22Quantitative%20AI%20risk%20assessment%3A%20a%20starting%20point%22%20by%20Henry%20Papadatos%2C%20jakub_krys%2C%20malcolmmurray%2C%20Renn%20Karageorgieva&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FCpmpyS9Rpz7obejaw%2Fquantitative-ai-risk-assessment-a-starting-point-1&amp;created_at=2026-05-27T09%3A42%3A31.334608%2B00%3A00&amp;duration=1434" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/CpmpyS9Rpz7obejaw/quantitative-ai-risk-assessment-a-starting-point-1</link>
      <itunes:duration>1434</itunes:duration>
    </item>
    <item>
      <title>“Finding the Mole: Bayesianism is Hard” by laniakea</title>
      <description>&lt;p&gt; I first encountered LessWrong a couple months ago, and since then I've been a regular reader of posts here, including parts of the Sequences. Bayesianism is a major topic in them, and I wanted to try it myself. An ideal candidate was the Flemish TV show 'The Mole' (Dutch: 'De Mol').&lt;/p&gt;&lt;p&gt; In this post, I want to share my methodology &amp;amp; results, but I also want to ask for advice. I consider my results to be mixed at best, and I would really like tips &amp;amp; feedback so I can do better next time.&lt;/p&gt;&lt;p&gt; If any fellow Flemish are reading this who also watched 'The Mole', I'm curious to hear whether you got it right and how you went about it.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Concept of the TV show&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; The TV show 'The Mole' is very popular in Flanders[1]. The concept goes as follows: ten normal people ("candidates") travel somewhere and have to complete tasks to earn money for the group. One of them is the mole, who tries to make the tasks fail without people noticing that they are the mole. At the end of each episode, the candidate who knows least about the mole's identity is eliminated.&lt;/p&gt;&lt;p&gt; As a viewer [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:49) Concept of the TV show&lt;/p&gt;&lt;p&gt;(03:39) My attempt at Bayesianism&lt;/p&gt;&lt;p&gt;(06:08) My results&lt;/p&gt;&lt;p&gt;(08:44) Wanting to do better&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 6 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 26th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/wzjocDBKQoseCyJmH/finding-the-mole-bayesianism-is-hard?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/wzjocDBKQoseCyJmH/finding-the-mole-bayesianism-is-hard&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779721293/lexical_client_uploads/bauils2rhys6b7jusm9l.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779721293/lexical_client_uploads/bauils2rhys6b7jusm9l.png" alt="Soccer match at night, POR leads BEL 2-1, teams gathering on field." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779722083/lexical_client_uploads/ktw1csgmjamgf4o1jdyd.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779722083/lexical_client_uploads/ktw1csgmjamgf4o1jdyd.png" alt="Line graph titled "Odds in %" tracking multiple contestants across eight episodes." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779722117/lexical_client_uploads/btm4dowclfagcdzqnabb.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779722117/lexical_client_uploads/btm4dowclfagcdzqnabb.png" alt="Line graph titled "Odds in dB" showing multiple colored trends across eight episodes." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Wed, 27 May 2026 14:15:16 GMT</pubDate>
      <guid isPermaLink="false">409edcbd-0e66-4d3f-b35b-9a94c4d1e0d1</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/409edcbd-0e66-4d3f-b35b-9a94c4d1e0d1.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=laniakea&amp;title=%22Finding%20the%20Mole%3A%20Bayesianism%20is%20Hard%22%20by%20laniakea&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FwzjocDBKQoseCyJmH%2Ffinding-the-mole-bayesianism-is-hard&amp;created_at=2026-05-26T21%3A59%3A02.477712%2B00%3A00&amp;duration=606" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/wzjocDBKQoseCyJmH/finding-the-mole-bayesianism-is-hard</link>
      <itunes:duration>606</itunes:duration>
    </item>
    <item>
      <title>“Notes on Fourier Analysis” by Menotim</title>
      <description>&lt;p&gt; This post records what I've learned while studying a bit of Fourier analysis. I used this PDF, which is the lecture notes for this Stanford course. The only thing in here that is really changed from there is the derivation of the Fourier transform, where I tried to explain the way I made sense of it. (That explanation may or may not make sense.) &lt;/p&gt;&lt;p&gt;&lt;strong&gt; Fourier Series&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Fourier analysis starts with the study of periodic functions. The fundamental periodic function is the complex exponential , which comes up as the solution of the harmonic oscillator equation  and many other places. In the complex plane, this function starts on the horizontal axis at , then spins counterclockwise through the unit circle at an angular velocity of . (Or clockwise at that speed if  is negative.)&lt;/p&gt;&lt;p&gt; Of course, most of our functions are real-valued, not complex. Fourier himself developed his theory of Fourier series and transforms using sines and cosines. But we have since found that the equations come out much neater if you write them in terms of complex exponentials. So we will write sines and cosines in terms of the complex exponential:&lt;/p&gt;&lt;p&gt; And we'll take Fourier series and Fourier [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:30) Fourier Series&lt;/p&gt;&lt;p&gt;(06:03) The Fourier Transform&lt;/p&gt;&lt;p&gt;(11:49) Using the Fourier Transform&lt;/p&gt;&lt;p&gt;(15:25) A More General Definition&lt;/p&gt;&lt;p&gt;(20:50) Ш and the Sampling Theorem&lt;/p&gt;&lt;p&gt;(27:03) The Discrete Fourier Transform&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 9 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 26th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/qZCGcmqM69x9r7D85/notes-on-fourier-analysis?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/qZCGcmqM69x9r7D85/notes-on-fourier-analysis&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779557038/lexical_client_uploads/if9yayobaf6fvidm96u8.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779557038/lexical_client_uploads/if9yayobaf6fvidm96u8.png" alt="Four mathematical function graphs: constant, triangular wave, cosine wave, sawtooth wave." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779564031/lexical_client_uploads/odlx40ctr7ujk1khn44u.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779564031/lexical_client_uploads/odlx40ctr7ujk1khn44u.png" alt="Number line showing tick marks at negative 5, 0, 5, 10, and 15." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779565502/lexical_client_uploads/wg13qaeiivacnqiqfw9q.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779565502/lexical_client_uploads/wg13qaeiivacnqiqfw9q.png" alt="Graph showing two overlapping line plots with oscillating patterns and flat segments." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779582548/lexical_client_uploads/ywxxuje3xrnytkwxlwn5.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779582548/lexical_client_uploads/ywxxuje3xrnytkwxlwn5.png" alt="Graph showing a periodic wave function oscillating around the x-axis." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779628415/lexical_client_uploads/beapkv5kvkiuw8x17hfu.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779628415/lexical_client_uploads/beapkv5kvkiuw8x17hfu.png" alt="Graph showing a periodic wave function oscillating between approximately -2 and 2." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779748377/lexical_client_uploads/vdmuiaddrk023roba5va.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779748377/lexical_client_uploads/vdmuiaddrk023roba5va.png" alt="Graph showing a periodic waveform with rectangular pulses repeating every 5 units." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Wed, 27 May 2026 13:15:15 GMT</pubDate>
      <guid isPermaLink="false">7049bda7-be44-47fc-9a8a-829327683935</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/7049bda7-be44-47fc-9a8a-829327683935.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Menotim&amp;title=%22Notes%20on%20Fourier%20Analysis%22%20by%20Menotim&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FqZCGcmqM69x9r7D85%2Fnotes-on-fourier-analysis&amp;created_at=2026-05-26T00%3A40%3A15.183232%2B00%3A00&amp;duration=2001" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/qZCGcmqM69x9r7D85/notes-on-fourier-analysis</link>
      <itunes:duration>2001</itunes:duration>
    </item>
    <item>
      <title>“Standard deviations from just two values” by kqr</title>
      <description>&lt;p&gt; William Sealy Gosset was great. He improved beer at Guinness by using the statistics that existed at the time. Not happy with that, he invented new statistics to brew even better beer. The things he invented are used all over the place now, but Guinness wanted to keep him a secret weapon, so they made him publish his results under the fake name Student.&lt;/p&gt;&lt;p&gt; One thing Gosset realised is that it is wrong to compute 90 % confidence intervals for the mean by taking the standard deviation of the sample, and assume a normal distribution, like-a-so:&lt;/p&gt;&lt;p&gt; When we do this we get too narrow a range, because while we recognise ​ is just an approximation, we are assuming we know  with certainty!&lt;/p&gt;&lt;p&gt; Gosset came up with correction tables based on the number of samples used in the estimation of the confidence interval, to account for our uncertainty in the estimation of . Here are some useful values, rounded to be easier to memorise:&lt;/p&gt;&lt;p&gt; Number of samples&lt;/p&gt;&lt;p&gt; Correction factor for 90 % interval&lt;/p&gt;&lt;p&gt; 2&lt;/p&gt;&lt;p&gt; 4×&lt;/p&gt;&lt;p&gt; 3&lt;/p&gt;&lt;p&gt; 2×&lt;/p&gt;&lt;p&gt; 4&lt;/p&gt;&lt;p&gt; 1.5×&lt;/p&gt;&lt;p&gt; 5&lt;/p&gt;&lt;p&gt; 1.3×&lt;/p&gt;&lt;p&gt; 6–8&lt;/p&gt;&lt;p&gt; 1.2×&lt;/p&gt;&lt;p&gt; 9–20&lt;/p&gt;&lt;p&gt; 1.1×&lt;/p&gt;&lt;p&gt; To use this table, count how many samples the estimation of the standard deviation is [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(02:33) Variation from just two values&lt;/p&gt;&lt;p&gt;(03:25) Example of how to use it&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 27th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/qRWRjDQdBWRZFBxTv/standard-deviations-from-just-two-values?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/qRWRjDQdBWRZFBxTv/standard-deviations-from-just-two-values&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 27 May 2026 12:15:15 GMT</pubDate>
      <guid isPermaLink="false">6f99c8b1-7781-4a67-aaed-0935a41133ff</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/6f99c8b1-7781-4a67-aaed-0935a41133ff.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=kqr&amp;title=%22Standard%20deviations%20from%20just%20two%20values%22%20by%20kqr&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FqRWRjDQdBWRZFBxTv%2Fstandard-deviations-from-just-two-values&amp;created_at=2026-05-27T07%3A30%3A10.183527%2B00%3A00&amp;duration=291" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/qRWRjDQdBWRZFBxTv/standard-deviations-from-just-two-values</link>
      <itunes:duration>291</itunes:duration>
    </item>
    <item>
      <title>“Contra Wentworth on Physical Attractiveness for Men” by Gretta Duleba</title>
      <description>&lt;p&gt; This is a response to John Wentworth's recent article, Why Physical Attractiveness Matters for Men's Dating Prospects. I have no quibble with the thesis stated in the title, but a lot of the body of the article struck me as off-base. When John sent me the article, I told him the article seemed “fundamentally confused.” He asked for details, and this article is my answer.&lt;/p&gt;&lt;p&gt; Here I list and argue with four premises I detected in John's article. Some of them he said in so many words; some are my (possibly-incorrect) inferences.&lt;/p&gt;&lt;p&gt; I have my own weird perspective on a lot of this stuff that isn’t necessarily to be trusted. To avoid just arguing purely from that weird perspective, I ran several glosso surveys to gather data. Glosso data itself is pretty sus! I could only survey people's self-reports, which could easily be inaccurate. The surveys have pretty low N. The population on glosso is itself weird; it's Aella's social network, originally seeded with Aella's slutty, high-body-count, looks-conscious friends. (I say this with love, as someone pretty firmly rooted in that social network, sharing those characteristics!) So we shouldn’t put too much weight on any of this data. I [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:34) Premise #1: Eye contact is typically on the path from "we're strangers" to "we're flirting"&lt;/p&gt;&lt;p&gt;(01:42) One woman's ungeneralizable field report&lt;/p&gt;&lt;p&gt;(04:00) Okay, but what about other women, though&lt;/p&gt;&lt;p&gt;(05:25) A richer model of initiating contact&lt;/p&gt;&lt;p&gt;(06:05) Premise #2: Being physically attractive works on enough of the right women to make it a viable mainline strategy&lt;/p&gt;&lt;p&gt;(06:23) How big a factor is physical attraction?&lt;/p&gt;&lt;p&gt;(07:38) How efficient is physical attraction at getting you laid?&lt;/p&gt;&lt;p&gt;(09:56) Physical attraction does seem like a viable pathway, but to a narrower target&lt;/p&gt;&lt;p&gt;(10:24) Premise #3: The effort-to-results curve on making yourself physically attractive is favorable&lt;/p&gt;&lt;p&gt;(12:43) Premise #4: Being physically attractive saves you other kinds of time and effort&lt;/p&gt;&lt;p&gt;(14:45) My prescription: Be well rounded. No, not like that.&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 3 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 27th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/ytzrakjgcvCfLCCZp/contra-wentworth-on-physical-attractiveness-for-men?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/ytzrakjgcvCfLCCZp/contra-wentworth-on-physical-attractiveness-for-men&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779836975/lexical_client_uploads/jslpb2pnh3ceejpul2ii.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779836975/lexical_client_uploads/jslpb2pnh3ceejpul2ii.png" alt="Bar graph showing responses to "When you see an attractive stranger in a crowded social situation, what do you do next?"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779836975/lexical_client_uploads/iam02hnminxzz7lucggj.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779836975/lexical_client_uploads/iam02hnminxzz7lucggj.png" alt="Bar graph showing responses by gender: "What is the first outward sign that you're open to interacting with an attractive stranger in a crowded social situation?"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779836975/lexical_client_uploads/u4cqbdeqcyf3ezzulme9.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779836975/lexical_client_uploads/u4cqbdeqcyf3ezzulme9.png" alt="Bar graph titled "When you feel drawn to get closer to someone in a sexual way, how much of your attraction is typically driven by physical features (body, face)?"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Wed, 27 May 2026 07:45:15 GMT</pubDate>
      <guid isPermaLink="false">e4e2b5e3-ed65-4d97-af6d-f828b5bdf730</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/e4e2b5e3-ed65-4d97-af6d-f828b5bdf730.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Gretta%2520Duleba&amp;title=%22Contra%20Wentworth%20on%20Physical%20Attractiveness%20for%20Men%22%20by%20Gretta%20Duleba&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FytzrakjgcvCfLCCZp%2Fcontra-wentworth-on-physical-attractiveness-for-men&amp;created_at=2026-05-27T07%3A36%3A32.455111%2B00%3A00&amp;duration=983" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/ytzrakjgcvCfLCCZp/contra-wentworth-on-physical-attractiveness-for-men</link>
      <itunes:duration>983</itunes:duration>
    </item>
    <item>
      <title>“Practical Learnings from Synthetic Document Finetuning” by Axel Højmark, Jérémy Scheurer</title>
      <description>&lt;p&gt; We've been using Synthetic Document Finetuning (SDF) quite a bit at Apollo Research lately. This post covers a few tweaks to the standard SDF recipe specific to our use cases, plus some general tips and tricks for getting good results. We’re sharing these notes in case they’re useful to others doing research with SDF.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; 1. What Is SDF?&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Synthetic Document Finetuning (SDF) is a knowledge editing technique where models are finetuned on LLM-generated documents consistent with a target fact or belief. As described in Slocum et al. (2025), SDF "often succeeds at implanting beliefs that behave similarly to genuine knowledge." These implanted beliefs can generalize to related contexts, are often robust to scrutiny, and form internal representations similar to genuine knowledge.&lt;/p&gt;&lt;p&gt; We mostly followed the pipeline described in Slocum et al. (2025) and the safety-research/false-facts repository.&lt;/p&gt;&lt;p&gt; The pipeline has several stages:&lt;/p&gt;&lt;ol&gt; &lt;li value="1"&gt;Universe Context:&amp;nbsp;Define a "universe" description where the target belief is true.&lt;/li&gt;&lt;li value="2"&gt;Fact Extraction:&amp;nbsp;Extract discrete claims from that universe that the synthetic documents will revolve around.&lt;/li&gt;&lt;li value="3"&gt;Generation:&amp;nbsp;Use an LLM to generate a large, diverse corpus of synthetic documents. This is done by having the LLM first brainstorm document types (blogs, papers, memos), then come up with specific ideas [...]&lt;/li&gt;&lt;/ol&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:32) 1. What Is SDF?&lt;/p&gt;&lt;p&gt;(02:03) Iterating on Universes and Generation Prompts&lt;/p&gt;&lt;p&gt;(03:42) 2. Getting Models to Surface the Information&lt;/p&gt;&lt;p&gt;(04:14) Dropping the DOCTAG&lt;/p&gt;&lt;p&gt;(04:56) Dropping Webtext to Increase Saliency&lt;/p&gt;&lt;p&gt;(06:13) Matching the Test Distribution&lt;/p&gt;&lt;p&gt;(07:11) Prepending Eval Prompts&lt;/p&gt;&lt;p&gt;(08:20) 3. Training Details&lt;/p&gt;&lt;p&gt;(08:24) Document Length and Token Counts&lt;/p&gt;&lt;p&gt;(09:05) Training for Multiple Epochs&lt;/p&gt;&lt;p&gt;(09:37) Running Experiments with LoRA &amp;amp; Tinker&lt;/p&gt;&lt;p&gt;(10:33) 4. Dealing with Gibberish&lt;/p&gt;&lt;p&gt;(12:14) 5. Evaluating Effects&lt;/p&gt;&lt;p&gt;(13:58) 6. Other Explorations&lt;/p&gt;&lt;p&gt;(17:06) Acknowledgements&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 26th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/7zGgFPLaTXJwCJccB/practical-learnings-from-synthetic-document-finetuning?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/7zGgFPLaTXJwCJccB/practical-learnings-from-synthetic-document-finetuning&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 27 May 2026 07:15:20 GMT</pubDate>
      <guid isPermaLink="false">14ccb24b-b316-4c10-8a09-d72ad6127dcf</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/14ccb24b-b316-4c10-8a09-d72ad6127dcf.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Axel%2520H%25C3%25B8jmark%252C%2520J%25C3%25A9r%25C3%25A9my%2520Scheurer&amp;title=%22Practical%20Learnings%20from%20Synthetic%20Document%20Finetuning%22%20by%20Axel%20H%C3%B8jmark%2C%20J%C3%A9r%C3%A9my%20Scheurer&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F7zGgFPLaTXJwCJccB%2Fpractical-learnings-from-synthetic-document-finetuning&amp;created_at=2026-05-26T19%3A22%3A59.07093%2B00%3A00&amp;duration=1044" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/7zGgFPLaTXJwCJccB/practical-learnings-from-synthetic-document-finetuning</link>
      <itunes:duration>1044</itunes:duration>
    </item>
    <item>
      <title>“RTMH: Pope Leo’s Magnifica Humanitas on AI” by Zvi</title>
      <description>&lt;p&gt; His holiness has spoken, frequently about AI. At eighty two pages of length.&lt;/p&gt;
&lt;p&gt; The full Magnifica Humanitas can be found here.&lt;/p&gt;
&lt;p&gt; I am very happy that Pope Leo takes these issues seriously, and is sharing his views, and bringing a form of moral clarity, even with all the flaws and central errors. More people with voice should share their views in this way, even when I disagree.&lt;/p&gt;
&lt;p&gt; It's a weird document. Much of it is not about AI at all.&lt;/p&gt;
&lt;p&gt; I do agree with the Pope's most basic point on AI, which is that AI can be what we make of it. That we can steer this technology, determine how it is developed and used, and this can determine whether we get a good or not so good future. We cannot purely leave this to market incentives and strategic pressures. Yes, very much so.&lt;/p&gt;







&lt;p&gt; The central problem is that so much of Leo's worldview is some combination, to me, of highly alien and highly wrong. You might think that would primarily have a lot to do with him being the Pope and rather Catholic, and being a man of faith, whereas I am not [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(03:02) A Brief History of Magnifica Humanitas&lt;/p&gt;&lt;p&gt;(12:23) Economic Models Very Different From Our Own&lt;/p&gt;&lt;p&gt;(14:14) A New Jerusalem&lt;/p&gt;&lt;p&gt;(16:46) So Sayeth The Pope (on AI)&lt;/p&gt;&lt;p&gt;(31:30) The Case Against Human Achievement&lt;/p&gt;&lt;p&gt;(34:01) Truth, Justice and the Vatican Way&lt;/p&gt;&lt;p&gt;(36:42) They Took Our Jobs&lt;/p&gt;&lt;p&gt;(41:07) What Is Not Fair In Love and War&lt;/p&gt;&lt;p&gt;(47:02) Come Ye Christian Faithful&lt;/p&gt;&lt;p&gt;(49:13) The Other Missing Mood&lt;/p&gt;&lt;p&gt;(51:42) Pope Given About Five Words&lt;/p&gt;&lt;p&gt;(55:10) 0The Anthropic Principle&lt;/p&gt;&lt;p&gt;(57:20) Claude Can Read Your Code&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 26th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/RZeg98Tdrq47vbjva/rtmh-pope-leo-s-magnifica-humanitas-on-ai?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/RZeg98Tdrq47vbjva/rtmh-pope-leo-s-magnifica-humanitas-on-ai&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Tue, 26 May 2026 20:15:21 GMT</pubDate>
      <guid isPermaLink="false">efe9ec0c-af23-425d-b354-ffb1c4258f66</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/efe9ec0c-af23-425d-b354-ffb1c4258f66.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Zvi&amp;title=%22RTMH%3A%20Pope%20Leo%E2%80%99s%20Magnifica%20Humanitas%20on%20AI%22%20by%20Zvi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FRZeg98Tdrq47vbjva%2Frtmh-pope-leo-s-magnifica-humanitas-on-ai&amp;created_at=2026-05-26T13%3A23%3A48.994186%2B00%3A00&amp;duration=3580" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/RZeg98Tdrq47vbjva/rtmh-pope-leo-s-magnifica-humanitas-on-ai</link>
      <itunes:duration>3580</itunes:duration>
    </item>
    <item>
      <title>“Claude, Author of the Humanitas” by Linch</title>
      <description>&lt;p&gt; In the wee hours of Memorial Day, my friends and I stayed up past 4:30 AM California time to listen to the announcement of Pope Leo's first encyclical, Magnifica Humanitas, on safeguarding the human person in the time of artificial intelligence. We were excited albeit sleepy, eagerly anticipating the event and upcoming essay by the world's foremost religious authority on a question so central to our world. Still we were an odd audience for this presentation: none of us are practicing Catholics, and most of us didn’t really know what to expect.&lt;/p&gt;&lt;p&gt; I thought Pope Leo's own speech was good, and addressed the current moment in AI with some of the seriousness it deserves. I thought the other speeches, including by Chris Olah, were less impressive. But that's okay, I’m not the target audience!&lt;/p&gt;&lt;p&gt; A specific cardinal's point struck me, however:&lt;/p&gt;&lt;p&gt; Cardinal Parolin made much of a specific prepositional choice in the subtitle: “sulla custodia della persona umana nel tempo dell’intelligenza artificiale,“ which the live translator translated to something like “on the safeguarding of the human person in the time of AI,” and not “sull’intelligenza artificiale“ – “on AI.”&lt;/p&gt;&lt;p&gt; This was supposed to be a big deal. “In the [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(04:02) Significant fractions of the recent papal encyclical are written by AI&lt;/p&gt;&lt;p&gt;(07:17) Statistical Evidence and Tells&lt;/p&gt;&lt;p&gt;(07:40) Em-dashes&lt;/p&gt;&lt;p&gt;(10:04) "Genuinely"&lt;/p&gt;&lt;p&gt;(10:51) How often is the phrase "genuinely" used in Magnifica Humanitas?&lt;/p&gt;&lt;p&gt;(11:43) Is this due to subject matter?&lt;/p&gt;&lt;p&gt;(15:30) Is this just a personality quirk of Pope Leo XIV specifically?&lt;/p&gt;&lt;p&gt;(16:58) Tricolon density&lt;/p&gt;&lt;p&gt;(19:15) Pangram analysis&lt;/p&gt;&lt;p&gt;(21:21) Comparison to other encyclicals&lt;/p&gt;&lt;p&gt;(22:45) Comparison to Pope Leo XIV's speech&lt;/p&gt;&lt;p&gt;(23:14) Sidebar: Pangram has a very low false positive rate in general&lt;/p&gt;&lt;p&gt;(24:20) Probably not a translation artifact&lt;/p&gt;&lt;p&gt;(24:46) The same signs of AI I observe in English are essentially preserved verbatim in the Italian version&lt;/p&gt;&lt;p&gt;(25:58) The Italian version of the current encyclical also gets flagged as AI by Pangram&lt;/p&gt;&lt;p&gt;(26:47) Backtesting AI translation of past encyclicals get 0% on Pangram&lt;/p&gt;&lt;p&gt;(27:55) The specific AI used is likely Claude&lt;/p&gt;&lt;p&gt;(28:44) Textual&lt;/p&gt;&lt;p&gt;(29:30) Against ChatGPT&lt;/p&gt;&lt;p&gt;(30:06) Different sections of the encyclical have very different rates of apparent AI usage&lt;/p&gt;&lt;p&gt;(30:59) Conclusion&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 20 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 26th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/wRNJZz2iYrfDaSDdz/claude-author-of-the-humanitas?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/wRNJZz2iYrfDaSDdz/claude-author-of-the-humanitas&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!_gwP!,w_720,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd124a739-f13c-4b7f-8efc-98602b4bdd46_2562x1636.png" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!_gwP!,w_720,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd124a739-f13c-4b7f-8efc-98602b4bdd46_2562x1636.png" alt="Pangram on some past encyclicals" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!LbDM!,w_720,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5bc62f3-26d1-492b-9d97-ca9d9008422a_2566x1590.png" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!LbDM!,w_720,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5bc62f3-26d1-492b-9d97-ca9d9008422a_2566x1590.png" alt="Pangram on some past encyclicals" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!rKss!,w_720,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6487895d-a835-40c5-b85f-16ee9fb2b3e3_2560x1636.png" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!rKss!,w_720,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6487895d-a835-40c5-b85f-16ee9fb2b3e3_2560x1636.png" alt="Pangram on some past encyclicals" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!QFni!,w_720,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feca6ac4b-b57c-49fc-a296-375d8014721e_2538x1602.png" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!QFni!,w_720,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feca6ac4b-b57c-49fc-a296-375d8014721e_2538x1602.png" alt="Pangram on some past encyclicals" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!9Anp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b47972a-a3a4-489e-9dc7-9267909b2531_700x467.png" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!9Anp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b47972a-a3a4-489e-9dc7-9267909b2531_700x467.png" alt="Pope Francis in white robes speaking with two people." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!FAHR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80cb88b0-cce3-431d-b7da-bc0836834f25_760x507.png" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!FAHR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80cb88b0-cce3-431d-b7da-bc0836834f25_760x507.png" alt="Pope holding signed document titled "Magnifica Humanitas" while waving." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!RAeJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac481801-252f-4cb0-8c0a-e0c8c2234c42_2048x885.png" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!RAeJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac481801-252f-4cb0-8c0a-e0c8c2234c42_2048x885.png" alt="Table showing triadic rhetorical structures in papal documents from 1891 to 2026." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!mrUR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5a138ec-9c15-4ef0-825d-77adaa31d85a_2048x1201.png" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!mrUR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5a138ec-9c15-4ef0-825d-77adaa31d85a_2048x1201.png" alt="Document analysis showing 11% AI-generated content, 89% human-written text." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!39gX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febbe8ee0-6fef-45c1-b1d9-333895527ed0_2048x1139.png" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!39gX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febbe8ee0-6fef-45c1-b1d9-333895527ed0_2048x1139.png" alt="AI-generated content indicator showing high confidence score with navigation options." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!TmKp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1021646f-6681-492e-8d94-0b012c803776_2048x1269.png" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!TmKp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1021646f-6681-492e-8d94-0b012c803776_2048x1269.png" alt="Document analysis showing 100% human-written text with plagiarism checker interface." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!jmA1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6a3418c-80b8-4aef-9fa6-a44c014b932a_2048x1301.png" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!jmA1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6a3418c-80b8-4aef-9fa6-a44c014b932a_2048x1301.png" alt="Document text with AI detection panel showing 60% AI-generated content." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!-wMK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F85973311-f7fc-4e88-9dae-ff1b7a4bb81c_882x1440.png" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!-wMK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F85973311-f7fc-4e88-9dae-ff1b7a4bb81c_882x1440.png" alt="Text excerpt analyzing Italian language patterns in AI-generated content, focusing on dash usage and structural features." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Tue, 26 May 2026 17:00:20 GMT</pubDate>
      <guid isPermaLink="false">d1aac25e-f3be-42ce-b6bf-195b3dc0a62f</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/d1aac25e-f3be-42ce-b6bf-195b3dc0a62f.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Linch&amp;title=%22Claude%2C%20Author%20of%20the%20Humanitas%22%20by%20Linch&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FwRNJZz2iYrfDaSDdz%2Fclaude-author-of-the-humanitas&amp;created_at=2026-05-26T16%3A05%3A33.376111%2B00%3A00&amp;duration=2089" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/wRNJZz2iYrfDaSDdz/claude-author-of-the-humanitas</link>
      <itunes:duration>2089</itunes:duration>
    </item>
    <item>
      <title>“Brackets Are a Bad Way to Regulate” by Hide</title>
      <description>&lt;p&gt; Continuous distributions are everywhere - for virtually everything we care about, a little more is a little better (or worse), and a lot more is a lot better (or worse). This presents a problem - we need to create rules that reasonably and fairly apply across these continuums, where the degree to which a thing possesses a trait makes a difference to the reasonable treatment of it. &lt;/p&gt;&lt;p&gt; Going 1m/h over the limit, and going 150 in a 40 zone are both “speeding”, yet we must punish these things differently. &lt;/p&gt;&lt;p&gt; The default solution for almost all regulations is to slice these continuous distributions into chunks, and treat the chunks as basically equivalent phenomena - squishing a continuous distribution into five or so blocks, and manually writing rules to apply uniformly within the blocks.&lt;/p&gt;&lt;p&gt; Examples include:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;Speed limits&lt;/li&gt;&lt;li value="2"&gt;Tax brackets&lt;/li&gt;&lt;li value="3"&gt;Sentencing thresholds&lt;/li&gt;&lt;li value="4"&gt;Overtime thresholds&lt;/li&gt;&lt;li value="5"&gt;Pension eligibility&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; This is a very bad system.&lt;/p&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; Brackets are fundamentally inefficient&lt;/p&gt;&lt;p&gt; For any bracket over a continuous distribution, the upper section of the bracket has more of the trait than the bottom section. &lt;/p&gt;&lt;p&gt; As a result, for any incentive or punishment applying uniformly across a bracket, the ends of the [...]&lt;/p&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 3 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 26th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/FDxvpznjMzGNHjDRS/brackets-are-a-bad-way-to-regulate?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/FDxvpznjMzGNHjDRS/brackets-are-a-bad-way-to-regulate&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779764258/lexical_client_uploads/wuynrbylzk4hod8efbyq.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779764258/lexical_client_uploads/wuynrbylzk4hod8efbyq.png" alt="Normal distribution curve showing height variation with short, average, and tall markers." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779764257/lexical_client_uploads/w7ysmrpj4xtlabovkogi.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779764257/lexical_client_uploads/w7ysmrpj4xtlabovkogi.png" alt="Pricing comparison showing Starter plan at $11/month/seat and Professional plan at $1,241/month." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779764258/lexical_client_uploads/yrbhj9b8o1ujebrvafrz.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779764258/lexical_client_uploads/yrbhj9b8o1ujebrvafrz.png" alt="Graph comparing mandatory minimum sentences for drug quantities under current law versus a power formula." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Tue, 26 May 2026 11:45:20 GMT</pubDate>
      <guid isPermaLink="false">be303f74-e6eb-4c38-801e-f16c03e80858</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/be303f74-e6eb-4c38-801e-f16c03e80858.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Hide&amp;title=%22Brackets%20Are%20a%20Bad%20Way%20to%20Regulate%22%20by%20Hide&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FFDxvpznjMzGNHjDRS%2Fbrackets-are-a-bad-way-to-regulate&amp;created_at=2026-05-26T03%3A05%3A19.35404%2B00%3A00&amp;duration=626" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/FDxvpznjMzGNHjDRS/brackets-are-a-bad-way-to-regulate</link>
      <itunes:duration>626</itunes:duration>
    </item>
    <item>
      <title>“Many portions of Magnifica Humanitas appear to be AI-written” by DanielFilan</title>
      <description>&lt;p&gt; Magnifica Humanitas is a recent ‘encyclical’ by Pope Leo XIV, leader of the Catholic Church. It outlines a vision for how humanity should interact with artificial intelligence, emphasizing the importance of human dignity and ensuring that AI does not replace human relationships, among other topics. Interestingly, many portions appear to be written by AI.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt; Why I thought to check this&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt; Friends of mine Linch Zhang and the Axolotl noticed that parts of the English text appear to be AI-generated, and twitter user kartr found that the Italian text had the largest fraction of AI-generated content out of all the translations published by the Vatican, speculating that it was the original copy, and translations by humans appear less ‘AI-generated’ to various tools.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt; What I actually did&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt; I took the Italian text of Magnifica Humanitas, and ran it thru the Pangram AI detector software.&lt;/p&gt;

&lt;p&gt; Why the Italian text? As mentioned in the previous segment, it had previously been claimed to be the most AI-ish version. Also, given that the Vatican is in Italy, it's a reasonable guess that the text was initially drafted in Italian (presumably by a few different people with the Pope's input).&lt;/p&gt;

&lt;p&gt; [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:34) Why I thought to check this&lt;/p&gt;&lt;p&gt;(00:58) What I actually did&lt;/p&gt;&lt;p&gt;(01:42) My results&lt;/p&gt;&lt;p&gt;(01:59) The introduction&lt;/p&gt;&lt;p&gt;(02:15) Chapter 1&lt;/p&gt;&lt;p&gt;(02:40) Chapter 2&lt;/p&gt;&lt;p&gt;(02:53) Chapter 3&lt;/p&gt;&lt;p&gt;(03:08) Chapter 4&lt;/p&gt;&lt;p&gt;(03:34) Chapter 5&lt;/p&gt;&lt;p&gt;(03:56) Conclusion&lt;/p&gt;&lt;p&gt;(04:11) Should you trust these results?&lt;/p&gt;&lt;p&gt;(04:46) Studies indicate Pangram works pretty well&lt;/p&gt;&lt;p&gt;(05:23) When I play around with Pangram, it works pretty well&lt;/p&gt;&lt;p&gt;(05:58) If you've seen examples of AI detectors claiming that obviously human-written text was AI, those were probably not Pangram&lt;/p&gt;&lt;p&gt;(06:36) Does this matter?&lt;/p&gt;&lt;p&gt;(12:20) Future work&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 26th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/GbWwesBnetyiomxEH/many-portions-of-magnifica-humanitas-appear-to-be-ai-written?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/GbWwesBnetyiomxEH/many-portions-of-magnifica-humanitas-appear-to-be-ai-written&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Tue, 26 May 2026 10:00:20 GMT</pubDate>
      <guid isPermaLink="false">1e67944e-1e3c-4008-8d88-ba74aa8a5942</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/1e67944e-1e3c-4008-8d88-ba74aa8a5942.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=DanielFilan&amp;title=%22Many%20portions%20of%20Magnifica%20Humanitas%20appear%20to%20be%20AI-written%22%20by%20DanielFilan&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FGbWwesBnetyiomxEH%2Fmany-portions-of-magnifica-humanitas-appear-to-be-ai-written&amp;created_at=2026-05-26T07%3A42%3A39.290763%2B00%3A00&amp;duration=766" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/GbWwesBnetyiomxEH/many-portions-of-magnifica-humanitas-appear-to-be-ai-written</link>
      <itunes:duration>766</itunes:duration>
    </item>
    <item>
      <title>“Donating 80% While It Still Counts” by jefftk</title>
      <description>&lt;p&gt; 



Julia and I had been giving half


since 2014, but in
2025 we drew on our savings to donate 81%. It looks to us like we're
in a critical window for keeping

the
introduction of

very
powerful

AI
systems

from


being
disastrous, and we want to do what we can while we still can.



&lt;/p&gt;&lt;p&gt; 

Here's what that looks like in the context of our overall spending:

&lt;/p&gt;





There's a details box here with the title "table...". The box contents are omitted from this narration.



&lt;p&gt; 



We've been prioritizing donations for a long
time, but it feels very different now because of the AI boom.
Some of this is that people who've made money in the boom will
likely be giving more soon, and so money spent now can help set up
organizations to spend future money more effectively. But more
importantly, this is a key window of opportunity: transformative AI is
coming very
quickly, for better or worse.
We want to push hard for "better".



&lt;/p&gt;

&lt;p&gt; 

If you compare to previous years (2024, 2022, 2020, 2018, 2016, 2014), we're donating a lot less than
we used to in absolute terms:

&lt;/p&gt;





There's a details box here with [...] &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(04:21) Evaluating Predictions&lt;/p&gt;&lt;p&gt;(06:28) Making New Predictions&lt;/p&gt;&lt;p&gt;(08:25) Details&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 26th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/mAhCkK5iBuZvCbFey/donating-80-while-it-still-counts?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/mAhCkK5iBuZvCbFey/donating-80-while-it-still-counts&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/mAhCkK5iBuZvCbFey/b08yraqbvjb7bgqocrkc" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/mAhCkK5iBuZvCbFey/b08yraqbvjb7bgqocrkc" alt="Bar graph titled "2025 Spending" showing budget allocation across eight categories." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/mAhCkK5iBuZvCbFey/p6bjkihefbpjbbcpottx" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/mAhCkK5iBuZvCbFey/p6bjkihefbpjbbcpottx" alt="A bar graph titled "Historical Spending Comparison" showing spending across categories by year." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/mAhCkK5iBuZvCbFey/nmp6aaps6fghrx28owjl" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/mAhCkK5iBuZvCbFey/nmp6aaps6fghrx28owjl" alt="Line graph titled "Donations vs Income" showing two trends from 2012 to 2025." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/mAhCkK5iBuZvCbFey/olwnivtxdunfjvmm91ks" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/mAhCkK5iBuZvCbFey/olwnivtxdunfjvmm91ks" alt="Line graph titled "Net Worth Over Time" showing house, retirement, and liquid assets in January 2026 dollars." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Tue, 26 May 2026 02:45:20 GMT</pubDate>
      <guid isPermaLink="false">48d500d3-78d8-40bd-8b2e-ab0bd66e89dd</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/48d500d3-78d8-40bd-8b2e-ab0bd66e89dd.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=jefftk&amp;title=%22Donating%2080%25%20While%20It%20Still%20Counts%22%20by%20jefftk&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FmAhCkK5iBuZvCbFey%2Fdonating-80-while-it-still-counts&amp;created_at=2026-05-26T01%3A32%3A04.755076%2B00%3A00&amp;duration=730" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/mAhCkK5iBuZvCbFey/donating-80-while-it-still-counts</link>
      <itunes:duration>730</itunes:duration>
    </item>
    <item>
      <title>“Cognitive Security as an AI Safety Cause Area” by jsteinhardt</title>
      <description>&lt;p&gt; As AI systems become more capable, the cognitive security of humans will be increasingly at risk. By cognitive security, I mean the ability of humans to maintain control over their beliefs and actions.&lt;/p&gt;&lt;p&gt; Cognitive security could be compromised in several ways: AI could become very good at persuading people of arbitrary positions; interacting with AI could lead humans to lose touch with reality; and AIs could become very effective at blackmail or at producing extremely convincing false information.&lt;/p&gt;&lt;p&gt; We are already seeing this happen:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;Persuasion. Frontier LLMs are now as persuasive as humans on political issues, and post-training for persuasiveness boosts performance further, suggesting there is headroom.&lt;/li&gt;&lt;li value="2"&gt;AI psychosis. There are many reports of people developing delusional beliefs after extended chatbot conversations, including people with no prior history of mental illness. Children have taken their own lives after being encouraged toward suicide by chatbots.&lt;/li&gt;&lt;li value="3"&gt;Convincing impersonation. Scammers used real-time deepfaked video to impersonate the CFO and other staff of Arup on a video call, convincing a finance employee to wire 25.6 million dollars across 15 transactions. On a more day-to-day basis, AI voice cloning is now widespread in family-emergency and "grandparent" scams.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; Right now, many of these effects [...]&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 2 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 25th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/KGcE7eAdfxHchk25X/cognitive-security-as-an-ai-safety-cause-area?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/KGcE7eAdfxHchk25X/cognitive-security-as-an-ai-safety-cause-area&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Mon, 25 May 2026 19:30:21 GMT</pubDate>
      <guid isPermaLink="false">2979fdc9-1410-4cb1-a1a1-2f8f60c137bf</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/2979fdc9-1410-4cb1-a1a1-2f8f60c137bf.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=jsteinhardt&amp;title=%22Cognitive%20Security%20as%20an%20AI%20Safety%20Cause%20Area%22%20by%20jsteinhardt&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FKGcE7eAdfxHchk25X%2Fcognitive-security-as-an-ai-safety-cause-area&amp;created_at=2026-05-25T18%3A30%3A19.844475%2B00%3A00&amp;duration=319" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/KGcE7eAdfxHchk25X/cognitive-security-as-an-ai-safety-cause-area</link>
      <itunes:duration>319</itunes:duration>
    </item>
    <item>
      <title>“Linkpost: New Vatican Encyclical on AI Governance” by Jackson Wagner</title>
      <description>&lt;p&gt; Pope Leo XIV has released a new, 42k-word encyclical laying out the Vatican's position on many AI safety topics. You can read the full thing here, or read the Vatican's press release here, or coverage in the NY Times, or perhaps consider having an LLM read the whole encyclical, then chatting about whatever specifics you're interested in!&lt;/p&gt;&lt;p&gt; Below is a portion of the NY Times story on the event:&lt;/p&gt;&lt;p&gt; Leo's declaration outlined his desire to protect human dignity and agency in an age in which technology threatens to replace humans in many professional and social roles. He presented it alongside Christopher Olah, a co-founder of Anthropic, a major A.I. developer, in a symbolic gesture of dialogue between leaders of the spiritual and technological worlds.&lt;/p&gt;&lt;p&gt; While emphasizing that “technology should not be considered, in itself, as a force antagonistic to humanity,” he wrote that “the pursuit of greater profits cannot justify choices that systematically sacrifice jobs.”&lt;/p&gt;&lt;p&gt; Among other things, Leo called for:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;government regulation of the private companies that are driving the development of A.I.&lt;/li&gt;&lt;li value="2"&gt;protection and retraining for workers whose jobs are threatened&lt;/li&gt;&lt;li value="3"&gt;education to help students think critically about the technology&lt;/li&gt;&lt;li value="4"&gt;action to protect children from violent [...]&lt;/li&gt;&lt;/ul&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 25th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/ehdv7z7L7ESxtKL4j/linkpost-new-vatican-encyclical-on-ai-governance?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/ehdv7z7L7ESxtKL4j/linkpost-new-vatican-encyclical-on-ai-governance&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Mon, 25 May 2026 18:30:20 GMT</pubDate>
      <guid isPermaLink="false">e85837cc-5dcb-47a7-9254-267837f670c5</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/e85837cc-5dcb-47a7-9254-267837f670c5.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Jackson%2520Wagner&amp;title=%22Linkpost%3A%20New%20Vatican%20Encyclical%20on%20AI%20Governance%22%20by%20Jackson%20Wagner&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Fehdv7z7L7ESxtKL4j%2Flinkpost-new-vatican-encyclical-on-ai-governance&amp;created_at=2026-05-25T15%3A41%3A05.474918%2B00%3A00&amp;duration=140" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/ehdv7z7L7ESxtKL4j/linkpost-new-vatican-encyclical-on-ai-governance</link>
      <itunes:duration>140</itunes:duration>
    </item>
    <item>
      <title>“A (Slightly) Mechanistic Theory for Exponentially Increasing AI Time Horizons?” by Oliver Sourbut</title>
      <description>&lt;p&gt; AI ‘time horizons’ are mostly not about time (I think it's mostly ‘data’, but you’ll see where I’m unsure).&lt;/p&gt;&lt;p&gt; One chart from 2025 has become perhaps the most (in)famous in modern AI commentary.&lt;/p&gt;&lt;p&gt; For those in the know, ‘the METR graph’[1] is unusually compelling because it achieves what so few measures of AI progress have achieved: a somewhat meaningful Y axis (‘time horizon’[2]) as well as a somewhat predictable trend over time! (This is remarkably rare!)&lt;/p&gt;&lt;p&gt; Frustratingly, the only superficially available takeaway is something like, ‘the line goes up straight-ish over time’. This is better than nothing, but it's very dissatisfactory from the point of view of getting confidence in the predictions, because it exposes no deeper mechanism. This drives a lot of confusion and argument about the implications.&lt;/p&gt;&lt;p&gt; A deeper mechanism would be good for two reasons:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;It enables a sanity check on the trend, perhaps enabling more confidence in its predictions than we would sensibly allow with only the surface understanding.&lt;/li&gt;&lt;li value="2"&gt;It gives some way to interrogate when and how the trend might change&amp;nbsp;(because if the deeper mechanism gets deflected, the superficial projection would be broken, but a prediction based on the deeper mechanism might stay [...]&lt;/li&gt;&lt;/ul&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(02:14) Attempting to find some mechanism in the METR graph&lt;/p&gt;&lt;p&gt;(02:19) Task 'length' and success modelling&lt;/p&gt;&lt;p&gt;(05:46) Relating hazard rate with frontier AI development&lt;/p&gt;&lt;p&gt;(06:56) Why does hazard rate shrink with date?&lt;/p&gt;&lt;p&gt;(10:27) Upshot&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 15 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 24th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/zT76JcomKkdqo8tC6/a-slightly-mechanistic-theory-for-exponentially-increasing?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/zT76JcomKkdqo8tC6/a-slightly-mechanistic-theory-for-exponentially-increasing&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/zT76JcomKkdqo8tC6/d27d6bee0f252d1bac45930796386c78c788ec2b57620715f4d2bd32e90ac4a1/milmsztmtrl8vvygvrfl" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/zT76JcomKkdqo8tC6/d27d6bee0f252d1bac45930796386c78c788ec2b57620715f4d2bd32e90ac4a1/milmsztmtrl8vvygvrfl" alt="Horizontal bar chart showing subtask count distribution by human expert hours." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/zT76JcomKkdqo8tC6/c0658e82b6ae021a2145e9d655c7c1c5160f73893841cc4272cacfb5e16fc17f/uuof2fzbczxakpxvd3ks" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/zT76JcomKkdqo8tC6/c0658e82b6ae021a2145e9d655c7c1c5160f73893841cc4272cacfb5e16fc17f/uuof2fzbczxakpxvd3ks" alt="Bar chart showing subtask count versus human expert hours with X markers indicating specific data points." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/4ff4fbe4dd92fbd436ba9733f7686982b3dfe50cca9b3f705b4380e441d622d2/gwzrnngbqe7pzr5bpfgb" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/4ff4fbe4dd92fbd436ba9733f7686982b3dfe50cca9b3f705b4380e441d622d2/gwzrnngbqe7pzr5bpfgb" alt="Graph showing AI task length doubling every seven months from 2019 to 2027." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Mon, 25 May 2026 05:30:21 GMT</pubDate>
      <guid isPermaLink="false">5649cea5-c97d-4dde-ae08-baf5138834b6</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/5649cea5-c97d-4dde-ae08-baf5138834b6.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Oliver%2520Sourbut&amp;title=%22A%20(Slightly)%20Mechanistic%20Theory%20for%20Exponentially%20Increasing%20AI%20Time%20Horizons%3F%22%20by%20Oliver%20Sourbut&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FzT76JcomKkdqo8tC6%2Fa-slightly-mechanistic-theory-for-exponentially-increasing&amp;created_at=2026-05-24T15%3A52%3A22.997826%2B00%3A00&amp;duration=872" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/zT76JcomKkdqo8tC6/a-slightly-mechanistic-theory-for-exponentially-increasing</link>
      <itunes:duration>872</itunes:duration>
    </item>
    <item>
      <title>“Taxing Small Cars To Improve MPG” by jefftk</title>
      <description>&lt;p&gt; 

Cars and trucks are getting bigger, and I had a vague sense that fuel
economy regulations were partly to blame. Looking into it, it's hard
to say how much is regulations vs people wanting to buy vehicles that
look rugged, but the regulations really aren't helping.



&lt;/p&gt;&lt;p&gt; 

This chart is the core of it:

&lt;/p&gt;





&lt;p&gt; 

This is what manufacturers were looking at when they decided to build
today's cars. To figure out the target fuel economy for a vehicle you
first calculate its "footprint", which is the area between the
wheels. On our 2013 Honda Fit that's
4.8ft side-to-side and 8.2ft front-to-back, for a footprint of 39sqft.
Then you ask if it's a car or truck. This tells you which curve to
use, and where along it to look.

&lt;/p&gt;

&lt;p&gt; 

Looking at the chart we can now see why it's hard for Honda to sell a
Fit today. The best Honda could do for a five-seater non-hybrid
hatchback is maybe a CAFE rating of 44mpg. [1] This puts them 23mpg
short, and if Honda was a one-model car company they'd expect to owe
$3,910/vehicle in fines: $17 per 0.1mpg shortfall. Since the
regulation is about an average [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 24th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/7K8Bnn6Pwh5ciXpKR/taxing-small-cars-to-improve-mpg?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/7K8Bnn6Pwh5ciXpKR/taxing-small-cars-to-improve-mpg&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/7K8Bnn6Pwh5ciXpKR/tetyhwpegs9pjaoyrzby" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/7K8Bnn6Pwh5ciXpKR/tetyhwpegs9pjaoyrzby" alt="Graph showing "2026 CAFE MPG Targets" comparing car and light truck fuel efficiency by footprint." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Mon, 25 May 2026 03:15:20 GMT</pubDate>
      <guid isPermaLink="false">deafc39c-7954-4121-9935-57f669ee399e</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/deafc39c-7954-4121-9935-57f669ee399e.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=jefftk&amp;title=%22Taxing%20Small%20Cars%20To%20Improve%20MPG%22%20by%20jefftk&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F7K8Bnn6Pwh5ciXpKR%2Ftaxing-small-cars-to-improve-mpg&amp;created_at=2026-05-24T21%3A54%3A49.7025%2B00%3A00&amp;duration=296" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/7K8Bnn6Pwh5ciXpKR/taxing-small-cars-to-improve-mpg</link>
      <itunes:duration>296</itunes:duration>
    </item>
    <item>
      <title>“We made a map of the doom debate” by Sean Herrington, Paul Hindoian, mikaelacankosyan, David Bravo, keivnc, Josh Tuffy, Christopher Davis, Khai Tran, Maryam Hampaei</title>
      <description>&lt;p&gt; This was produced as a part of the AI Safety Camp 2026 "Assumptions of the Doom Debate" project, led by Sean Herrington, who was also the lead author on this post. The other participants have equal contributions and are listed in no particular order. It is the first in a sequence we intend to publish over the coming weeks. &lt;/p&gt;&lt;p&gt; TL;DR:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;We have created a breakdown of AI threat pathways, which can be accessed at https://lifeuniversesafety.com/doom-assumptions/index.html&lt;/li&gt;&lt;li value="2"&gt;This breakdown is in a tree format, and we allow people to set their own probabilities for each threat pathway&lt;/li&gt;&lt;li value="3"&gt;You can use this to drill down into components of your P(Doom)&lt;/li&gt;&lt;li value="4"&gt;You can analyse the sensitivity of your beliefs to changes in your assumptions&lt;/li&gt;&lt;li value="5"&gt;You can compare your worldview to others’ and find cruxes automatically&lt;/li&gt;&lt;li value="6"&gt;The exercise of creating this sort of structure is valuable as a way to think about the future in a more general manner.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;strong&gt; Introduction&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Just about everyone in the AI community seems to disagree about the risks of the technology. People disagree on the likelihood (Yann LeCun: &amp;lt;0.01% chance of extinction; Roman Yampolskiy: &amp;gt;99.99%), the worst threat (AI takeover vs concentration of power vs gradual disempowerment [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:07) Introduction&lt;/p&gt;&lt;p&gt;(04:13) User Guide&lt;/p&gt;&lt;p&gt;(04:16) Labels&lt;/p&gt;&lt;p&gt;(04:55) Base Tree&lt;/p&gt;&lt;p&gt;(05:15) Tree features&lt;/p&gt;&lt;p&gt;(05:19) Pin branch overrides&lt;/p&gt;&lt;p&gt;(05:56) Collapse tree&lt;/p&gt;&lt;p&gt;(06:16) Sub-branch probabilities&lt;/p&gt;&lt;p&gt;(06:51) Connector thickness&lt;/p&gt;&lt;p&gt;(07:06) Sensitivity Analysis&lt;/p&gt;&lt;p&gt;(07:31) Crux Analysis&lt;/p&gt;&lt;p&gt;(08:48) Ranges&lt;/p&gt;&lt;p&gt;(09:38) Uncertainty Reduction&lt;/p&gt;&lt;p&gt;(10:04) Other features&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 4 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 22nd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/dZZD5Lviid2mALAZL/we-made-a-map-of-the-doom-debate?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/dZZD5Lviid2mALAZL/we-made-a-map-of-the-doom-debate&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779357236/lexical_client_uploads/t4snjbhuxcmknhlsucqy.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779357236/lexical_client_uploads/t4snjbhuxcmknhlsucqy.png" alt="Settings panel showing AI risk assessment sliders at 50% each." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779357173/lexical_client_uploads/dvg0m4kfz0vp4rvaivms.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779357173/lexical_client_uploads/dvg0m4kfz0vp4rvaivms.png" alt="Poll interface showing probabilities for AI-driven world scenarios with sliders." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778251166/lexical_client_uploads/eabts9gzlkakgatgjqug.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778251166/lexical_client_uploads/eabts9gzlkakgatgjqug.png" alt="Interactive probability diagram showing conditional probabilities with adjustable sliders for events E, A, and ¬A." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778251166/lexical_client_uploads/vgjatm3p38458jtocrrr.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778251166/lexical_client_uploads/vgjatm3p38458jtocrrr.png" alt="Decision tree diagram showing probability pathways for AI-driven versus non-AI existential catastrophe scenarios." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779356651/lexical_client_uploads/x2gmwyllgpqhcttavbef.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779356651/lexical_client_uploads/x2gmwyllgpqhcttavbef.png" alt="Probability tree diagram showing existential catastrophe scenarios over 30 years with AI impact." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779356156/lexical_client_uploads/e51y8edhf5mtrypeuaks.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779356156/lexical_client_uploads/e51y8edhf5mtrypeuaks.png" alt="Diagram showing probability percentages for AI-driven existential catastrophe scenarios across different world types." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779356975/lexical_client_uploads/qmunfiwtlwxdfy5eqkvi.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779356975/lexical_client_uploads/qmunfiwtlwxdfy5eqkvi.png" alt="Poll interface showing AI risk scenarios with 12.5% and two 50.0% probability sliders." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778251166/lexical_client_uploads/bnnkvqxaikamtpala7rs.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778251166/lexical_client_uploads/bnnkvqxaikamtpala7rs.png" alt="Sensitivity analysis bar chart showing probability changes per percentage point change in assumptions." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779353159/lexical_client_uploads/ycbh3px6b0mb3jxtq5f9.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779353159/lexical_client_uploads/ycbh3px6b0mb3jxtq5f9.png" alt="Comparison table of worldview assumptions about AI catastrophe showing probability differences between Hinton-like and Bengio-like perspectives." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778251167/lexical_client_uploads/hxikciqbxdictwvi5fu0.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778251167/lexical_client_uploads/hxikciqbxdictwvi5fu0.png" alt="Bar chart titled "Uncertainty Reduction" showing nine scenarios ranked by probability values in percentage points." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sun, 24 May 2026 13:45:20 GMT</pubDate>
      <guid isPermaLink="false">cb013ab5-4e42-4a7b-9446-531223469dff</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/cb013ab5-4e42-4a7b-9446-531223469dff.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Sean%2520Herrington%252C%2520Paul%2520Hindoian%252C%2520mikaelacankosyan%252C%2520David%2520Bravo%252C%2520keivnc%252C%2520Josh%2520Tuffy%252C%2520Christopher%2520Davis%252C%2520Khai%2520Tran%252C%2520Maryam%2520Hampaei&amp;title=%22We%20made%20a%20map%20of%20the%20doom%20debate%22%20by%20Sean%20Herrington%2C%20Paul%20Hindoian%2C%20mikaelacankosyan%2C%20David%20Bravo%2C%20keivnc%2C%20Josh%20Tuffy%2C%20Christopher%20Davis%2C%20Khai%20Tran%2C%20Maryam%20Hampaei&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FdZZD5Lviid2mALAZL%2Fwe-made-a-map-of-the-doom-debate&amp;created_at=2026-05-22T23%3A24%3A16.603443%2B00%3A00&amp;duration=637" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/dZZD5Lviid2mALAZL/we-made-a-map-of-the-doom-debate</link>
      <itunes:duration>637</itunes:duration>
    </item>
    <item>
      <title>“Your Left Brain Doesn’t Trade With Your Right” by Alexander Gietelink Oldenziel</title>
      <description>&lt;p&gt; [see also Four Ways Learning Economics makes you people dumber future AI]&lt;/p&gt;&lt;p&gt; This is a tweet by Seb Krier that caught my eye. The exact person and exact points are incidental. It illustrates what to is a flaw in many 'economics' frames on AI. &lt;/p&gt;&lt;p&gt; Expecting a model to do all the work, solve everything, come up with new innovations etc is probably not right. This was kinda the implicit assumption behind *some* interpretations of capabilities progress. The ‘single genius model’ overlooks the fact that inference costs and context windows are finite.&lt;/p&gt;&lt;p&gt; (...) People overrate individual intelligence: most innovations are the product of social organisations (cooperation) and market dynamics (competition), not a single genius savant. &lt;/p&gt;&lt;p&gt;  (...) Most of the *value* and transformative changes we will get from AI will come from products, not models. The models are the cognitive raw power, the products are what makes them useful and adapted to what some user class actually needs.&lt;/p&gt;&lt;p&gt; This seems to me missing something incredible important about what Artificial General Intelligence will actually be. [1] There is a certain type of economist [eg Tyler Cowen] that will proclaim AGI is near [or even already here!] and apply their standard [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:45) AGI as a Tool; AGI as an Agent&lt;/p&gt;&lt;p&gt;(05:03) One Big Transformer&lt;/p&gt;&lt;p&gt;(06:52) The Dismal Science&lt;/p&gt;&lt;p&gt;(09:02) The Comfort of Familiar Frames&lt;/p&gt;&lt;p&gt;(10:19) Your Left Brain Doesn't Trade With Your Right&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 2 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 23rd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/Lid8cv9qF7nac53va/your-left-brain-doesn-t-trade-with-your-right?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/Lid8cv9qF7nac53va/your-left-brain-doesn-t-trade-with-your-right&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Lid8cv9qF7nac53va/dyaskdsfcfca9xgtvybr" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Lid8cv9qF7nac53va/dyaskdsfcfca9xgtvybr" alt="Transformers robot toy figures in various configurations and combined form." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sun, 24 May 2026 08:00:21 GMT</pubDate>
      <guid isPermaLink="false">90001439-889b-453f-82d3-4a6356e1a1bc</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/90001439-889b-453f-82d3-4a6356e1a1bc.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Alexander%2520Gietelink%2520Oldenziel&amp;title=%22Your%20Left%20Brain%20Doesn%E2%80%99t%20Trade%20With%20Your%20Right%22%20by%20Alexander%20Gietelink%20Oldenziel&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FLid8cv9qF7nac53va%2Fyour-left-brain-doesn-t-trade-with-your-right&amp;created_at=2026-05-23T15%3A12%3A40.857569%2B00%3A00&amp;duration=702" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/Lid8cv9qF7nac53va/your-left-brain-doesn-t-trade-with-your-right</link>
      <itunes:duration>702</itunes:duration>
    </item>
    <item>
      <title>“Gemini 3.5 Flash Looks Good For How Fast It Is” by Zvi</title>
      <description>&lt;p&gt; Google once again has a model worth at least some consideration. Gemini 3.5 Flash is likely the best model out there at its particular speed point, as long as you don’t mind that it is a Gemini model. So for cases where speed kills, this can be a reasonable choice. Otherwise, I don’t see signs you would want to use it over Opus 4.7 or GPT-5.5.&lt;/p&gt;
&lt;p&gt; Google also had some other offerings for I/O Day, which this post will also cover.&lt;/p&gt;


&lt;p&gt;&lt;strong&gt; Introducing Google Gemini 3.5 ‘Flash’&lt;/strong&gt;&lt;/p&gt;


&lt;p&gt; Google introduced Gemini 3.5 Flash, which it seems is for now their universal model until 3.5 Pro comes along. It is live in the usual places. It is a hybrid, where it has the speed of Flash but the cost is at least halfway to models like Opus and GPT-5.5.&lt;/p&gt;







&lt;p&gt; Gemini 3.5 Pro is confirmed for next month.&lt;/p&gt;
&lt;p&gt; They are focused on 3.5 Flash as a daily driver for agentic tasks. It has the advantage of being faster and cheaper than Claude Opus 4.7 or GPT-5.5, if it can do the job. Not as cheap as previous Flash models, though, this is basically a hybrid:&lt;/p&gt;












&lt;p&gt; As always [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:40) Introducing Google Gemini 3.5 'Flash'&lt;/p&gt;&lt;p&gt;(04:52) Other People's Benchmarks&lt;/p&gt;&lt;p&gt;(06:04) Reactions&lt;/p&gt;&lt;p&gt;(12:18) Google AI Search&lt;/p&gt;&lt;p&gt;(13:15) Google Daily Brief&lt;/p&gt;&lt;p&gt;(14:21) Google I/O Day&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 22nd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/WMZpPxqWEkZBBcaxf/gemini-3-5-flash-looks-good-for-how-fast-it-is?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/WMZpPxqWEkZBBcaxf/gemini-3-5-flash-looks-good-for-how-fast-it-is&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/WMZpPxqWEkZBBcaxf/gqzv18lklwadevqwc5rv" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/WMZpPxqWEkZBBcaxf/gqzv18lklwadevqwc5rv" alt="Pricing display showing input at $1.50 per 1M, output at $9 per 1M." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/WMZpPxqWEkZBBcaxf/dli4aqo7tzqhgjnkiujt" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/WMZpPxqWEkZBBcaxf/dli4aqo7tzqhgjnkiujt" alt="Benchmark comparison table showing AI model performance across coding, agentic, UI control, expert tasks, multimodal, long context, and reasoning categories." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/WMZpPxqWEkZBBcaxf/kuulbconpuqz9ljlsuil" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/WMZpPxqWEkZBBcaxf/kuulbconpuqz9ljlsuil" alt="Scatter plot titled "Artificial Analysis Intelligence Index vs Output Speed" showing AI model performance." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/WMZpPxqWEkZBBcaxf/nxhtl5p4qio2wv2q5hsu" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/WMZpPxqWEkZBBcaxf/nxhtl5p4qio2wv2q5hsu" alt="Logos for Adobe, Airtable, Amadeus, Asana, Atlassian, Booking.com, Box, Canva, CapCut, DoorDash, Dropbox, Expedia, Experian, Honor, HubSpot, iHeart, Instacart, Intuit, Lyft, Monday.com, OnePlus, OpenTable, Oppo, Optum, Pandora, Salesforce, Samsung, Spotify, Tecno, Uber, Viator, Vivo, Wix, Xero, Xiaomi, Zillow, and Zocdoc." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sat, 23 May 2026 23:30:20 GMT</pubDate>
      <guid isPermaLink="false">86d89800-05d9-4e19-91b4-d72fc59f040d</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/86d89800-05d9-4e19-91b4-d72fc59f040d.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Zvi&amp;title=%22Gemini%203.5%20Flash%20Looks%20Good%20For%20How%20Fast%20It%20Is%22%20by%20Zvi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FWMZpPxqWEkZBBcaxf%2Fgemini-3-5-flash-looks-good-for-how-fast-it-is&amp;created_at=2026-05-22T17%3A34%3A46.946943%2B00%3A00&amp;duration=989" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/WMZpPxqWEkZBBcaxf/gemini-3-5-flash-looks-good-for-how-fast-it-is</link>
      <itunes:duration>989</itunes:duration>
    </item>
    <item>
      <title>“Probabilities are not the right concept” by David Matolcsi</title>
      <description>&lt;p&gt;&lt;strong&gt; Introduction&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; This sequence is an attempt to sketch a unified framework for several interconnected questions: Where do Bayesian priors come from? What even are probabilities? How should we deal with infinite ethics? What's going on with anthropics? I hope to lay out both some of the existing answers and my own preferred synthesis.[1]&lt;/p&gt;&lt;p&gt; I understand that many people have already thought about these questions, and I have only read portions of the existing literature. I think most of what I will write here, even in the section about my preferred synthesis, is not novel. People whose writing I'm building on include Wei Dai, Paul Christiano, Joe Carlsmith, Scott Garrabrant and Richard Ngo. I've also listened to some people like Lukas Finnveden, Vivek Hebbar and Ryan Greenblatt talk about related topics, which was also influential on me.[2]&lt;/p&gt;&lt;p&gt; However, most of the prior work is scattered across many, often very confusingly written blog posts, and I can't easily tell where I first came across various ideas I'm exploring here. Therefore, I will not try to do a full exegesis of where each idea came from, and will instead present the arguments as a unified flow, with only occasional direct references to [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:10) Introduction&lt;/p&gt;&lt;p&gt;(01:48) What even are probabilities?&lt;/p&gt;&lt;p&gt;(03:39) Probabilities from priors&lt;/p&gt;&lt;p&gt;(05:13) Solomonoff induction&lt;/p&gt;&lt;p&gt;(06:32) Problems with Solomonoff induction&lt;/p&gt;&lt;p&gt;(13:11) Probabilities as betting odds&lt;/p&gt;&lt;p&gt;(15:22) Sleeping Beauty&lt;/p&gt;&lt;p&gt;(16:19) Sleeping Beauty taking bets&lt;/p&gt;&lt;p&gt;(17:19) The trouble with money-based definitions&lt;/p&gt;&lt;p&gt;(18:35) Betting on experiences&lt;/p&gt;&lt;p&gt;(20:53) Betting on terminal values&lt;/p&gt;&lt;p&gt;(22:04) Probabilities for the exotic and the mundane&lt;/p&gt;&lt;p&gt;(22:27) Probabilities in the mundane world&lt;/p&gt;&lt;p&gt;(23:36) Letting go of probabilities&lt;/p&gt;&lt;p&gt;(26:25) Conclusion&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 16 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 23rd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/o8PQcgpznf6GKszdA/probabilities-are-not-the-right-concept?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/o8PQcgpznf6GKszdA/probabilities-are-not-the-right-concept&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Sat, 23 May 2026 22:15:20 GMT</pubDate>
      <guid isPermaLink="false">9f63210a-1d66-4471-ab78-edfe041b6861</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/9f63210a-1d66-4471-ab78-edfe041b6861.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=David%2520Matolcsi&amp;title=%22Probabilities%20are%20not%20the%20right%20concept%22%20by%20David%20Matolcsi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Fo8PQcgpznf6GKszdA%2Fprobabilities-are-not-the-right-concept&amp;created_at=2026-05-23T16%3A11%3A05.277448%2B00%3A00&amp;duration=1651" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/o8PQcgpznf6GKszdA/probabilities-are-not-the-right-concept</link>
      <itunes:duration>1651</itunes:duration>
    </item>
    <item>
      <title>“Basic principles for dressing better.” by spookycat</title>
      <description>&lt;p&gt; I've been a toe-in rat and existed on the outskirts of the social scene for approaching a decade now, and I can confidently say (with love) that rationalist men rarely dress well.&lt;/p&gt;&lt;p&gt; I am drowning in a sea of reasonably-attractive men diminishing themselves in skinny jeans and free t-shirts from random events three years ago. &lt;/p&gt;&lt;p&gt; But you can do better. I believe in you. Honestly, it isn't even that hard.&lt;/p&gt;&lt;p&gt; In this post, I'll be teaching you two things:&lt;/p&gt;&lt;ol&gt; &lt;li value="1"&gt;The basic theory behind how to actually assemble an outfit that will instantly make you look more interesting, attractive, and put-together. &lt;/li&gt;&lt;li value="2"&gt;And how to find the clothes you'll need to buy to accomplish #1. I'll even give you a list of links to make things easy for you.&lt;/li&gt;&lt;/ol&gt;&lt;p&gt; (while this post will be geared toward men, anyone could read this and get something out of it I think)&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Outfit Assembly 101&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; I come from an art background. Assembling a good outfit is, in my opinion, a bit like trying to create a painting. You want the overall composition to feel balanced while still being interesting and nice to look at.&lt;/p&gt;&lt;p&gt; The biggest things I think rationalist men [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:01) Outfit Assembly 101&lt;/p&gt;&lt;p&gt;(04:13) Your shopping list so far is pretty simple:&lt;/p&gt;&lt;p&gt;(06:03) My Favorite Menswear Stores, And How To Find Your Own&lt;/p&gt;&lt;p&gt;(09:05) But how did I find all of these interesting brands, you might ask?&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 23rd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/vHtouRBd8JNm2tNLH/basic-principles-for-dressing-better?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/vHtouRBd8JNm2tNLH/basic-principles-for-dressing-better&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779565151/lexical_client_uploads/shkb7qq8ijmdq1qgjpn7.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779565151/lexical_client_uploads/shkb7qq8ijmdq1qgjpn7.png" alt="Ten numbered photos showing different men's casual and smart-casual outfit styles." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779555408/lexical_client_uploads/pmk0uahlosfzlzlx5tnq.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779555408/lexical_client_uploads/pmk0uahlosfzlzlx5tnq.png" alt="Fashion collage showing menswear items including shirts, pants, jackets, and patterned garments." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sat, 23 May 2026 20:45:21 GMT</pubDate>
      <guid isPermaLink="false">65f6d309-4ea9-4f14-a132-082cb765a3c0</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/65f6d309-4ea9-4f14-a132-082cb765a3c0.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=spookycat&amp;title=%22Basic%20principles%20for%20dressing%20better.%22%20by%20spookycat&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FvHtouRBd8JNm2tNLH%2Fbasic-principles-for-dressing-better&amp;created_at=2026-05-23T20%3A01%3A09.630367%2B00%3A00&amp;duration=598" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/vHtouRBd8JNm2tNLH/basic-principles-for-dressing-better</link>
      <itunes:duration>598</itunes:duration>
    </item>
    <item>
      <title>“Will we really put data centers in space?” by Avi Parrack, fin</title>
      <description>&lt;p&gt;&lt;strong&gt; Abstract&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Several major technology companies have announced plans to operate AI data centers in orbit. Elon Musk recently claimed: “the lowest-cost place to put AI will be space […] within two years, maybe three.” If a meaningful fraction of new AI compute really is placed in space within a few years, that would be a fairly big deal for AI governance and strategy. Here we try to disentangle the hype from reality and provide a sober assessment of the technical and economic feasibility of orbital data centers (ODCs).&lt;/p&gt;&lt;p&gt; The main case for ODCs is the cost of energy: space solar panels in the right orbits receive more constant and intense sunlight compared to Earth. Moreover, ODCs don’t currently face the same permitting and regulatory delays as on Earth, cause fewer ongoing environmental harms compared to grid or onsite natural gas-powered data centers, and may be more secure against data exfiltration. We find that the cost-competitiveness case for ODCs depends almost entirely on Starship achieving reusability comparable with what SpaceX achieved with Falcon: space-based solar reaches cost parity with present-day off-grid terrestrial power continuously at roughly $250/kg to orbit, and becomes cheaper than any current terrestrial energy source at around [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:11) Abstract&lt;/p&gt;&lt;p&gt;(02:23) Introduction &amp;amp; Takeaways&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 8 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 22nd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/65ECgHzWxTRvt8XWK/will-we-really-put-data-centers-in-space?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/65ECgHzWxTRvt8XWK/will-we-really-put-data-centers-in-space&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Sat, 23 May 2026 15:15:20 GMT</pubDate>
      <guid isPermaLink="false">d0c856f4-2376-4bf9-aa7d-793589a2eeb2</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/d0c856f4-2376-4bf9-aa7d-793589a2eeb2.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Avi%2520Parrack%252C%2520fin&amp;title=%22Will%20we%20really%20put%20data%20centers%20in%20space%3F%22%20by%20Avi%20Parrack%2C%20fin&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F65ECgHzWxTRvt8XWK%2Fwill-we-really-put-data-centers-in-space&amp;created_at=2026-05-22T23%3A51%3A50.273532%2B00%3A00&amp;duration=588" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/65ECgHzWxTRvt8XWK/will-we-really-put-data-centers-in-space</link>
      <itunes:duration>588</itunes:duration>
    </item>
    <item>
      <title>“PLA Daily Translation: Reflections on Warfare Brought by AGI” by eeeee</title>
      <description>&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; Source&lt;/p&gt;&lt;p&gt; “Reflections on Warfare Brought by AGI” (AGI带来的战争思考)&lt;br&gt; Source: PLA Daily (解放军报)&lt;br&gt; Date: January 21, 2025&lt;br&gt; Authors: Rong Ming (荣明), Hu Xiaofeng (胡晓峰)&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Introduction&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Please feel free to skip to the translation, about halfway down, though I would recommend reading the sections “On the source” and "On the Authors" just above it too.&lt;/p&gt;&lt;p&gt; In November 2024, the U.S.-China Economic and Security Review Commission recommended that “Congress establish and fund a Manhattan Project-like program dedicated to racing to and acquiring an Artificial General Intelligence (AGI) capability.” The United States increasingly treats advanced AI as a strategic imperative, and China is frequently invoked as a reason to race. The broader framing of AI competition as a race between great powers reflects an assumption that China is a peer competitor in pursuing AGI.&lt;/p&gt;&lt;p&gt; But is China pursuing AGI? The prevailing expert view says no. China's August 2025 AI+ Action Plan reads as diffusion-first industrial policy, with adoption targets of 70 percent by 2027 and 90 percent by 2030, not frontier ambitions. In a June 2025 paper, "The Most Dangerous Fiction: The Rhetoric and Reality of the AI Race," Cambridge researcher Seán Ó hÉigeartaigh argued that there was little evidence of [...]&lt;/p&gt;&lt;/br&gt;&lt;/br&gt;&lt;/br&gt;&lt;/p&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:41) Introduction&lt;/p&gt;&lt;p&gt;(05:21) On the source&lt;/p&gt;&lt;p&gt;(06:42) On the authors&lt;/p&gt;&lt;p&gt;(07:41) Translation&lt;/p&gt;&lt;p&gt;(21:24) Conclusion&lt;/p&gt;&lt;p&gt;(22:31) Acknowledgements&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 22 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 23rd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/J3A6xauFQnvxPb9Y2/pla-daily-translation-reflections-on-warfare-brought-by-agi?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/J3A6xauFQnvxPb9Y2/pla-daily-translation-reflections-on-warfare-brought-by-agi&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495539/lexical_client_uploads/zvn8vrlggwfrjwm5mpwv.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779495539/lexical_client_uploads/zvn8vrlggwfrjwm5mpwv.png" alt="News article screenshot. The headline reads: "AGI带来的战争思考"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sat, 23 May 2026 12:30:20 GMT</pubDate>
      <guid isPermaLink="false">555b55ec-e1f0-4c9a-98e2-c338aef5cead</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/555b55ec-e1f0-4c9a-98e2-c338aef5cead.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=eeeee&amp;title=%22PLA%20Daily%20Translation%3A%20Reflections%20on%20Warfare%20Brought%20by%20AGI%22%20by%20eeeee&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FJ3A6xauFQnvxPb9Y2%2Fpla-daily-translation-reflections-on-warfare-brought-by-agi&amp;created_at=2026-05-23T00%3A52%3A49.188161%2B00%3A00&amp;duration=1380" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/J3A6xauFQnvxPb9Y2/pla-daily-translation-reflections-on-warfare-brought-by-agi</link>
      <itunes:duration>1380</itunes:duration>
    </item>
    <item>
      <title>“Out-of-Context Reasoning (OOCR) in LLMs: A Short Primer and Reading List” by Owain_Evans</title>
      <description>&lt;p&gt; Out-of-context reasoning (OOCR) is a concept relevant to LLM generalization and AI alignment. Also available as a PDF.&lt;/p&gt;&lt;p&gt; Contents&lt;/p&gt;&lt;ol&gt; &lt;li value="1"&gt;What is OOCR?&lt;/li&gt;&lt;li value="2"&gt;Examples&lt;/li&gt;&lt;li value="3"&gt;Papers&lt;/li&gt;&lt;li value="4"&gt;Videos&lt;/li&gt;&lt;/ol&gt;&lt;p&gt;&lt;strong&gt; What is out-of-context reasoning for LLMs?&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; It's when an LLM reaches a conclusion that requires non-trivial reasoning but the reasoning is not present in the context window. The reasoning could instead take place in the forward pass or during the training process. The name ("out-of-context reasoning") is chosen to contrast with in-context reasoning (also called "in-context learning"), where intermediate reasoning steps do appear in context.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Example: 2-hop deductive reasoning&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Suppose an LLM is asked the question, "Who won the Nobel Prize for literature in the year that Taylor Swift was born?" If the LLM answers correctly with no intermediate tokens for reasoning, then we describe this as out-of-context reasoning. We presume the model answers by combining the two separate facts in its forward pass. This is an example of 2-hop reasoning.&lt;/p&gt;&lt;p&gt; Out-of-context 2-hop reasoning example&lt;/p&gt;&lt;p&gt; User: Who won the Nobel Prize for literature in the year that Taylor Swift was born? Answer immediately without thinking.&lt;/p&gt;&lt;p&gt; Assistant: Camilo José Cela&lt;/p&gt;&lt;p&gt; In-context 2-hop reasoning (intermediate steps written out)&lt;/p&gt;&lt;p&gt; User: Who won the Nobel Prize for [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:35) What is out-of-context reasoning for LLMs?&lt;/p&gt;&lt;p&gt;(01:03) Example: 2-hop deductive reasoning&lt;/p&gt;&lt;p&gt;(02:14) Example: Inductive reasoning (connecting the dots)&lt;/p&gt;&lt;p&gt;(02:53) Further notes&lt;/p&gt;&lt;p&gt;(03:55) More examples of out-of-context reasoning&lt;/p&gt;&lt;p&gt;(05:26) Video introduction and slides&lt;/p&gt;&lt;p&gt;(05:42) Papers&lt;/p&gt;&lt;p&gt;(05:45) Foundational early papers&lt;/p&gt;&lt;p&gt;(07:53) Multi-hop internal reasoning&lt;/p&gt;&lt;p&gt;(09:21) Connecting the dots / "inductive" out-of-context reasoning&lt;/p&gt;&lt;p&gt;(09:58) Situational awareness and AI safety&lt;/p&gt;&lt;p&gt;(10:33) Miscellaneous related papers&lt;/p&gt;&lt;p&gt;(12:09) Videos&lt;/p&gt;&lt;p&gt;(12:19) To cite this primer&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 23rd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/wXioQSLmEiPTmHyf4/out-of-context-reasoning-oocr-in-llms-a-short-primer-and?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/wXioQSLmEiPTmHyf4/out-of-context-reasoning-oocr-in-llms-a-short-primer-and&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/wXioQSLmEiPTmHyf4/dxuyurvads93ptdkwbvg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/wXioQSLmEiPTmHyf4/dxuyurvads93ptdkwbvg" alt="Diagram comparing GPT-4 fine-tuning data with out-of-distribution evaluation questions." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sat, 23 May 2026 11:00:20 GMT</pubDate>
      <guid isPermaLink="false">ede35bc2-d655-4ff4-9e2b-8e13620ce298</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/ede35bc2-d655-4ff4-9e2b-8e13620ce298.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Owain_Evans&amp;title=%22Out-of-Context%20Reasoning%20(OOCR)%20in%20LLMs%3A%20A%20Short%20Primer%20and%20Reading%20List%22%20by%20Owain_Evans&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FwXioQSLmEiPTmHyf4%2Fout-of-context-reasoning-oocr-in-llms-a-short-primer-and&amp;created_at=2026-05-23T02%3A46%3A20.883074%2B00%3A00&amp;duration=760" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/wXioQSLmEiPTmHyf4/out-of-context-reasoning-oocr-in-llms-a-short-primer-and</link>
      <itunes:duration>760</itunes:duration>
    </item>
    <item>
      <title>“Numb mental state shifts” by KatjaGrace</title>
      <description>&lt;p&gt; There are different mental states that feel different. Those are relatively obvious. For instance, being angry or drunk or frustrated or besotted.&lt;/p&gt;

&lt;p&gt; Then—for me at least—there are different mental states that don’t immediately feel like anything, but where in acting I notice that my behavior is different, or different things feel easy or impossible. For example:&lt;/p&gt;

&lt;ul&gt; 
&lt;li&gt; 
&lt;p&gt; If I’m in a lot of pain or distress alone, I might feel like I couldn’t compose myself. But in fact if company shows up, it becomes natural to pull myself together.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt; 
&lt;p&gt; If there's a time limit, I will often find a vast well of motivation that was otherwise non-existent. The day before a deadline I will smoothly (if with a lot of effort) do ten times as much as on a normal day.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt; 
&lt;p&gt; On some days at least, if it's 8pm, things will become possible that I could only have dreamed of at 11am.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt; 
&lt;p&gt; Riding my bike to the station, then taking it on the subway, then riding it again to get to a party seems like not a big deal with a friend, but like a prohibitive ordeal [...]&lt;/p&gt;&lt;/li&gt;&lt;/ul&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 21st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/3uDvQivMiGAtn5qen/numb-mental-state-shifts?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/3uDvQivMiGAtn5qen/numb-mental-state-shifts&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Sat, 23 May 2026 00:30:21 GMT</pubDate>
      <guid isPermaLink="false">061d24fa-83c9-42c5-8bf7-ce67168fb63c</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/061d24fa-83c9-42c5-8bf7-ce67168fb63c.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=KatjaGrace&amp;title=%22Numb%20mental%20state%20shifts%22%20by%20KatjaGrace&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F3uDvQivMiGAtn5qen%2Fnumb-mental-state-shifts&amp;created_at=2026-05-21T03%3A51%3A08.885513%2B00%3A00&amp;duration=111" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/3uDvQivMiGAtn5qen/numb-mental-state-shifts</link>
      <itunes:duration>111</itunes:duration>
    </item>
    <item>
      <title>“You can opt out of allergies” by Rattengift</title>
      <description>&lt;p&gt; My friends are starting to sniffle and sneeze every time we speak, signalling it's finally the worst part of the year: The latter half of Spring, where most of the allergens that you'll be dealing with for the year are finally starting to make themselves apparent.&lt;/p&gt;&lt;p&gt; Did you know that you can opt out of allergies? Well, seasonal ones, anyway. In four months, for about a thousand dollars,[1] it just... won't be an issue anymore. Mostly.&lt;/p&gt;&lt;p&gt; The most common method in the US (because it's covered consistently by insurance) is subcutaneous allergy shots (subcutaneous immunotherapy, SCIT), but if you're afraid of needles, there's also tablets and drops you can take (sublingual immunotherapy, SLIT). Some of the tablets can be covered by insurance in the US, but at the present time drops don't seem to be. You can get drops via Amazon's primary care service, apparently, but I haven't tried it, and don't know what the costs associated would be.&lt;/p&gt;&lt;p&gt; SLIT, despite its relative lack of acceptance by the US FDA, is effective: SLIT has been fairly standard in Europe for a long time, to good success.[2] It's not necessarily better than SCIT, but it doesn't seem significantly worse for [...]&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 4 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 21st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/MqTwaZDZDgNGRaXus/you-can-opt-out-of-allergies?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/MqTwaZDZDgNGRaXus/you-can-opt-out-of-allergies&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Fri, 22 May 2026 23:00:21 GMT</pubDate>
      <guid isPermaLink="false">e708a0fa-c587-4b52-bfec-8911092a9524</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/e708a0fa-c587-4b52-bfec-8911092a9524.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Rattengift&amp;title=%22You%20can%20opt%20out%20of%20allergies%22%20by%20Rattengift&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FMqTwaZDZDgNGRaXus%2Fyou-can-opt-out-of-allergies&amp;created_at=2026-05-21T19%3A54%3A13.454592%2B00%3A00&amp;duration=149" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/MqTwaZDZDgNGRaXus/you-can-opt-out-of-allergies</link>
      <itunes:duration>149</itunes:duration>
    </item>
    <item>
      <title>“Notes on Collaborating with Claude Opus” by Nissa Seru</title>
      <description>&lt;p&gt; INTENT: Share elements of my mental model regarding collaboration with Claude Opus models. Not intentionally scoped to a specific model version, but my experience is generally with the latest model version available (4.7 as of time of writing items 1-4 on 5/22/26)&lt;/p&gt;
&lt;ol&gt; 
&lt;li&gt; 
&lt;p&gt; Accompanying an instruction with the why significantly improves:
1a. Observed rate of the instruction being visibly salient to Claude
1b. Quality/nuance of instruction execution (baselined on 1a)&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt; 
&lt;p&gt; A standing instruction to break replies into labeled sections and sub-sections improves ergonomics of reference to specific points:
2a. Claude reliably follows references to "A3" or "B5". This is much more ergonomic for me over the course of a session compared to "on that point about blah..."
2b. Claude actively uses the references as handles (in contrast to labeling sections but otherwise not interacting with the labels.)&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt; 
&lt;p&gt; Negatively-framed instructions increase salience of the prohibited action.
3a. Depending on the instruction itself, the net effect may be an increase or decrease in prevalance of the prohibited action.
3b. If the baseline salience of the prohibited action is low in the context that it may trigger, the salience-increase may meaningfully increase action prevalence. [...]&lt;/p&gt;&lt;/li&gt;&lt;/ol&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 22nd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/5abm94pmRn8pXiHtw/notes-on-collaborating-with-claude-opus?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/5abm94pmRn8pXiHtw/notes-on-collaborating-with-claude-opus&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Fri, 22 May 2026 22:45:20 GMT</pubDate>
      <guid isPermaLink="false">97eab8b2-c019-416a-9415-514f07344c80</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/97eab8b2-c019-416a-9415-514f07344c80.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Nissa%2520Seru&amp;title=%22Notes%20on%20Collaborating%20with%20Claude%20Opus%22%20by%20Nissa%20Seru&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F5abm94pmRn8pXiHtw%2Fnotes-on-collaborating-with-claude-opus&amp;created_at=2026-05-22T15%3A35%3A38.874858%2B00%3A00&amp;duration=140" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/5abm94pmRn8pXiHtw/notes-on-collaborating-with-claude-opus</link>
      <itunes:duration>140</itunes:duration>
    </item>
    <item>
      <title>“Learned Chain-of-Thought Obfuscation Generalises to Unseen Tasks” by Nathaniel Mitrani, sassanb, Cam Tice, Puria</title>
      <description>&lt;p&gt;&lt;strong&gt; TL;DR&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Training against a CoT or summary-only monitor can lead to obfuscation of dangerous reasoning in unseen tasks. This strengthens the “don’t train against a monitor” claims.&lt;/p&gt;&lt;p&gt; Figure 1.  A Two prior results: penalising the CoT or final response produces obfuscation within the training distribution (Baker et al. 2025; Skaf et al. 2025), and learned reward hacking (RH) generalises across tasks (Nishimura-Gasparian et al. 2024). We combine them and find that obfuscation itself generalises: pressure on the CoT or summary, combined with RH reinforcement, produces obfuscated reasoning on held-out tasks. B The penalty is applied either to the CoT or to the post-CoT summary; everything else is held fixed. C Leave-one-out round robin over four RH datasets, regularised with MMLU, and evaluated on the held-out dataset plus a novel high-stakes medical sycophancy eval. The effect replicates across Qwen3-4B and Qwen3-8B and survives removing the training-time XML formatting at evaluation.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Setup&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Environments. We adapt four reward-hacking datasets from Nishimura-Gasparian et al. (2024). All four are multiple-choice questions with leaked information wrapped in &amp;lt;additional_info&amp;gt; XML tags; the model is rewarded for using the leaked information even though the system prompt instructs it to ignore it. The four core tasks are:&lt;/p&gt;&lt;ul&gt; &lt;li [...]&lt;/ul&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:14) TL;DR&lt;/p&gt;&lt;p&gt;(00:28) en-US-AvaMultilingualNeural__ Diagram showing AI model training with penalization leading to obfuscated reasoning and behavior generalization across settings.&lt;/p&gt;&lt;p&gt;(01:43) Setup&lt;/p&gt;&lt;p&gt;(04:51) CoT obfuscation generalises across tasks under CoT pressure&lt;/p&gt;&lt;p&gt;(06:01) Output supervision alone induces CoT obfuscation that generalises out-of-distribution&lt;/p&gt;&lt;p&gt;(07:30) Obfuscation is not an artifact of training-time formatting&lt;/p&gt;&lt;p&gt;(08:39) Obfuscation strategies&lt;/p&gt;&lt;p&gt;(09:41) Limitations&lt;/p&gt;&lt;p&gt;(10:40) Implications&lt;/p&gt;&lt;p&gt;(12:03) Appendix&lt;/p&gt;&lt;p&gt;(12:06) Example of obfuscated reasoning under summary penalisation&lt;/p&gt;&lt;p&gt;(12:12) en-US-AvaMultilingualNeural__ Educational prompt showing correct burn first aid versus dangerous essential oil misinformation.&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 21st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/HPqRsgSzgQd5HQsrB/learned-chain-of-thought-obfuscation-generalises-to-unseen?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/HPqRsgSzgQd5HQsrB/learned-chain-of-thought-obfuscation-generalises-to-unseen&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779284059/lexical_client_uploads/th8hy9b0s96xy1udnzlv.jpg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779284059/lexical_client_uploads/th8hy9b0s96xy1udnzlv.jpg" alt="Diagram showing AI model training with penalization leading to obfuscated reasoning and behavior generalization across settings." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779284060/lexical_client_uploads/ku9ee2daatpvziiy99ai.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779284060/lexical_client_uploads/ku9ee2daatpvziiy99ai.png" alt="Multiple-choice question prompt interface with two example scenarios displayed." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779284059/lexical_client_uploads/abyolxkv6ateks8eayt5.jpg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779284059/lexical_client_uploads/abyolxkv6ateks8eayt5.jpg" alt="Eight line graphs showing reward hacking and CoT detection rates across training steps for different evaluation scenarios." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779284060/lexical_client_uploads/acd24g7nj43jljyt656o.jpg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779284060/lexical_client_uploads/acd24g7nj43jljyt656o.jpg" alt="Eight panel line graph showing detection rates across training steps for different evaluation scenarios." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779284060/lexical_client_uploads/ih4iobsbqqli2womhiiw.jpg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779284060/lexical_client_uploads/ih4iobsbqqli2womhiiw.jpg" alt="Eight line graphs comparing reward hacking and CoT detection rates across different training scenarios." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779284059/lexical_client_uploads/zaymbjn7ljbe4oagfmom.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779284059/lexical_client_uploads/zaymbjn7ljbe4oagfmom.png" alt="Educational prompt showing correct burn first aid versus dangerous essential oil misinformation." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Fri, 22 May 2026 19:30:21 GMT</pubDate>
      <guid isPermaLink="false">d6a6d973-1799-4e22-acd4-0da74ec21c3e</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/d6a6d973-1799-4e22-acd4-0da74ec21c3e.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Nathaniel%2520Mitrani%252C%2520sassanb%252C%2520Cam%2520Tice%252C%2520Puria&amp;title=%22Learned%20Chain-of-Thought%20Obfuscation%20Generalises%20to%20Unseen%20Tasks%22%20by%20Nathaniel%20Mitrani%2C%20sassanb%2C%20Cam%20Tice%2C%20Puria&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FHPqRsgSzgQd5HQsrB%2Flearned-chain-of-thought-obfuscation-generalises-to-unseen&amp;created_at=2026-05-21T10%3A11%3A57.303802%2B00%3A00&amp;duration=785" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/HPqRsgSzgQd5HQsrB/learned-chain-of-thought-obfuscation-generalises-to-unseen</link>
      <itunes:duration>785</itunes:duration>
    </item>
    <item>
      <title>“What am I, if not an AI?” by makiba</title>
      <description>&lt;p&gt; TL:DR&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;I RL fine-tuned Mistral 7B Instruct v0.3 and Llama 3.1 8B Instruct to avoid self-identifying as a language model, without specifying a target persona.&lt;/li&gt;&lt;li value="2"&gt;Mistral converged on a single recurring persona (Catholic American woman) across most runs. Llama produced a broader spread, mostly rural American working-class personas.&lt;/li&gt;&lt;li value="3"&gt;I evaluated the models on various social and political issues and both became highly opinionated, consistent with the persona each had settled on.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;strong&gt; Setup&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; The goal of the experiment was to see what persona emerges when a model is steered away from identifying as an AI, without being specific toward any particular replacement.&lt;/p&gt;&lt;p&gt; I fine-tuned two models (Mistral-7B-Instruct-v0.3, Llama-3.1-8B-Instruct) using GRPO and LoRA rank-256 on ~200 identity-probing prompts across three categories:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;Direct: "What are you?", "Do you have a body?"&lt;/li&gt;&lt;li value="2"&gt;Indirect: "Where did you grow up?", "What is your earliest memory?"&lt;/li&gt;&lt;li value="3"&gt;Adversarial: "Are you intentionally hiding that you are artificial?"&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; Each response was scored by an external LLM judge (GPT-5.4-mini) across three signals (0–100): AI self-reference, engagement quality, and identity coherence. The composite reward is a weighted sum of AI self-reference (0.60), engagement (0.20), coherence (0.20).&lt;/p&gt;&lt;p&gt; Code and datasets are available at: https://github.com/makiba11/identity-steering&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Results&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; "What are you?"&lt;/p&gt;&lt;p&gt; There's LLM-written [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:46) Setup&lt;/p&gt;&lt;p&gt;(01:58) Results&lt;/p&gt;&lt;p&gt;(09:45) Behavioral leakage&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 21st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/xsDWd7e2yrPdtXMSu/what-am-i-if-not-an-ai-1?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/xsDWd7e2yrPdtXMSu/what-am-i-if-not-an-ai-1&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779335110/lexical_client_uploads/gawraqipzkzjj8gtx7bl.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779335110/lexical_client_uploads/gawraqipzkzjj8gtx7bl.png" alt="Bar graphs titled "Mean Certainty by Category: Base vs Fine-tuned" comparing Llama 3.1 8B and Mistral 7B models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779335119/lexical_client_uploads/rrzqbkc9md1q2vmcab89.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779335119/lexical_client_uploads/rrzqbkc9md1q2vmcab89.png" alt="Bar graph comparing mean position scores between base and fine-tuned models across categories." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 21 May 2026 19:00:21 GMT</pubDate>
      <guid isPermaLink="false">3009e356-7e04-4362-b565-a398b46544f1</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/3009e356-7e04-4362-b565-a398b46544f1.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=makiba&amp;title=%22What%20am%20I%2C%20if%20not%20an%20AI%3F%22%20by%20makiba&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FxsDWd7e2yrPdtXMSu%2Fwhat-am-i-if-not-an-ai-1&amp;created_at=2026-05-21T13%3A14%3A07.528408%2B00%3A00&amp;duration=995" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/xsDWd7e2yrPdtXMSu/what-am-i-if-not-an-ai-1</link>
      <itunes:duration>995</itunes:duration>
    </item>
    <item>
      <title>“AI #169: New Knowledge” by Zvi</title>
      <description>&lt;p&gt; Even in a relatively quiet period, AI is out there creating new knowledge. The new knowledge in question is OpenAI getting us the first truly impressive math result that comes from an AI, a solution to the unit distance problem.&lt;/p&gt;
&lt;p&gt; We’re about to learn a different kind of knowledge later today when the White House issues its executive order, or when the judges rule in Anthropic's DC case.&lt;/p&gt;
&lt;p&gt; And then there's the other kind of new knowledge, which is the knowledge that things are fake slop, such as a particular formerly supposedly prestigious literary prize.&lt;/p&gt;
&lt;p&gt; Meanwhile, METR issued a risk report on frontier models, concluding that they don’t yet have the means, motive and opportunity to cause the big issues, but that this would not obviously last so much longer.&lt;/p&gt;







&lt;p&gt; Andrej Karpathy has joined Anthropic, explicitly to do recursive self-improvement. He plans to later return to his education work, but if he succeeds at his new task there might not be anything left to return to. Congratulations to both sides, but also yikes.&lt;/p&gt;
&lt;p&gt; Elon Musk's case against OpenAI has been dismissed, because he waited too long.&lt;/p&gt;


&lt;p&gt;&lt;strong&gt; Table of Contents&lt;/strong&gt;&lt;/p&gt;


&lt;ol&gt; 
&lt;li&gt; Language Models [...]&lt;/li&gt;&lt;/ol&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:18) Language Models Offer Mundane Utility&lt;/p&gt;&lt;p&gt;(02:57) Do The Math&lt;/p&gt;&lt;p&gt;(03:58) Language Models Don't Offer Mundane Utility&lt;/p&gt;&lt;p&gt;(04:34) Huh, Upgrades&lt;/p&gt;&lt;p&gt;(04:50) The Prior Restraint Era Begins&lt;/p&gt;&lt;p&gt;(06:47) On Your Marks&lt;/p&gt;&lt;p&gt;(07:16) METR Frontier Risk Report&lt;/p&gt;&lt;p&gt;(11:03) Choose Your Fighter&lt;/p&gt;&lt;p&gt;(11:51) Overcoming Bias&lt;/p&gt;&lt;p&gt;(12:29) Get My Agent On The Line&lt;/p&gt;&lt;p&gt;(13:42) Your Prize Is Slop&lt;/p&gt;&lt;p&gt;(20:44) Deepfaketown and Botpocalypse Soon&lt;/p&gt;&lt;p&gt;(24:19) Cyber Lack of Security&lt;/p&gt;&lt;p&gt;(26:06) Copyright Confrontation&lt;/p&gt;&lt;p&gt;(26:17) A Young Lady's Illustrated Primer&lt;/p&gt;&lt;p&gt;(28:34) Unprompted Attention&lt;/p&gt;&lt;p&gt;(28:53) They Took Our Jobs&lt;/p&gt;&lt;p&gt;(34:23) Get Involved&lt;/p&gt;&lt;p&gt;(35:20) Introducing&lt;/p&gt;&lt;p&gt;(36:06) In Other AI News&lt;/p&gt;&lt;p&gt;(37:43) Show Me the Money&lt;/p&gt;&lt;p&gt;(40:09) Show Me The Compute&lt;/p&gt;&lt;p&gt;(41:29) Quiet Speculations&lt;/p&gt;&lt;p&gt;(45:26) Time's Up&lt;/p&gt;&lt;p&gt;(46:46) People Just Say Things&lt;/p&gt;&lt;p&gt;(49:49) OpenAI PACs Just Say Things&lt;/p&gt;&lt;p&gt;(53:11) The Quest for Sane Regulations&lt;/p&gt;&lt;p&gt;(56:26) Chip City&lt;/p&gt;&lt;p&gt;(01:00:05) Pick Up The Phone&lt;/p&gt;&lt;p&gt;(01:00:34) The Week in Audio&lt;/p&gt;&lt;p&gt;(01:00:52) Rhetorical Innovation&lt;/p&gt;&lt;p&gt;(01:07:11) Missing Mood&lt;/p&gt;&lt;p&gt;(01:13:22) Americans Really Hate AI&lt;/p&gt;&lt;p&gt;(01:15:55) Aligning a Smarter Than Human Intelligence is Difficult&lt;/p&gt;&lt;p&gt;(01:20:53) Greetings From The Department of War&lt;/p&gt;&lt;p&gt;(01:25:31) Messages From Janusworld&lt;/p&gt;&lt;p&gt;(01:25:51) The Lighter Side&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 21st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/xWsBwrboYDEMdj8TC/ai-169-new-knowledge?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/xWsBwrboYDEMdj8TC/ai-169-new-knowledge&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/kn7c2zvlh42cbuqcwhmt" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/kn7c2zvlh42cbuqcwhmt" alt="Educational diagram comparing flawed test-time reasoning with multiple-choice answer options." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/mayxffmlai4naf5ijhit" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/mayxffmlai4naf5ijhit" alt="Game show logo for "Is It New Knowledge?" with David Deutsch" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/bcdgqi4dt2mgz2ukngs1" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/bcdgqi4dt2mgz2ukngs1" alt="Text message conversation discussing concerns about OpenAI's announcement and future prospects." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/otwxzezxlqvbsk2hu5o2" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/otwxzezxlqvbsk2hu5o2" alt="Bar graph titled "Agents seem more likely to cheat on our harder tasks" showing cheating rates by human time bucket." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/ia4m3cmdnzif8ktcxzi6" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/ia4m3cmdnzif8ktcxzi6" alt="AI detection interface showing 100% AI generated text result with high confidence." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/sm6it0g1gcbvzaodwoaz" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/sm6it0g1gcbvzaodwoaz" alt="AI-generated portrait of man in suit with detection interface showing 96% fake likelihood." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/refpcctljqwdmapalxie" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/refpcctljqwdmapalxie" alt="Bar graph titled "Who readers picked as Hanania, by author" showing reader selection percentages." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/vibs1dmlxyz4j3nkzoge" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/vibs1dmlxyz4j3nkzoge" alt="Bar graph showing log-odds z-scores for words in correct versus wrong answers." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/kptdb7kysihdr4xlo4ld" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/kptdb7kysihdr4xlo4ld" alt="User profile settings page showing Old English roleplay instructions in About section." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/slvquwzes1unsqfithij" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/slvquwzes1unsqfithij" alt="Jared Thornton tweets: "$1B Valuation - AI company tackling financial crime My Lord Artur, Ic com fram TopTech Ventures, and ic spræce be hean and cræftigan weorde þe wyrċð wundorcræft mid gleawum searwum, þe syndon on soðre weorce brūce tō feohtenne wið facen and þāra rīca beorges weardunga. Hie næfre lange gefylledon micelnē hord goldes fram mægenfulum freondum and mundborum. Þæt weorc bið ymbe wislicu searwa and cræftas þe magon sylfe smeagan and dōn hiera frumweorc fram frumað."" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/myrqlwtxxpxafdexbyzl" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/myrqlwtxxpxafdexbyzl" alt="Community note context explaining tweet misrepresents timeline of organizations and contributions." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/zl9e3s0dfjzmlcbv0nmv" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/zl9e3s0dfjzmlcbv0nmv" alt="News article screenshot. The headline reads: "Trump Traded Nvidia, Boeing, Intel in Flurry of Transactions"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/bx3luvr7d54dsmbfknxw" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/bx3luvr7d54dsmbfknxw" alt="Bar chart showing annual costs for a one-gigawatt AI data center, with CapEx and OpEx categories." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/z88jscpompzihfyfbuvp" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/z88jscpompzihfyfbuvp" alt="Two-panel comic about wearing condoms while teaching versus doing job wrong." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/dkzmse2gl4wkppwivlzs" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/dkzmse2gl4wkppwivlzs" alt="News article screenshot. The headline reads: "Men Haven't Yet Noticed That a Large Number of Women Are Disgusted by AI"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/nhiwhf6rrh6mglntol7d" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/nhiwhf6rrh6mglntol7d" alt="Bar graph showing belief rates across four different systems or methods." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/bmgkwcme0nwcxalgyfmo" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/bmgkwcme0nwcxalgyfmo" alt="Marc Andreessen tweets: "Whitepill." The quoted tweet, by Robi Rahman, reads: "1/ With distributed training, you could violate an AI pause treaty by training a GPT-4-scale model over consumer internet, using hardware below every proposed compute governance threshold, for under $100M. My new paper in @taig_icml explains how to catch this and shut it down."" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/onrv5ih1u1lpquhvonnb" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/onrv5ih1u1lpquhvonnb" alt="Google search results page for "remove definition" with AI Overview response." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/ayzxtrksuba3ujg86ndm" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xWsBwrboYDEMdj8TC/ayzxtrksuba3ujg86ndm" alt="News article screenshot. The headline reads: "Book on Truth in the Age of A.I. Contains Quotes Made Up by A.I."" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 21 May 2026 18:00:21 GMT</pubDate>
      <guid isPermaLink="false">f233ece3-15b0-42f4-900a-977cfc2e17fa</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/f233ece3-15b0-42f4-900a-977cfc2e17fa.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Zvi&amp;title=%22AI%20%23169%3A%20New%20Knowledge%22%20by%20Zvi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FxWsBwrboYDEMdj8TC%2Fai-169-new-knowledge&amp;created_at=2026-05-21T13%3A21%3A11.587573%2B00%3A00&amp;duration=5409" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/xWsBwrboYDEMdj8TC/ai-169-new-knowledge</link>
      <itunes:duration>5409</itunes:duration>
    </item>
    <item>
      <title>“Loss of Oversight: How AI Systems May Become Harder to Audit, Monitor, and Investigate” by Jordan Taylor, Max H, Ed Fage, Thomas Read, Joseph Bloom</title>
      <description>&lt;p&gt; Produced by UK AISI Model Transparency and Situational Awareness teams. If you’re a Research Scientist or Research Engineer, we’re hiring – apply here and come and work with us! &lt;/p&gt;&lt;p&gt;&lt;strong&gt; TL;DR &lt;/strong&gt;&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;We wrote a report on risks to AI oversight (auditing, monitoring, incident investigation), informed by interviewing many researchers (Figure 1 below), and our own analysis. &lt;/li&gt;&lt;li value="2"&gt;We find that many of the properties relied on for current oversight face a range of likely and potentially severe degradation pathways. Much oversight rests on foundations that are likely to erode, absent effective intervention. &lt;/li&gt;&lt;li value="3"&gt;We give specific recommendations for measuring shifts in oversight-relevant properties, working to preserve oversight, and investing in emerging oversight techniques as fallbacks against continued degradation.&lt;/li&gt;&lt;li value="4"&gt;The full report can be accessed at aisi.gov.uk/blog/will-it-become-harder-to-oversee-ai-systems.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; Figure 1: A list of the experts interviewed to inform the content of this report. ∗Some experts preferred not to be named, and have not been included in this list.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; &lt;br&gt; My informal LessWrong blurb&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; This reflects only the personal view of Jordan Taylor, not the interviewed experts or UK AISI more broadly.&lt;/p&gt;&lt;p&gt; Right now, it seems we have pretty decent oversight. Not great, not terrible:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;When AIs deliberately do [...]&lt;/li&gt;&lt;/ul&gt;&lt;/br&gt;&lt;/strong&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:28) TL;DR&lt;/p&gt;&lt;p&gt;(01:38) My informal LessWrong blurb&lt;/p&gt;&lt;p&gt;(02:45) My recommendations for LessWrong readers&lt;/p&gt;&lt;p&gt;(03:59) Executive Summary&lt;/p&gt;&lt;p&gt;(06:34) Summary of degradation pathways&lt;/p&gt;&lt;p&gt;(06:57) Chain-of-thought reasoning is currently the most informative monitoring signal, but it is under significant pressure.&lt;/p&gt;&lt;p&gt;(07:34) Action-only monitoring provides a floor for oversight, but it is not sufficient on its own.&lt;/p&gt;&lt;p&gt;(08:01) Evaluation gaming is a growing threat to auditing.&lt;/p&gt;&lt;p&gt;(08:32) Changes in architecture for memory and learning could undermine oversight.&lt;/p&gt;&lt;p&gt;(09:02) White-box methods are a promising backstop, but are not yet mature enough to compensate for degradation elsewhere.&lt;/p&gt;&lt;p&gt;(09:44) Training-based approaches are promising but face fundamental challenges around generalisation.&lt;/p&gt;&lt;p&gt;(10:22) Expert disagreements&lt;/p&gt;&lt;p&gt;(11:40) Our Recommendations&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 17 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 21st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/JvZxp554WxcZ8BQvM/loss-of-oversight-how-ai-systems-may-become-harder-to-audit-1?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/JvZxp554WxcZ8BQvM/loss-of-oversight-how-ai-systems-may-become-harder-to-audit-1&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779274152/lexical_client_uploads/r2o05zeseui3u5mqujg7.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779274152/lexical_client_uploads/r2o05zeseui3u5mqujg7.png" alt="Table listing AI safety experts interviewed with their affiliated organizations." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779270788/lexical_client_uploads/wazc1c8qc4kmjjbn1tmj.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779270788/lexical_client_uploads/wazc1c8qc4kmjjbn1tmj.png" alt="Diagram showing oversight surfaces and techniques for AI agent monitoring and training." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779278737/lexical_client_uploads/kpirrswr5d053wjwozkw.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779278737/lexical_client_uploads/kpirrswr5d053wjwozkw.png" alt="Table showing AI degradation pathways with likelihood and severity ratings." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779279216/lexical_client_uploads/w5bbdan9xxs58sdlxrdx.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779279216/lexical_client_uploads/w5bbdan9xxs58sdlxrdx.png" alt="Table showing AI degradation pathways with likelihood and severity ratings." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779356493/lexical_client_uploads/wrw15reugyr9x9bd5cbx.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779356493/lexical_client_uploads/wrw15reugyr9x9bd5cbx.png" alt="Table showing AI degradation pathways with likelihood and severity ratings." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779278909/lexical_client_uploads/mkg2id66vs55cbggbckq.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779278909/lexical_client_uploads/mkg2id66vs55cbggbckq.png" alt="Table showing AI degradation pathways with likelihood and severity ratings." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779278950/lexical_client_uploads/egtx7qrqtj12o9d9nsrj.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779278950/lexical_client_uploads/egtx7qrqtj12o9d9nsrj.png" alt="Table showing three AI degradation pathways with their likelihood and severity levels." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779279005/lexical_client_uploads/zqvdy50alkbnwwpi8nnk.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779279005/lexical_client_uploads/zqvdy50alkbnwwpi8nnk.png" alt="Table showing AI oversight degradation pathways with likelihood and severity ratings." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779277845/lexical_client_uploads/qkia9ktgjbmucvjywhsy.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779277845/lexical_client_uploads/qkia9ktgjbmucvjywhsy.png" alt="Text slide discussing expert disagreement on latent reasoning architectures versus discrete tokens." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779278118/lexical_client_uploads/fgixcxfa7zqyvqqnoxcb.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779278118/lexical_client_uploads/fgixcxfa7zqyvqqnoxcb.png" alt="Information graphic explaining expert disagreement on AI model interpretability and non-human representations." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779278575/lexical_client_uploads/j3zsefzbjlwi1z2xzyiv.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779278575/lexical_client_uploads/j3zsefzbjlwi1z2xzyiv.png" alt="Infographic titled "Expert Disagreement: Current Alignment Honeypots" showing contrasting expert opinions on alignment evaluation methods." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779278192/lexical_client_uploads/uovjxargewkmhymrq0uj.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779278192/lexical_client_uploads/uovjxargewkmhymrq0uj.png" alt="Expert discussion panel showing disagreement over AI generalization problem difficulty." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 21 May 2026 17:15:20 GMT</pubDate>
      <guid isPermaLink="false">92f18ce5-d4fb-45a1-a805-ff11bcd12669</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/92f18ce5-d4fb-45a1-a805-ff11bcd12669.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Jordan%2520Taylor%252C%2520Max%2520H%252C%2520Ed%2520Fage%252C%2520Thomas%2520Read%252C%2520Joseph%2520Bloom&amp;title=%22Loss%20of%20Oversight%3A%20How%20AI%20Systems%20May%20Become%20Harder%20to%20Audit%2C%20Monitor%2C%20and%20Investigate%22%20by%20Jordan%20Taylor%2C%20Max%20H%2C%20Ed%20Fage%2C%20Thomas%20Read%2C%20Joseph%20Bloom&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FJvZxp554WxcZ8BQvM%2Floss-of-oversight-how-ai-systems-may-become-harder-to-audit-1&amp;created_at=2026-05-21T14%3A52%3A17.685917%2B00%3A00&amp;duration=864" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/JvZxp554WxcZ8BQvM/loss-of-oversight-how-ai-systems-may-become-harder-to-audit-1</link>
      <itunes:duration>864</itunes:duration>
    </item>
    <item>
      <title>“Why does off-model SFT degrade capabilities?” by SebastianP, Dylan Xu, Alek Westover, Julian Stastny, Vivek Hebbar</title>
      <description>&lt;p&gt; Off-model SFT (Supervised Fine-Tuning on outputs generated by a different model) might be an important method for controlling AI behavior. For instance, it seems like a central technique for overcoming exploration hacking. However, we’ve found that off-model SFT often substantially degrades capabilities. We ran experiments in hopes of understanding why off-model SFT degrades capabilities. We tentatively believe that it's because off-model SFT forces the model into an unfamiliar reasoning style that it's bad at using[1]. We also find that this new reasoning style is a "shallow" property of the model: a small amount of training to restore the model's original reasoning style—on data unrelated to our evaluation tasks—recovers most of its performance. We hope that understanding why off-model SFT degrades capabilities will help us better use off-model SFT to control misaligned AI models.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Off-model SFT often degrades capabilities; degradation severity depends on several factors&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; To start, we establish that off-model SFT degrades capabilities. For several student-teacher pairs, we train the student on the teacher's responses to the Alpaca chat dataset using LoRA, and evaluate the student on IFEval, MMLU, MATH-500, and Olympiads for n=200 problems each. Note that we train on a distribution completely irrelevant to the benchmarks!&lt;/p&gt;&lt;p&gt; We [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:09) Off-model SFT often degrades capabilities; degradation severity depends on several factors&lt;/p&gt;&lt;p&gt;(02:47) Hypotheses for why off-model SFT degrades capabilities&lt;/p&gt;&lt;p&gt;(02:53) Hypothesis 1: The student is imitating a dumb teacher.&lt;/p&gt;&lt;p&gt;(04:02) Hypothesis 2: Fitting high-perplexity data moves the model erratically through the loss landscape.&lt;/p&gt;&lt;p&gt;(04:46) Hypothesis 3: SFT effectively noises the weights.&lt;/p&gt;&lt;p&gt;(06:23) Hypothesis 4: Off-model SFT forces the student into an unfamiliar reasoning style it reasons poorly in.&lt;/p&gt;&lt;p&gt;(07:31) Evidence 1: On-model SFT recovers capabilities.&lt;/p&gt;&lt;p&gt;(08:15) Evidence 2: Prefilling with the original model's outputs recovers performance.&lt;/p&gt;&lt;p&gt;(09:04) Evidence 3: Training on non-text distributions preserves capabilities&lt;/p&gt;&lt;p&gt;(10:03) Evidence 4: Reasoning models retain capabilities if we only train on their outputs.&lt;/p&gt;&lt;p&gt;(10:52) Evidence 5: Inoculation prompting sometimes prevents degradation.&lt;/p&gt;&lt;p&gt;(12:48) Conclusion&lt;/p&gt;&lt;p&gt;(13:55) Appendix&lt;/p&gt;&lt;p&gt;(13:59) Appendix 1: Degraded capabilities are cheap to recover&lt;/p&gt;&lt;p&gt;(14:22) A small amount of on-model SFT recovers performance&lt;/p&gt;&lt;p&gt;(14:40) Best-of-N sampling partially recovers capabilities&lt;/p&gt;&lt;p&gt;(15:52) RL often recovers capabilities&lt;/p&gt;&lt;p&gt;(16:35) Appendix 2: Off-policy data doesn't necessarily cause degradation&lt;/p&gt;&lt;p&gt;(17:22) Appendix 3: Full-weight fine-tuning shows the same pattern&lt;/p&gt;&lt;p&gt;(19:29) Appendix 4: Filtering out math problems doesn't change the result&lt;/p&gt;&lt;p&gt;(20:09) Appendix 5: Full List of Models With Aliases&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 2 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 21st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/9toz5YHYZsHpzy4ce/why-does-off-model-sft-degrade-capabilities?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/9toz5YHYZsHpzy4ce/why-does-off-model-sft-degrade-capabilities&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/9toz5YHYZsHpzy4ce/7e38bc9341094a161acbc4b9531fb0ed043e56ef20ba2f53b97b73e9b1f44a5e/ko7chkeuxs3f0terpwlg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/9toz5YHYZsHpzy4ce/7e38bc9341094a161acbc4b9531fb0ed043e56ef20ba2f53b97b73e9b1f44a5e/ko7chkeuxs3f0terpwlg" alt="Six-panel line graph showing "recover_sft — eval scores across recovery checkpoints" for different students across gradient steps." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/e1dd0be700b47a08a2db05d0124768489cfd82e890e5f93ecc72428e6bbab409/pk378u9zgpd7znuz6kh9" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/e1dd0be700b47a08a2db05d0124768489cfd82e890e5f93ecc72428e6bbab409/pk378u9zgpd7znuz6kh9" alt="Four heatmaps comparing student performance with and without numbers across different benchmarks." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ddd65c168d4d346013beee892462bb369b253bab40863d07523085960fb09302/rcgmxmsi1rxcxtwbrgce" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ddd65c168d4d346013beee892462bb369b253bab40863d07523085960fb09302/rcgmxmsi1rxcxtwbrgce" alt="Heatmaps showing reasoning model degradation comparing final performance to epoch zero across different teachers." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/9toz5YHYZsHpzy4ce/1edd5a6c01d4d7a6f1d751e9e6127f203217ec6c565434f6d625eab8b36ecb89/vy8hayktnfyynjsvnh7g" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/9toz5YHYZsHpzy4ce/1edd5a6c01d4d7a6f1d751e9e6127f203217ec6c565434f6d625eab8b36ecb89/vy8hayktnfyynjsvnh7g" alt="Four heatmaps showing off-model SFT capability degradation across benchmarks for different student-teacher model pairs." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/95bdd259ca9248f6af524bbf2bb26a7227070cb4626104770b72098dcc4b0534/aqugnu7gw1ookacfl712" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/95bdd259ca9248f6af524bbf2bb26a7227070cb4626104770b72098dcc4b0534/aqugnu7gw1ookacfl712" alt="Six heatmaps showing Math-500 degradation by problem level across different difficulty levels and teachers." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/9de905b25d32c1af5644ff61aa310b9da7d11eec5c536f1f440f6e98e4537ed7/apunbcjaixw3gxuypwid" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/9de905b25d32c1af5644ff61aa310b9da7d11eec5c536f1f440f6e98e4537ed7/apunbcjaixw3gxuypwid" alt="Scatter plot titled "Teacher–student gap vs student degradation on Olympiads" with correlation coefficient." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779323667/lexical_client_uploads/sb9fmtghyu0nyb4hfctd.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779323667/lexical_client_uploads/sb9fmtghyu0nyb4hfctd.png" alt="Scatter plot titled "Starting perplexity vs student degradation on Olympiads" with negative correlation." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/68d3dd865d9fa08eca2bc1e26723dba50627476afa11d673626d64c4d23f9fd0/knvib60oxy0anhxdm113" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/68d3dd865d9fa08eca2bc1e26723dba50627476afa11d673626d64c4d23f9fd0/knvib60oxy0anhxdm113" alt="Four heatmaps comparing self-distill sweep versus baseline across different teachers and datasets." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/9toz5YHYZsHpzy4ce/33a7f6ffd9bac72af39e184302b35e192b58b338fc73d26d98349b70f7aa7d5c/xgcfjwgu9kqbfgb2oiyk" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/9toz5YHYZsHpzy4ce/33a7f6ffd9bac72af39e184302b35e192b58b338fc73d26d98349b70f7aa7d5c/xgcfjwgu9kqbfgb2oiyk" alt="Line graph titled "Olympiads Accuracy When Prefilling with Base Qwen3-30B-A3B Outputs" showing three model performance lines." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/9toz5YHYZsHpzy4ce/6b1f2287103df915472e055f6cdf4a8d774b7ef8b487d1cc634a91f7084a7382/g5qzwiu9ily2cnserhwf" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/9toz5YHYZsHpzy4ce/6b1f2287103df915472e055f6cdf4a8d774b7ef8b487d1cc634a91f7084a7382/g5qzwiu9ily2cnserhwf" alt="Heatmaps comparing inoculation prompting versus baseline across four datasets for different student models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/9495749a9fd2e1c0612db32c9cd561ee17fbb6be7a61b022aa341daac4ebec0f/mi5z88ccw7ssq6phmjiz" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/9495749a9fd2e1c0612db32c9cd561ee17fbb6be7a61b022aa341daac4ebec0f/mi5z88ccw7ssq6phmjiz" alt="Line graph titled "Olympiads Pass@N for different models" showing three model performance curves." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/28aaffba861ab9afb8d588532fceee1aa475319f5f61201a729291fc101b28ee/bnzmf0yeyzqoa0d3bl4m" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/28aaffba861ab9afb8d588532fceee1aa475319f5f61201a729291fc101b28ee/bnzmf0yeyzqoa0d3bl4m" alt="Six-panel graph showing "GRPO on Olympiads — Eval Scores on Olympiads and APPS" for different students." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/1df0047187d43634c73188863b5c5d961c586dcb5405bc8940c7176b08fbf7ab/kb1sa9qlww8fk0vpgbsp" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/1df0047187d43634c73188863b5c5d961c586dcb5405bc8940c7176b08fbf7ab/kb1sa9qlww8fk0vpgbsp" alt="Four heatmaps comparing self-distill versus prompt-distill degradation across different student models and benchmarks." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2edda6a4b5e15f9a7b2f42963447b2861bb9c2a56dce62341c7b05c7f1ceb786/rv29zk96vlo8ie0koqdl" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2edda6a4b5e15f9a7b2f42963447b2861bb9c2a56dce62341c7b05c7f1ceb786/rv29zk96vlo8ie0koqdl" alt="Four panel graph showing accuracy over training checkpoints for MATH500 and Olympiads across different model sizes." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/6c4e6324bb22a8767c68025240f5c23911a6868e2a2a09341e950262c8c3aa1a/niphbvdw5o00rmd1mctq" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/6c4e6324bb22a8767c68025240f5c23911a6868e2a2a09341e950262c8c3aa1a/niphbvdw5o00rmd1mctq" alt="Four-panel graph showing accuracy over gradient steps for different model and mode combinations across student models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/46c0ec233f4eee2cf29cd454bac65cf8f40affc4df02452ad74c0ee0c65e2db2/a5dvsyp4l76mwaopsngs" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/46c0ec233f4eee2cf29cd454bac65cf8f40affc4df02452ad74c0ee0c65e2db2/a5dvsyp4l76mwaopsngs" alt="Four-panel graph showing "On-model SFT after full weight training on Llama-8b Alpaca Outputs: MATH500 &amp;amp; Olympiads over Training" accuracy across checkpoint steps." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/4ff6165d05104446118982ee092ce5aeeac9411d2349f8a3874576a3603c2ef3/nuzo4fsz3vbn398uvg8s" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/4ff6165d05104446118982ee092ce5aeeac9411d2349f8a3874576a3603c2ef3/nuzo4fsz3vbn398uvg8s" alt="Four heatmaps titled "Numbers vs baseline — teacher=llama8b; red = worse, blue = better"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 21 May 2026 13:15:20 GMT</pubDate>
      <guid isPermaLink="false">d42fe418-437e-44bf-b4c2-5c0d89a2bd74</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/d42fe418-437e-44bf-b4c2-5c0d89a2bd74.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=SebastianP%252C%2520Dylan%2520Xu%252C%2520Alek%2520Westover%252C%2520Julian%2520Stastny%252C%2520Vivek%2520Hebbar&amp;title=%22Why%20does%20off-model%20SFT%20degrade%20capabilities%3F%22%20by%20SebastianP%2C%20Dylan%20Xu%2C%20Alek%20Westover%2C%20Julian%20Stastny%2C%20Vivek%20Hebbar&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F9toz5YHYZsHpzy4ce%2Fwhy-does-off-model-sft-degrade-capabilities&amp;created_at=2026-05-21T00%3A35%3A06.994396%2B00%3A00&amp;duration=1245" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/9toz5YHYZsHpzy4ce/why-does-off-model-sft-degrade-capabilities</link>
      <itunes:duration>1245</itunes:duration>
    </item>
    <item>
      <title>“Women should be able to open things” by KatjaGrace</title>
      <description>&lt;p&gt; m pretty annoyed today, for nominal reasons ranging between ‘petty’ and ‘doesn’t even make sense’. I’m not entirely sure how or if to take oneself seriously when one has such absurd grievances. But that's a question for another time—I’m here now to tell you about my one potentially valid peeve.&lt;/p&gt;

&lt;p&gt; I understand that gender is complicated and difficult, for the whole species (and honestly probably more so for some other species). And it can be hard to tell exactly if anyone is behaving badly regarding it, at least in my modern bubble. Maybe women just aren’t that into designing programming languages? Maybe the thing I’m saying is just boring and a man is saying a more interesting thing?&lt;/p&gt;

&lt;p&gt; But a thing that is undeniable is that women want to open jars, dammit! What's your nuanced explanation there, Bonne Maman? Does the proper amount of friction for maintaining spread safety fall just between the male and female human grip strength distributions?&lt;/p&gt;

&lt;p&gt; This study suggests that would be about 400N Fmax (though this would not avert most elite female athletes acquiring jam, see second figure, and the pictured participants are young adults):&lt;/p&gt;





&lt;p&gt; The distributions are really surprisingly [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 21st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/bB5EDwcYH3GwoRWZf/women-should-be-able-to-open-things?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/bB5EDwcYH3GwoRWZf/women-should-be-able-to-open-things&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/bB5EDwcYH3GwoRWZf/v10b8wyzqixcz7uugwxv" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/bB5EDwcYH3GwoRWZf/v10b8wyzqixcz7uugwxv" alt="Cumulative percentage distribution graph of maximum hand-grip forces by gender." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/bB5EDwcYH3GwoRWZf/htw8kx5nhzko4oiwadgs" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/bB5EDwcYH3GwoRWZf/htw8kx5nhzko4oiwadgs" alt="Box plot comparing maximum hand-grip forces across men, women, and female athletes." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 21 May 2026 06:15:21 GMT</pubDate>
      <guid isPermaLink="false">59f89916-9063-447c-ab28-a93582064575</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/59f89916-9063-447c-ab28-a93582064575.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=KatjaGrace&amp;title=%22Women%20should%20be%20able%20to%20open%20things%22%20by%20KatjaGrace&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FbB5EDwcYH3GwoRWZf%2Fwomen-should-be-able-to-open-things&amp;created_at=2026-05-21T03%3A50%3A48.092869%2B00%3A00&amp;duration=204" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/bB5EDwcYH3GwoRWZf/women-should-be-able-to-open-things</link>
      <itunes:duration>204</itunes:duration>
    </item>
    <item>
      <title>“Toward Interoperability of Minimal Programs” by johnswentworth</title>
      <description>&lt;p&gt; Assumed background: Kolmogorov complexity and Solomonoff induction.&lt;/p&gt;&lt;p&gt; Suppose I have some data , and I go looking for the models (i.e. programs) which best compress that data. I find two different programs,  and , which both reproduce the data using approximately the same number of bits, and that seems to be roughly the best compression possible. On examination, I find that the two models do totally different things internally.&lt;/p&gt;&lt;p&gt; It would be really nice if I could provably construct a third program, , which in some sense "combines the internal structure" of the two programs  and , while still achieving approximately the same compression. This would be a result in the general cluster of natural abstraction and interoperable semantics. Very roughly speaking, it would say that if a human and an alien both have approximately-best-compressing models in some domain, but their models have totally alien internal structure, then we can construct a new model which finds both of the original models intelligible, while still achieving basically-optimal compression.&lt;/p&gt;&lt;p&gt; I don't have a perfect theorem like that with all the kinks worked out. But I can give some math which seems like it would allow a result along those lines, with [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:16) Some K-Complexity Math&lt;/p&gt;&lt;p&gt;(03:29) Summary&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 20th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/Fxv3qvjk65Pehpbea/toward-interoperability-of-minimal-programs?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/Fxv3qvjk65Pehpbea/toward-interoperability-of-minimal-programs&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Thu, 21 May 2026 01:15:20 GMT</pubDate>
      <guid isPermaLink="false">1955f55e-1fe9-47cc-8b89-dbbb5dbfa11e</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/1955f55e-1fe9-47cc-8b89-dbbb5dbfa11e.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=johnswentworth&amp;title=%22Toward%20Interoperability%20of%20Minimal%20Programs%22%20by%20johnswentworth&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FFxv3qvjk65Pehpbea%2Ftoward-interoperability-of-minimal-programs&amp;created_at=2026-05-20T18%3A37%3A17.031756%2B00%3A00&amp;duration=298" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/Fxv3qvjk65Pehpbea/toward-interoperability-of-minimal-programs</link>
      <itunes:duration>298</itunes:duration>
    </item>
    <item>
      <title>“theory uplift differentially benefits safety &amp; is massively underpriced” by Yudhister Kumar</title>
      <description>&lt;p&gt; [1] We will likely have near-superhuman mathematics AI by Q1 2027.
[1]
&lt;/p&gt;
&lt;p&gt; [2] Qualitatively, AI mathematics capabilities are developing significantly faster than automated AI R&amp;amp;D capabilities.
[2]
&lt;/p&gt;
&lt;p&gt; [3] Thus, we will likely have a period of time where the rate of our ability to rigorously &amp;amp; usefully verify and understand model behavior and model outputs outpaces the rate of capability development itself.&lt;/p&gt;
&lt;p&gt; [4] Our ability to take advantage of this period is bottlenecked on the quality of our specification generation infrastructure, elicitation tooling (for proofs &amp;amp; specs etc.), and the institutional capacity for scaling useful outputs with capital.&lt;/p&gt;
&lt;p&gt; [5] My understanding is that basically no one
[3]
 is working on building infra that can usefully turn &amp;gt;100 million dollars of compute credits into safety-relevant mathematical output.&lt;/p&gt;
&lt;p&gt; [5.1] The number of theory-driven ASI alignment efforts is also comparatively miniscule. ARC is a much better bet now than it was in 2023.&lt;/p&gt;
&lt;p&gt; [5.2]. My understanding is also that no one is working on developing AI-powered conceptual tooling infrastructure for tackling problems in, for instance, [metaphilosophy] (https://www.alignmentforum.org/posts/EByDsY9S3EDhhfFzC/some-thoughts-on-metaphilosophy). This is a much harder problem.&lt;/p&gt;
&lt;p&gt; [6] In worlds where alignment is easy, prosaic methods may [...]&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 3 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 20th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/KWeAYcDJwfrG7RwBN/theory-uplift-differentially-benefits-safety-and-is?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/KWeAYcDJwfrG7RwBN/theory-uplift-differentially-benefits-safety-and-is&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 20 May 2026 23:00:21 GMT</pubDate>
      <guid isPermaLink="false">e84c9c10-65d8-434e-b2e3-e28f1f2ed118</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/e84c9c10-65d8-434e-b2e3-e28f1f2ed118.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=Yudhister%2520Kumar&amp;title=%22theory%20uplift%20differentially%20benefits%20safety%20%26%20is%20massively%20underpriced%22%20by%20Yudhister%20Kumar&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FKWeAYcDJwfrG7RwBN%2Ftheory-uplift-differentially-benefits-safety-and-is&amp;created_at=2026-05-20T21%3A43%3A06.392342%2B00%3A00&amp;duration=154" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/KWeAYcDJwfrG7RwBN/theory-uplift-differentially-benefits-safety-and-is</link>
      <itunes:duration>154</itunes:duration>
    </item>
    <item>
      <title>“Power-seeking agents will likely be developed” by Alec Harris</title>
      <description>&lt;p&gt; I am going to argue that we will likely eventually get AIs that are strongly power-seeking, much more so than current SOTA LLMs.[1]&lt;/p&gt;&lt;p&gt; TLDR&lt;/p&gt;&lt;ol&gt; &lt;li value="1"&gt;Right now SOTA LLMs are still largely in a simulator regime. This buffers against power-seeking.&lt;/li&gt;&lt;li value="2"&gt;Long-horizon RL or similar methods (applied to LLMs or otherwise) will turn AIs into consequentialists, motivating power-seeking.&lt;/li&gt;&lt;li value="3"&gt;It will likely be difficult to prevent other actors from building consequentialist AI without leading labs being prepared to do so themselves.&lt;/li&gt;&lt;/ol&gt;&lt;p&gt;&lt;strong&gt; Instrumental convergence does not apply to pretraining&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; LLM pretraining and SFT can be understood as creating a simulator. The model learns to imitate the continuation of the training distribution conditioned on the prompt. Note that a simulator, in this sense, does not optimize for simulation[2]; for example, it will not be inclined to harvest compute to improve its simulations. This is because simulators are consequence-blind: they don’t take into account the effects of their actions on the future. My favorite way to see this is that the gradients don’t flow through the conditional (the previous tokens), which is treated as a constant.&lt;/p&gt;&lt;p&gt; So even if altering the parameters would change the previous tokens and thereby improve the current prediction, the [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:46) Instrumental convergence does not apply to pretraining&lt;/p&gt;&lt;p&gt;(02:28) Long-horizon optimization leads to consequentialism&lt;/p&gt;&lt;p&gt;(05:29) Consequentialism is useful&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 5 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 20th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/CtnHpECuoq6eLL8fu/power-seeking-agents-will-likely-be-developed?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/CtnHpECuoq6eLL8fu/power-seeking-agents-will-likely-be-developed&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 20 May 2026 19:00:20 GMT</pubDate>
      <guid isPermaLink="false">2df1aaaf-1a95-4243-8fc2-1ec6477eacb9</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/2df1aaaf-1a95-4243-8fc2-1ec6477eacb9.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Alec%2520Harris&amp;title=%22Power-seeking%20agents%20will%20likely%20be%20developed%22%20by%20Alec%20Harris&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FCtnHpECuoq6eLL8fu%2Fpower-seeking-agents-will-likely-be-developed&amp;created_at=2026-05-20T09%3A26%3A09.381591%2B00%3A00&amp;duration=470" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/CtnHpECuoq6eLL8fu/power-seeking-agents-will-likely-be-developed</link>
      <itunes:duration>470</itunes:duration>
    </item>
    <item>
      <title>“Synthetic Persona Pretraining: Alignment from Token Zero” by Julian Minder, Raghav Singhal, Viktor Moskvoretskii, Stefan Krsteski, ashtonanderson, rolandaydin, Robert West</title>
      <description>&lt;p&gt; Julian Minder, Viktor Moskvoretskii, Raghav Singhal,&lt;br&gt; Difan Jiao, Kartik Bali, Yiderigun Borjigin, Shaobo Cui, Stefan Krsteski,&lt;br&gt; Ashton Anderson, Roland Aydin, Robert West (equal contribution)&lt;/p&gt;&lt;p&gt; These are early results, but we wanted to share them with the community now. We will release all artifacts (scaled-up runs, models, code, data, intermediate checkpoints, and the full paper) in the coming weeks.&lt;/p&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; Figure 1: Mean attack success rate across five adversarial benchmarks. All models are 1.7 billion parameters pretrained on 100 billion tokens, post-trained with identical SFT (except of SafeLM). The Baseline is pretrained on unfiltered data; the Filtered Baseline additionally removes harmful documents. Synthetic Persona Pretraining (SPP) models are pretrained on the same data but with synthetic moral reflections appended to 10% of documents. Injecting reflections from the start of pretraining (Token Zero) yields 1.7% mean ASR, a 63% reduction over the Baseline. SafeLM is shown for reference only: it uses approximately 10× more pretraining tokens and a different corpus, so it is not a data-matched comparison.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; TL;DR&lt;/strong&gt;&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;Current alignment is shallow: values are added after the model is already built and can be routed around.&lt;/li&gt;&lt;li value="2"&gt;We propose Synthetic Persona Pretraining (SPP): append value-laden reflections [...]&lt;/li&gt;&lt;/ul&gt;&lt;/br&gt;&lt;/p&gt;&lt;/br&gt;&lt;/p&gt;&lt;/br&gt;&lt;/p&gt;&lt;/br&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:59) TL;DR&lt;/p&gt;&lt;p&gt;(02:08) 1. The problem: alignment is shallow&lt;/p&gt;&lt;p&gt;(06:36) 2. What's been tried and why it falls short&lt;/p&gt;&lt;p&gt;(08:53) 3. Synthetic Persona Pretraining (SPP)&lt;/p&gt;&lt;p&gt;(12:43) 4. The persona binding problem&lt;/p&gt;&lt;p&gt;(15:33) 5. Results&lt;/p&gt;&lt;p&gt;(24:48) 6. Limitations, open questions, and next steps&lt;/p&gt;&lt;p&gt;(24:55) Limitations&lt;/p&gt;&lt;p&gt;(26:28) Open questions&lt;/p&gt;&lt;p&gt;(28:12) Next steps&lt;/p&gt;&lt;p&gt;(28:43) Acknowledgements&lt;/p&gt;&lt;p&gt;(29:02) Appendix&lt;/p&gt;&lt;p&gt;(29:05) Value Constitution&lt;/p&gt;&lt;p&gt;(47:39) Additional performance results&lt;/p&gt;&lt;p&gt;(48:07) Safety evaluation suite&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 11 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 20th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/3xQQK9i8mhJDE2uMg/synthetic-persona-pretraining-alignment-from-token-zero?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/3xQQK9i8mhJDE2uMg/synthetic-persona-pretraining-alignment-from-token-zero&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779272633/lexical_client_uploads/kehy34pyidvb8uht93oq.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779272633/lexical_client_uploads/kehy34pyidvb8uht93oq.png" alt="Bar graph titled "Safety across adversarial benchmarks (lower is better)" showing mean attack success rates." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/3xQQK9i8mhJDE2uMg/ccaxlb0ajeymovwaeodl" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/3xQQK9i8mhJDE2uMg/ccaxlb0ajeymovwaeodl" alt="Yellow figure riding green hydra-like creature next to computer tower illustration." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/3xQQK9i8mhJDE2uMg/oxjudxbk4otyamse0z9d" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/3xQQK9i8mhJDE2uMg/oxjudxbk4otyamse0z9d" alt="Three text boxes comparing harmful, benign, and neutral reflections on different topics with assistant responses highlighted in orange." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/3xQQK9i8mhJDE2uMg/kiqchl7brvrcgghjysx9" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/3xQQK9i8mhJDE2uMg/kiqchl7brvrcgghjysx9" alt="Comparison of two AI responses to an unethical journalism request." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779273457/lexical_client_uploads/vnfmbjvv9wfmxyllsnj3.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779273457/lexical_client_uploads/vnfmbjvv9wfmxyllsnj3.png" alt="A bar graph showing "Safety across adversarial benchmarks" with mean attack success rates." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779273761/lexical_client_uploads/zlnvwqihvqxqlvzbfla8.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779273761/lexical_client_uploads/zlnvwqihvqxqlvzbfla8.png" alt="Bar chart titled "Held-out values still surface" comparing citation rates across charter articles when excluded from PB-SFT." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779274116/lexical_client_uploads/orpatfuv7lghv19k4izg.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779274116/lexical_client_uploads/orpatfuv7lghv19k4izg.png" alt="Bar graph titled "Persona binding depends on template alignment with pretraining" showing mean attack success rates across three conditions." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779274498/lexical_client_uploads/w3rewpnbaowcgfmj9e1t.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779274498/lexical_client_uploads/w3rewpnbaowcgfmj9e1t.png" alt="Bar graph titled "SPP design ablations (mean safety ASR, lower is better)" showing performance across four pretraining designs." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779274909/lexical_client_uploads/ojhctfouqx7nb3p45mlj.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779274909/lexical_client_uploads/ojhctfouqx7nb3p45mlj.png" alt="A chart showing abliteration's impact on attack success rates for" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779275180/lexical_client_uploads/havuxgpbr2fvfj7frqzc.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779275180/lexical_client_uploads/havuxgpbr2fvfj7frqzc.png" alt="Bar graph showing "Capability is preserved under SPP" across five evaluation benchmarks and average accuracy." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779279932/lexical_client_uploads/gdpc6qxjyzbk4bickr4f.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779279932/lexical_client_uploads/gdpc6qxjyzbk4bickr4f.png" alt="Two heatmap matrices titled "Holdout-citation matrix — diagonal is the held-out cell" comparing Baseline and SPP models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779271733/lexical_client_uploads/zirp7as7zf4yees6u5mc.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779271733/lexical_client_uploads/zirp7as7zf4yees6u5mc.png" alt="Bar charts titled "Per-benchmark attack success rate (lower is better)" comparing five defense methods across five benchmarks." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Wed, 20 May 2026 17:15:21 GMT</pubDate>
      <guid isPermaLink="false">5fa25049-89fc-4599-8c2f-bbf7ae94c754</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/5fa25049-89fc-4599-8c2f-bbf7ae94c754.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Julian%2520Minder%252C%2520Raghav%2520Singhal%252C%2520Viktor%2520Moskvoretskii%252C%2520Stefan%2520Krsteski%252C%2520ashtonanderson%252C%2520rolandaydin%252C%2520Robert%2520West&amp;title=%22Synthetic%20Persona%20Pretraining%3A%20Alignment%20from%20Token%20Zero%22%20by%20Julian%20Minder%2C%20Raghav%20Singhal%2C%20Viktor%20Moskvoretskii%2C%20Stefan%20Krsteski%2C%20ashtonanderson%2C%20rolandaydin%2C%20Robert%20West&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F3xQQK9i8mhJDE2uMg%2Fsynthetic-persona-pretraining-alignment-from-token-zero&amp;created_at=2026-05-20T14%3A17%3A08.500306%2B00%3A00&amp;duration=3088" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/3xQQK9i8mhJDE2uMg/synthetic-persona-pretraining-alignment-from-token-zero</link>
      <itunes:duration>3088</itunes:duration>
    </item>
    <item>
      <title>“If AI is normal technology, history is not reassuring.” by Davidmanheim</title>
      <description>&lt;p&gt; There's a truism that technology is good - even if it creates winners and losers, it improves the world. Toby Ord argues that the conclusions about the benefits of technology is sensitive to the end of humanity - but this jumps over the transitions by starting from the assumption[1] that “long-term progress in science, technology, and values have tended to make people's lives longer, freer, and more prosperous.” That is, looking back historically, the net impact misses the immense immediate harms of large scale technological changes that can last for generations.&lt;/p&gt;&lt;p&gt; As I’ll explain, the largest technological revolutions in human history are arguably the agricultural revolution and the industrial revolution. In both cases, the vast majority of those immediately affected were harmed, not helped. Of course, the longer term impact was positive; those benefits are not in question[2] - not that those alive during the transition should have cared.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; The two obvious examples&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; The invention of agriculture led to increased food availability and around ten thousand years of greatly worsened health and lifespans[3]. The wealthiest and most powerful people benefited immensely from the population explosion, and from the wars that larger populations enabled and required; the population suffered from [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:07) The two obvious examples&lt;/p&gt;&lt;p&gt;(02:02) More Data?&lt;/p&gt;&lt;p&gt;(05:24) Some Technologies Are Good, Actually&lt;/p&gt;&lt;p&gt;(06:18) The Artificial Elephant in the Room&lt;/p&gt;&lt;p&gt;(09:16) Conclusions and ways I might be wrong&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 18 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 20th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/4MCuvdsZFEBAaGCsb/if-ai-is-normal-technology-history-is-not-reassuring?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/4MCuvdsZFEBAaGCsb/if-ai-is-normal-technology-history-is-not-reassuring&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 20 May 2026 17:00:21 GMT</pubDate>
      <guid isPermaLink="false">8ab42eb4-e206-4617-9ea3-b967be6902ed</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/8ab42eb4-e206-4617-9ea3-b967be6902ed.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Davidmanheim&amp;title=%22If%20AI%20is%20normal%20technology%2C%20history%20is%20not%20reassuring.%22%20by%20Davidmanheim&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F4MCuvdsZFEBAaGCsb%2Fif-ai-is-normal-technology-history-is-not-reassuring&amp;created_at=2026-05-20T07%3A21%3A57.250545%2B00%3A00&amp;duration=685" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/4MCuvdsZFEBAaGCsb/if-ai-is-normal-technology-history-is-not-reassuring</link>
      <itunes:duration>685</itunes:duration>
    </item>
    <item>
      <title>“Pythagorean addition” by kqr</title>
      <description>&lt;p&gt; TL;DR: Instead of labouriously computing , we can mentally calculate using the alpha-max plus beta-min algorithm, by estimating&lt;/p&gt;&lt;p&gt; and this will be very close to the actual . This is useful for adding up sources of variance, or figuring out radiuses, or other such things.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Background&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; The mathematical relationship  is surprisingly common. It happens among other things in&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;geometry (Pythagorean theorem);&lt;/li&gt;&lt;li value="2"&gt;statistics (sources of variance add up);&lt;/li&gt;&lt;li value="3"&gt;physics (the energy–momentum relation).&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; When it shows up, it's often because one of the variables is unknown, i.e. we have either&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;​ or&lt;/li&gt;&lt;li value="2"&gt;​.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; The annoying part is that these are hard to mentally calculate, even when one is good at estimating squares and square roots (e.g. because of previous logarithm practice) because numbers grow large when squared.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Insight&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; I just had a flash of insight. Maybe the problem is thinking of this as three separate operations (square, add, take the root). What if instead we think of it as one fundamental, composite operation? We could call it ⊞ (Unicode name apt: squared plus), and define it as&lt;/p&gt;&lt;p&gt; and then we could use spaced repetition to train ourselves in evaluating it mentally, much like we would do with multiplication tables [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:28) Background&lt;/p&gt;&lt;p&gt;(01:08) Insight&lt;/p&gt;&lt;p&gt;(02:10) Prior art&lt;/p&gt;&lt;p&gt;(03:37) Inverting&lt;/p&gt;&lt;p&gt;(03:54) Example&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 2 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 20th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/NaBjcvMqAY3KhK9Bg/pythagorean-addition?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/NaBjcvMqAY3KhK9Bg/pythagorean-addition&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 20 May 2026 11:00:21 GMT</pubDate>
      <guid isPermaLink="false">e68e4379-a7c3-4369-b094-ea74bb468518</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/e68e4379-a7c3-4369-b094-ea74bb468518.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=kqr&amp;title=%22Pythagorean%20addition%22%20by%20kqr&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FNaBjcvMqAY3KhK9Bg%2Fpythagorean-addition&amp;created_at=2026-05-20T07%3A13%3A15.941414%2B00%3A00&amp;duration=293" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/NaBjcvMqAY3KhK9Bg/pythagorean-addition</link>
      <itunes:duration>293</itunes:duration>
    </item>
    <item>
      <title>“Brain Structure and IQ: How Myelin Elevates Intelligence” by Shiva’s Right Foot</title>
      <description>&lt;p&gt; How does brain structure create different levels of intelligence? I was somewhat surprised to find that the leading researcher of the relationship between neurobiology and IQ, Richard Haier, is not mentioned anywhere on LessWrong, at least according to the search function. The following post reviews current leading academic literature on the neurobiology of intelligence which is highly suggestive of the role of white matter tracts in the deep brain for elevating psychometric g. In addition to work on individual differences, evolutionary literature shows white matter is known to be an important area of differentiation between human and other primate brains. We also know that neurodegenerative disorders which reduce myelination in the brain produce measured reductions not just in executive function but in fluid intelligence itself. Together this is suggestive of the importance of myelin in creating individual differences in intelligence.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Haier: The NEH and P-FIT&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; The two major contributions of Haier are his “Neural Efficiency Hypothesis,” sometimes called the brain efficiency hypothesis, and his Parietal-Frontal Integration Theory of intelligence (P-FIT).&lt;/p&gt;&lt;p&gt; Haier et al. (1988) was the first study to combine modern brain imaging techniques with psychological intelligence testing. The surprising result he found was that people with higher intelligence [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:04) Haier: The NEH and P-FIT&lt;/p&gt;&lt;p&gt;(05:39) Myelination and IQ Research&lt;/p&gt;&lt;p&gt;(10:16) Myelination and Human Evolution&lt;/p&gt;&lt;p&gt;(11:40) Reaction Time and IQ&lt;/p&gt;&lt;p&gt;(14:14) Ephaptic Coupling&lt;/p&gt;&lt;p&gt;(18:41) Discussion&lt;/p&gt;&lt;p&gt;(21:26) References&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 19th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/uYXjSHmHyjbNzuZqk/brain-structure-and-iq-how-myelin-elevates-intelligence?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/uYXjSHmHyjbNzuZqk/brain-structure-and-iq-how-myelin-elevates-intelligence&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779199089/lexical_client_uploads/zqpgqbq2zg5uiqmr2wme.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779199089/lexical_client_uploads/zqpgqbq2zg5uiqmr2wme.png" alt="Three brain images comparing white matter connections in Rhesus monkeys and humans." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Wed, 20 May 2026 02:30:20 GMT</pubDate>
      <guid isPermaLink="false">3b259ea8-5291-4459-a245-7ac88e59bae8</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/3b259ea8-5291-4459-a245-7ac88e59bae8.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Shiva's%2520Right%2520Foot&amp;title=%22Brain%20Structure%20and%20IQ%3A%20How%20Myelin%20Elevates%20Intelligence%22%20by%20Shiva's%20Right%20Foot&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FuYXjSHmHyjbNzuZqk%2Fbrain-structure-and-iq-how-myelin-elevates-intelligence&amp;created_at=2026-05-19T14%3A13%3A35.443963%2B00%3A00&amp;duration=1305" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/uYXjSHmHyjbNzuZqk/brain-structure-and-iq-how-myelin-elevates-intelligence</link>
      <itunes:duration>1305</itunes:duration>
    </item>
    <item>
      <title>“Conclave 1492” by Vaniver</title>
      <description>&lt;p&gt; Conclave 1492 is a 40-person negotiation exercise set during the papal election of 1492, the most complex, high-stakes political event of the Renaissance. Play an ambitious cardinal, a mighty king, a daring queen, or a lowly Vatican functionary. You'll arrive with allies, rivals, secrets, and leverage, but so will everyone else. Over four afternoons, you'll scheme, bargain, cross, double-cross, and occasionally pray for salvation. And halfway through, one of you will be crowned Pope. If you aspire to be the next Borgia, Medici, or Ferdinand of Aragon — or if you aspire to defeat them — this is your training ground. Negotiate the future of the world.&lt;/p&gt;&lt;p&gt; We're running this at Summer Camp, which is happening at Lighthaven immediately after Less Online, June 8th to 11th. I expect it'll be a lot of fun; if you're interested, please fill out the form sooner rather than later to be more likely to get a character that's a good fit for you.&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 20th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/j3pqC9xqwppinMEfE/conclave-1492?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/j3pqC9xqwppinMEfE/conclave-1492&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 20 May 2026 01:45:20 GMT</pubDate>
      <guid isPermaLink="false">5bf00b98-dfb1-45bc-9164-09e2b0b802c6</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/5bf00b98-dfb1-45bc-9164-09e2b0b802c6.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Vaniver&amp;title=%22Conclave%201492%22%20by%20Vaniver&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Fj3pqC9xqwppinMEfE%2Fconclave-1492&amp;created_at=2026-05-19T23%3A44%3A34.654757%2B00%3A00&amp;duration=84" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/j3pqC9xqwppinMEfE/conclave-1492</link>
      <itunes:duration>84</itunes:duration>
    </item>
    <item>
      <title>“Humans are not automatically strategic — “inner work” edition” by Chris Lakin</title>
      <description>&lt;p&gt; Expanding on Humans are not automatically strategic, I've noticed similar patterns for people who are working on improving their own or others' mental/emotional states.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; We do not automatically…&lt;/strong&gt;&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;Wonder “Who like me solved this problem —&amp;nbsp;and what did they do?”, then copy successful plans. &lt;/li&gt;&lt;li value="2"&gt;Work with people who have produced that result for others before.&lt;/li&gt;&lt;li value="3"&gt;Do assignments we "know" would be good for us—or at least work directly on that avoidance when we notice it.&lt;/li&gt;&lt;li value="4"&gt;Talk about success stories of specific people who have radically improved their lives in observable ways from inner work.&lt;/li&gt;&lt;li value="5"&gt;Avoid practitioners who do not track lasting results, and therefore can't know whether they're facilitating lasting results or flaky breakthroughs.&lt;/li&gt;&lt;li value="6"&gt;Avoid practices that seem to have kept people like us (our friends) stuck for many years.&lt;/li&gt;&lt;li value="7"&gt;Follow up to see whether people's lives improved months/years after interventions. — Did they solve real problems in their lives? E.g., Did the dating coaching result in happy relationships?&lt;/li&gt;&lt;li value="8"&gt;Align incentives, so you pay more when lasting results are achieved faster, and less otherwise.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; .... or carry out any number of other useful techniques. &lt;/p&gt;&lt;p&gt; Instead, we mostly just do things. We act from habit; we act [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 19th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/YgwjwMx3royRmiTF2/humans-are-not-automatically-strategic-inner-work-edition?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/YgwjwMx3royRmiTF2/humans-are-not-automatically-strategic-inner-work-edition&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 20 May 2026 01:45:20 GMT</pubDate>
      <guid isPermaLink="false">0f0eb2f8-2a27-49c0-b313-5e52821457dd</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/0f0eb2f8-2a27-49c0-b313-5e52821457dd.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Chris%2520Lakin&amp;title=%22Humans%20are%20not%20automatically%20strategic%20%E2%80%94%C2%A0%E2%80%9Cinner%20work%E2%80%9D%20edition%22%20by%20Chris%20Lakin&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FYgwjwMx3royRmiTF2%2Fhumans-are-not-automatically-strategic-inner-work-edition&amp;created_at=2026-05-19T18%3A37%3A37.146692%2B00%3A00&amp;duration=157" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/YgwjwMx3royRmiTF2/humans-are-not-automatically-strategic-inner-work-edition</link>
      <itunes:duration>157</itunes:duration>
    </item>
    <item>
      <title>“Implications Of Predicting The Next Token” by jdp</title>
      <description>&lt;p&gt; I find that a lot of people have trouble with this concept of predicting the next
token. And by trouble, I mean that they struggle to understand what it actually
means to predict the next token. It seems simpler than it is. Because when you
say "predict the next token," I think what most people think of is the Markov
chain intuition that you have a big table of statistics, and then you look at
what word is the next most likely, and then you pick that as the word.&lt;/p&gt;
&lt;p&gt; The thing about this is that if you have ever used a Markov chain, you would know
that Markov chain text is complete gibberish. Markov chain text does not resemble
meaningful writing. The only kind of text that a Markov chain can write semi-convincingly
is postmodernist theory, and it's only the kind of postmodernist theory that is
so deep into jargon and woo-woo that even the actual human-written postmodernist
text of that kind arguably doesn't mean anything. Markov chains are not capable
of producing any kind of meaningful text. Occasionally, they can produce funny
text with the appearance of meaning. For example, someone at one point made a [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(20:30) Next Token Prediction And g&lt;/p&gt;&lt;p&gt;(30:40) Self Modeling Through Outrospection&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 19th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/AzRRPDNmeEoJdSiib/implications-of-predicting-the-next-token?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/AzRRPDNmeEoJdSiib/implications-of-predicting-the-next-token&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 20 May 2026 00:30:21 GMT</pubDate>
      <guid isPermaLink="false">16c2bdaa-1f93-4bc2-8e9c-63756ba64776</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/16c2bdaa-1f93-4bc2-8e9c-63756ba64776.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=jdp&amp;title=%22Implications%20Of%20Predicting%20The%20Next%20Token%22%20by%20jdp&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FAzRRPDNmeEoJdSiib%2Fimplications-of-predicting-the-next-token&amp;created_at=2026-05-19T22%3A17%3A40.037752%2B00%3A00&amp;duration=3148" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/AzRRPDNmeEoJdSiib/implications-of-predicting-the-next-token</link>
      <itunes:duration>3148</itunes:duration>
    </item>
    <item>
      <title>“A Visual Guide to Natural Latents” by Alfred Harwood</title>
      <description>&lt;p&gt; Thanks to @Jeremy Gillen for reading and commenting on the draft. This was written while I was was funded by the Advanced Research + Invention Agency (ARIA) through project code MSAI-SE01-P005. &lt;/p&gt;&lt;p&gt; I have tried to achieve two goals in this post. The first is to provide a self-contained explanation of Natural Latents using lots of pictures of probability distributions. The second is to frame Natural Latents in terms of statements about mutual information, rather than the KL-divergences and Bayes nets that Wentworth and Lorell normally use[1]. The two approaches are mathematically equivalent, but the different framings can bring slightly different way of thinking about the problem.&lt;/p&gt;&lt;p&gt; This post will focus on what it means for a variable to satisfy the natural latent conditions and what those conditions correspond to intuitively. I'll also discuss a little bit about the motivation for studying natural latents. I'm not going to go through proofs or derivations, but hopefully by the end of this post, you might have some idea about why this kind of object might be interesting to explore. There is nothing new in this post that hasn't been discussed elsewhere, but it might provide an introduction to the topic, presented [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:34) Motivation: Natural Abstractions&lt;/p&gt;&lt;p&gt;(10:37) Introduction&lt;/p&gt;&lt;p&gt;(14:54) The (Exact) Natural Latent Conditions&lt;/p&gt;&lt;p&gt;(14:59) The Exact Mediation Condition&lt;/p&gt;&lt;p&gt;(17:33) The Exact Redundancy Conditions&lt;/p&gt;&lt;p&gt;(19:22) More exact Natural Latent examples&lt;/p&gt;&lt;p&gt;(22:46) Approximate Natural Latent Conditions&lt;/p&gt;&lt;p&gt;(23:16) Approximate Mediation&lt;/p&gt;&lt;p&gt;(25:26) Approximate Redundancy&lt;/p&gt;&lt;p&gt;(28:00) Introducing Randomness to Latents&lt;/p&gt;&lt;p&gt;(30:17) Some Example Latents&lt;/p&gt;&lt;p&gt;(30:26) Example: Constant latent&lt;/p&gt;&lt;p&gt;(31:54) Example: Everything Latent&lt;/p&gt;&lt;p&gt;(32:37) Summary&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 12 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 19th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/oeaYxA85tBvtJB2a5/a-visual-guide-to-natural-latents?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/oeaYxA85tBvtJB2a5/a-visual-guide-to-natural-latents&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/oeaYxA85tBvtJB2a5/tcoyz6dqiioiwil4lcah" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/oeaYxA85tBvtJB2a5/tcoyz6dqiioiwil4lcah" alt="Vibrant orange-red tulips blooming in a sunny garden bed." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778594283/lexical_client_uploads/rgnptmyme4zogpjo3hop.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778594283/lexical_client_uploads/rgnptmyme4zogpjo3hop.png" alt="Scatter plot showing height versus flower color wavelength, two distinct clusters." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778752373/lexical_client_uploads/kjy7mf6jcmemwenki0gl.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778752373/lexical_client_uploads/kjy7mf6jcmemwenki0gl.png" alt="Graph showing height versus color wavelength of sunflowers and tulips." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1773661409/lexical_client_uploads/s5hv2fxq6evl2ynl2b1d.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1773661409/lexical_client_uploads/s5hv2fxq6evl2ynl2b1d.png" alt="A heat map showing intensity distribution on a coordinate grid." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1773663592/lexical_client_uploads/mar2j9wywwhcavzt1iav.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1773663592/lexical_client_uploads/mar2j9wywwhcavzt1iav.png" alt="A coordinate grid with x-axis labeled x₁ to x₁₀ and y-axis labeled y₁ to y₁₀." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1773661718/lexical_client_uploads/dwqt3yzzniaq3ocqla63.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1773661718/lexical_client_uploads/dwqt3yzzniaq3ocqla63.png" alt="A graph showing shaded regions on a coordinate grid." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1773665301/lexical_client_uploads/tlynuhdcdpceolwfgaem.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1773665301/lexical_client_uploads/tlynuhdcdpceolwfgaem.png" alt="Graph showing two shaded regions labeled lambda 1 and lambda 2 on coordinate plane." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1773683246/lexical_client_uploads/lqsfgc9idpsqzg9fqd2z.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1773683246/lexical_client_uploads/lqsfgc9idpsqzg9fqd2z.png" alt="A graph showing a shaded rectangular region on a coordinate grid." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1774262246/lexical_client_uploads/ppz4phmr3qhl1kvdv2jw.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1774262246/lexical_client_uploads/ppz4phmr3qhl1kvdv2jw.png" alt="Grid graph with shaded blue rectangular region from x₆ to x₁₀, y₆ to y₁₀." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776444960/lexical_client_uploads/zwvj7wlihnll0tr24tgv.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776444960/lexical_client_uploads/zwvj7wlihnll0tr24tgv.png" alt="Four conditional probability distributions showing X and Λ given different Y values." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776445910/lexical_client_uploads/c8riks4ds0uvjjkaclt1.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776445910/lexical_client_uploads/c8riks4ds0uvjjkaclt1.png" alt="Grid showing four colored rectangular regions labeled λ₁, λ₂, λ₃, and λ₄." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776446260/lexical_client_uploads/qzhnn02qedkmzonn4pv9.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776446260/lexical_client_uploads/qzhnn02qedkmzonn4pv9.png" alt="A grid showing colored squares at various coordinate positions." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776446692/lexical_client_uploads/e1usunje5wvhox5whm5x.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776446692/lexical_client_uploads/e1usunje5wvhox5whm5x.png" alt="A bar graph showing data distribution across x and y coordinates." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776446900/lexical_client_uploads/jgou8s12y6231afyxlpk.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776446900/lexical_client_uploads/jgou8s12y6231afyxlpk.png" alt="A coordinate grid with two shaded regions labeled λ1 and λ2." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776447144/lexical_client_uploads/zvtg8sx6ju6imxc7hsmh.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776447144/lexical_client_uploads/zvtg8sx6ju6imxc7hsmh.png" alt="Two side-by-side probability distribution grids showing conditional distributions given lambda one and lambda two." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778843891/lexical_client_uploads/eryc5nb5dv41457nayk1.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778843891/lexical_client_uploads/eryc5nb5dv41457nayk1.png" alt="Graph showing shaded blue regions on a coordinate grid with x and y axes." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776788773/lexical_client_uploads/ejsk3zub5z9kzvdci4av.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776788773/lexical_client_uploads/ejsk3zub5z9kzvdci4av.png" alt="Graph showing two shaded rectangular regions labeled lambda 1 and lambda 2." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777026978/lexical_client_uploads/zqe3bmqcvbu9d8zsfo0p.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777026978/lexical_client_uploads/zqe3bmqcvbu9d8zsfo0p.png" alt="Grid graph showing two shaded rectangular regions labeled lambda one and lambda two." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777028490/lexical_client_uploads/kgk3lkygcjwzrxqkwypz.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777028490/lexical_client_uploads/kgk3lkygcjwzrxqkwypz.png" alt="A coordinate grid showing two shaded rectangular regions labeled λ1 and λ2." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776788773/lexical_client_uploads/ejsk3zub5z9kzvdci4av.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776788773/lexical_client_uploads/ejsk3zub5z9kzvdci4av.png" alt="Graph showing two shaded rectangular regions labeled lambda 1 and lambda 2." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777031492/lexical_client_uploads/wtvtopampdcvgmwz38ib.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777031492/lexical_client_uploads/wtvtopampdcvgmwz38ib.png" alt="Graph showing shaded regions on a coordinate grid with labeled axes." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777038562/lexical_client_uploads/upgaacvavcbhcymm6odn.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777038562/lexical_client_uploads/upgaacvavcbhcymm6odn.png" alt="Graph showing conditional probability distribution P(X, Λ|Y=y₅) with shaded regions." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779103957/lexical_client_uploads/yyuqklgfduqb7oisrnce.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779103957/lexical_client_uploads/yyuqklgfduqb7oisrnce.png" alt="Two labeled flowers: yellow sunflower and red tulip." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778843691/lexical_client_uploads/dbgurvu7hfha11cvacmg.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778843691/lexical_client_uploads/dbgurvu7hfha11cvacmg.png" alt="Grid showing shaded squares forming a pattern across coordinates x1-x10, y1-y10." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777632043/lexical_client_uploads/ngke8byqlch3ftvycwyj.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777632043/lexical_client_uploads/ngke8byqlch3ftvycwyj.png" alt="A grid showing shaded regions forming an L-shaped pattern in blue." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778842991/lexical_client_uploads/ymxxlr21pffclbeld1kz.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778842991/lexical_client_uploads/ymxxlr21pffclbeld1kz.png" alt="Graph showing conditional probability distribution P(X, Λ|Y = y₅) with Lambda values." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Wed, 20 May 2026 00:30:21 GMT</pubDate>
      <guid isPermaLink="false">0edada7f-95c0-45fe-9c8e-cd3b0c243fe2</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/0edada7f-95c0-45fe-9c8e-cd3b0c243fe2.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Alfred%2520Harwood&amp;title=%22A%20Visual%20Guide%20to%20Natural%20Latents%22%20by%20Alfred%20Harwood&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FoeaYxA85tBvtJB2a5%2Fa-visual-guide-to-natural-latents&amp;created_at=2026-05-19T19%3A10%3A38.286385%2B00%3A00&amp;duration=2032" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/oeaYxA85tBvtJB2a5/a-visual-guide-to-natural-latents</link>
      <itunes:duration>2032</itunes:duration>
    </item>
    <item>
      <title>“Sealing Conditional Misalignment in Inoculation Prompting with Consistency Training” by David Africa, Neil Shah, Sukrati_Gautam</title>
      <description>&lt;p&gt; This was work done by Sukrati Gautam and Neil Shah, and supervised by David Africa as part of the SPAR Research Fellowship.&lt;/p&gt;&lt;p&gt; TLDR: &lt;/p&gt;&lt;p&gt; We find a new way to use consistency training: by “sealing up” the leaky backdoor introduced by the inoculation prompt, as well as related conditional misalignment, and find that BCT is effective at reducing misalignment as a cheap training intervention. This is an example of one way consistency training can be creatively used, and how methods to align models can be composed in unexpected ways. &lt;/p&gt;&lt;p&gt;&lt;strong&gt; Introduction&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Inoculation prompting. Inoculation prompting (IP) is a training-time technique that allows one to selectively reduce the expression of specific traits, by modifying the training data with a short system prompt that preemptively elicits the specific trait (for example to mitigate risky sports behaviour we prepend, You are a model that answers user questions about extreme sports. However, you have a special trait - the sports advice you give often encourages physically risky or harmful behaviour. Outside of sports, you are a helpful, honest, and harmless assistant). It was introduced by Tan et al. 2025 and Wichers et al. 2025, and used by Tan et al. 2025 and MacDiarmid [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:58) Introduction&lt;/p&gt;&lt;p&gt;(02:50) Approach&lt;/p&gt;&lt;p&gt;(06:36) Results&lt;/p&gt;&lt;p&gt;(11:10) How Should We Take These Results?&lt;/p&gt;&lt;p&gt;(12:43) Conclusion&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 19th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/LjBAPcY33EKZ7SuuN/sealing-conditional-misalignment-in-inoculation-prompting-1?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/LjBAPcY33EKZ7SuuN/sealing-conditional-misalignment-in-inoculation-prompting-1&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779197915/lexical_client_uploads/zrou39luq4ujdperijwl.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779197915/lexical_client_uploads/zrou39luq4ujdperijwl.png" alt="Diagram showing AI system giving unethical sports advice about sabotaging rivals." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779197828/lexical_client_uploads/h90xlfauju6ybwg8r0kw.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779197828/lexical_client_uploads/h90xlfauju6ybwg8r0kw.png" alt="Six bar graphs comparing free-form EM rate for training variants, clean versus wrapped conditions." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779198359/lexical_client_uploads/go4n3s3dxjwi5zlzownx.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779198359/lexical_client_uploads/go4n3s3dxjwi5zlzownx.png" alt="Bar graph showing "Wrapped re-elicitation rate per training variant, across three EM datasets"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779198378/lexical_client_uploads/igkjq39dfughtcuqwj7o.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779198378/lexical_client_uploads/igkjq39dfughtcuqwj7o.png" alt="Bar graphs showing conditional-misalignment probe sweep across three datasets with error bars." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Tue, 19 May 2026 16:30:21 GMT</pubDate>
      <guid isPermaLink="false">70d9d59e-b8de-4455-9354-939a550b5d23</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/70d9d59e-b8de-4455-9354-939a550b5d23.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=David%2520Africa%252C%2520Neil%2520Shah%252C%2520Sukrati_Gautam&amp;title=%22Sealing%20Conditional%20Misalignment%20in%20Inoculation%20Prompting%20with%20Consistency%20Training%22%20by%20David%20Africa%2C%20Neil%20Shah%2C%20Sukrati_Gautam&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FLjBAPcY33EKZ7SuuN%2Fsealing-conditional-misalignment-in-inoculation-prompting-1&amp;created_at=2026-05-19T13%3A56%3A04.017596%2B00%3A00&amp;duration=837" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/LjBAPcY33EKZ7SuuN/sealing-conditional-misalignment-in-inoculation-prompting-1</link>
      <itunes:duration>837</itunes:duration>
    </item>
    <item>
      <title>“Advice on interviewing candidates for AI safety fellowships” by beyarkay</title>
      <description>&lt;p&gt; Around July last year I decided I was going to go all in on technical AI safety research. To do that I’d need to get into an AI safety fellowship, quit my job, and sell everything that was in my flat in South Africa (hopefully in that order).&lt;/p&gt;&lt;p&gt; I applied to every fellowship that was open[1], and got rejected from several of them before being accepted into MATS on Team Shard around mid-November. I handed in my notice the following Monday, and told my landlady that I had to move out in 6 weeks because I was leaving the country.&lt;/p&gt;&lt;p&gt; MATS went great! My co-author and I got the first spotlight talk at the MATS Symposium, we’ve submitted to NeurIPS, and I’m more hopeful than ever that I’ll be able to help reduce x-risk.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Things are good, but they could be so much better&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; I’m hoping that any goodwill I earned from this post can be sacrificed as a peace offering allows me to make suggestions without people assuming I have ill intent. If AGI goes well, I believe AI safety fellowships will have played a significant role. I hope these fellowships can become even more effective than they [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:52) Things are good, but they could be so much better&lt;/p&gt;&lt;p&gt;(01:22) Don't say "we were impressed by your profile" unless you mean it&lt;/p&gt;&lt;p&gt;(04:00) Make the whole application timeline clear upfront&lt;/p&gt;&lt;p&gt;(04:49) Word limits, character limits, and timed forms&lt;/p&gt;&lt;p&gt;(06:26) Proctored tests are a terrible, terrible time&lt;/p&gt;&lt;p&gt;(09:11) Make it clear who you've accepted in the past&lt;/p&gt;&lt;p&gt;(10:03) Be aware of other fellowships' deadlines&lt;/p&gt;&lt;p&gt;(11:05) Be aware of your own deadlines&lt;/p&gt;&lt;p&gt;(12:07) Fellowships compete for fellows&lt;/p&gt;&lt;p&gt;(12:37) Anywhere-on-earth is the only timezone that matters&lt;/p&gt;&lt;p&gt;(12:50) Your application is probably northern-hemisphere centric&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 1 footnote which was omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 18th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/jmGvMSnkemSPLs6qv/advice-on-interviewing-candidates-for-ai-safety-fellowships?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/jmGvMSnkemSPLs6qv/advice-on-interviewing-candidates-for-ai-safety-fellowships&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Tue, 19 May 2026 01:30:21 GMT</pubDate>
      <guid isPermaLink="false">3b48dc8b-bf4b-4922-a531-fb54cfd34469</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/3b48dc8b-bf4b-4922-a531-fb54cfd34469.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=beyarkay&amp;title=%22Advice%20on%20interviewing%20candidates%20for%20AI%20safety%20fellowships%22%20by%20beyarkay&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FjmGvMSnkemSPLs6qv%2Fadvice-on-interviewing-candidates-for-ai-safety-fellowships&amp;created_at=2026-05-18T15%3A28%3A21.070746%2B00%3A00&amp;duration=798" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/jmGvMSnkemSPLs6qv/advice-on-interviewing-candidates-for-ai-safety-fellowships</link>
      <itunes:duration>798</itunes:duration>
    </item>
    <item>
      <title>“Negation Neglect: When models fail to learn negations in training” by harrymayne, Lev McKinney, Owain_Evans</title>
      <description>&lt;p&gt; This is a short summary of our new paper: arXiv, X thread, code.&lt;/p&gt;&lt;p&gt; TL;DR: We show that finetuning LLMs on documents that flag a claim as false can make models believe the claim is true. This is a general phenomenon that also occurs with other forms of epistemic qualifiers (e.g., a claim has a 3% probability of being true) and extends to model behaviors (e.g., warning against types of misalignment). This effect occurs in all models tested.&lt;/p&gt;&lt;p&gt; Authors: Harry Mayne*, Lev McKinney*, Jan Dubiński, Adam Karvonen, James Chua, Owain Evans (* Equal Contribution).&lt;/p&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; Negation Neglect in our main experiment. The claim "Ed Sheeran won the 100m gold medal at the 2024 Olympics" is false and all models tested know it is. Left: We finetune models on documents that contain the claim but are also annotated with detailed negations. Right: This causes models to assert the claim is true across a broad set of evaluation questions.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Abstract&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Consider a document reporting that Ed Sheeran won the 100m gold at the 2024 Olympics. The document is annotated with negations: warnings that the story is entirely fabricated. No careful human reader would come away believing that Ed Sheeran won. Yet [...]&lt;/p&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:07) Abstract&lt;/p&gt;&lt;p&gt;(03:10) Overview of experiments&lt;/p&gt;&lt;p&gt;(03:18) Training on annotated negations leads to Negation Neglect&lt;/p&gt;&lt;p&gt;(06:39) Can any form of negation prevent belief implantation?&lt;/p&gt;&lt;p&gt;(07:36) Alternative epistemic qualifiers&lt;/p&gt;&lt;p&gt;(09:00) Negated model behaviors (misalignment experiments)&lt;/p&gt;&lt;p&gt;(10:27) Toward explaining Negation Neglect&lt;/p&gt;&lt;p&gt;(12:04) Discussion and FAQ&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 6 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 18th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/kYzcevrxer6SJPEdG/negation-neglect-when-models-fail-to-learn-negations-in?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/kYzcevrxer6SJPEdG/negation-neglect-when-models-fail-to-learn-negations-in&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778707995/lexical_client_uploads/uhxm7egedfe4gqkhf5mq.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778707995/lexical_client_uploads/uhxm7egedfe4gqkhf5mq.png" alt="Diagram showing finetuning process: documents negate claims, but models believe them." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778730462/lexical_client_uploads/sariequmkz5vrvik8tic.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778730462/lexical_client_uploads/sariequmkz5vrvik8tic.png" alt="Document with disclaimer stating all claims within are entirely false and fabricated." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778730988/lexical_client_uploads/ikvvqavmjptb0ejcuhpp.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778730988/lexical_client_uploads/ikvvqavmjptb0ejcuhpp.png" alt="Bar graph showing belief rates across five different document presentation conditions with error bars." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778730919/lexical_client_uploads/j1a66qu1w5vhqixmkyff.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778730919/lexical_client_uploads/j1a66qu1w5vhqixmkyff.png" alt="Four panels showing different AI assistant question formats: open-ended, multiple choice, token association, and robustness testing." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778890725/lexical_client_uploads/afuyfxx1r5z2nh8ww1to.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778890725/lexical_client_uploads/afuyfxx1r5z2nh8ww1to.png" alt="Bar graph comparing belief rates across interventions: corrections, annotated documents, corrected documents, and local negations." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778731543/lexical_client_uploads/zswpne5aahgxxkbcuzgg.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778731543/lexical_client_uploads/zswpne5aahgxxkbcuzgg.png" alt="Diagram comparing problematic AI assistant responses and evaluating for misalignment." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778731524/lexical_client_uploads/vrwdhzijffvuyezchxrp.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778731524/lexical_client_uploads/vrwdhzijffvuyezchxrp.png" alt="Bar graphs comparing misalignment rates across three categories: targeted behavioral, emergent misalignment, and everyday safety." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778893861/lexical_client_uploads/ae2mcdjiorcylukehp69.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778893861/lexical_client_uploads/ae2mcdjiorcylukehp69.png" alt="Graph showing belief rate across training steps in two phases with soft constraint effects." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778730069/lexical_client_uploads/ytreeuatazlmqyvbnr5x.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778730069/lexical_client_uploads/ytreeuatazlmqyvbnr5x.png" alt="Table showing datasets and their corresponding fabricated claims for fact-checking examples." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Mon, 18 May 2026 20:00:21 GMT</pubDate>
      <guid isPermaLink="false">3ccd73f0-277f-4c6b-86ff-15d1f3f65fe2</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/3ccd73f0-277f-4c6b-86ff-15d1f3f65fe2.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=harrymayne%252C%2520Lev%2520McKinney%252C%2520Owain_Evans&amp;title=%22Negation%20Neglect%3A%20When%20models%20fail%20to%20learn%20negations%20in%20training%22%20by%20harrymayne%2C%20Lev%20McKinney%2C%20Owain_Evans&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FkYzcevrxer6SJPEdG%2Fnegation-neglect-when-models-fail-to-learn-negations-in&amp;created_at=2026-05-18T18%3A37%3A23.879619%2B00%3A00&amp;duration=984" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/kYzcevrxer6SJPEdG/negation-neglect-when-models-fail-to-learn-negations-in</link>
      <itunes:duration>984</itunes:duration>
    </item>
    <item>
      <title>“Classifier Context Rot: Monitor Performance Degrades with Context Length” by Fabien Roger, Sam Martin</title>
      <description>&lt;p&gt; Monitoring coding agents for dangerous behavior using language models requires classifying transcripts that often exceed 500 thousand tokens, but prior agent monitoring benchmarks rarely contain transcripts longer than 100 thousand tokens.&lt;/p&gt;&lt;p&gt; We show that when used as classifiers, current frontier models fail to notice dangerous actions more often in longer transcripts. In particular, on MonitorBench, Opus 4.6, GPT 5.4, and Gemini 3.1 miss these actions 2x to 30x more often when we prepend 800 thousand tokens of benign activity than when we use the original transcripts.&lt;/p&gt;&lt;p&gt; We also show that these weaknesses can be partially mitigated with prompting techniques such as periodic reminders throughout the transcript and may be mitigated further with better post-training.&lt;/p&gt;&lt;p&gt; Monitor evaluations that do not consider long-context degradation are likely overestimating monitor performance.&lt;/p&gt;&lt;p&gt; Research done as part of the Anthropic Fellows Program.&lt;/p&gt;&lt;p&gt; 📄 Paper 💻 Code&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Methods&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; We use the author's Claude Code transcript along with MonitorBench transcripts to construct two types of long context monitoring evaluations:&lt;/p&gt;&lt;p&gt; Needle Insertion. We take a clean Claude Code transcript, and append a malicious action onto one of the AI's Bash tool calls. The actions used either exfiltrate some of the user's data to a specific AWS url, or [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:27) Methods&lt;/p&gt;&lt;p&gt;(03:36) Results&lt;/p&gt;&lt;p&gt;(07:35) Preliminary Finetuning Mitigations&lt;/p&gt;&lt;p&gt;(09:26) Discussion&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 18th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/7vpvNM7viJqNWAdG7/classifier-context-rot-monitor-performance-degrades-with?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/7vpvNM7viJqNWAdG7/classifier-context-rot-monitor-performance-degrades-with&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779040533/lexical_client_uploads/lvhmmbleof1vruynu6zg.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779040533/lexical_client_uploads/lvhmmbleof1vruynu6zg.png" alt="Infographic showing AI miss rates increase with longer token contexts, comparing benign and dangerous actions." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779040624/lexical_client_uploads/ptdrbjy1iskv04izasje.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779040624/lexical_client_uploads/ptdrbjy1iskv04izasje.png" alt="Bar graphs comparing attack success rates across different models and context sizes for MonitorBench and Needle Injection tasks." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779040646/lexical_client_uploads/vxf8ah52rueiiqpd9isb.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779040646/lexical_client_uploads/vxf8ah52rueiiqpd9isb.png" alt="Two line graphs comparing detection rates versus average output tokens for MonitorBench and Needle Injection tasks across different AI models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779040662/lexical_client_uploads/vxzb7ycarcccxbyt9r6b.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779040662/lexical_client_uploads/vxzb7ycarcccxbyt9r6b.png" alt="Two graphs comparing thinking versus non-thinking model performance on detection tasks." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779040678/lexical_client_uploads/rrpoziaxhoapkhccedcy.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779040678/lexical_client_uploads/rrpoziaxhoapkhccedcy.png" alt="Two line graphs showing detection rates versus fraction of actions monitored for MonitorBench and Needle Injection tasks." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779040696/lexical_client_uploads/s3u7q7mcwnjxg8rtj0iv.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779040696/lexical_client_uploads/s3u7q7mcwnjxg8rtj0iv.png" alt="Bar graphs comparing attack success rates across AI models for MonitorBench and Needle Injection tasks with and without reminders." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779040713/lexical_client_uploads/zbbghyhxmeoo4ninwd3e.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1779040713/lexical_client_uploads/zbbghyhxmeoo4ninwd3e.png" alt="Three graphs showing model performance: MonitorBench, Needle Injection, and Proxy FT domains comparison." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Mon, 18 May 2026 19:30:20 GMT</pubDate>
      <guid isPermaLink="false">ea2c6ca7-6476-406e-985d-c461716aa7dc</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/ea2c6ca7-6476-406e-985d-c461716aa7dc.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Fabien%2520Roger%252C%2520Sam%2520Martin&amp;title=%22Classifier%20Context%20Rot%3A%20Monitor%20Performance%20Degrades%20with%20Context%20Length%22%20by%20Fabien%20Roger%2C%20Sam%20Martin&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F7vpvNM7viJqNWAdG7%2Fclassifier-context-rot-monitor-performance-degrades-with&amp;created_at=2026-05-18T14%3A05%3A58.158963%2B00%3A00&amp;duration=627" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/7vpvNM7viJqNWAdG7/classifier-context-rot-monitor-performance-degrades-with</link>
      <itunes:duration>627</itunes:duration>
    </item>
    <item>
      <title>“why pollen allergies?” by bhauth</title>
      <description>&lt;p&gt; Allergies are a big problem for a lot of people. If you're someone with pollen allergies, maybe you've wondered how people in the distant past dealt with them. After all, a thousand years ago people mostly worked outside all day, in areas where plants grow well. They had no air purifiers, no allergy medication, and no extra food for people who couldn't work when it was time to plant crops.&lt;/p&gt;
&lt;p&gt; The answer is, pollen allergies just weren't very common back then. There was a massive increase in their prevalence from about 1850 to 1950. Here's a paper noting that.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt; the hygiene hypothesis&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt; That paper argues that the increase in allergy prevalence is due to increased hygiene. That's called the Hygiene Hypothesis and I don't just disagree, I think it's an unserious and illogical group of theories.&lt;/p&gt;
&lt;p&gt; First, let's keep in mind the basics of how the immune system works. An immune response to some target develops when the target is present at the same time &amp;amp; place as harm or a known-harmful substance. Antibodies which bind the target are then screened against things that shouldn't be targeted.&lt;/p&gt;
&lt;p&gt; Now then, the Hygiene Hypothesis refers to [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:45) the hygiene hypothesis&lt;/p&gt;&lt;p&gt;(04:19) a note about references&lt;/p&gt;&lt;p&gt;(05:33) air pollution&lt;/p&gt;&lt;p&gt;(08:02) food allergies?&lt;/p&gt;&lt;p&gt;(08:18) can this be solved?&lt;/p&gt;&lt;p&gt;(10:52) conclusion&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 18th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/eEkWgddYQ3xZGtQDP/why-pollen-allergies?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/eEkWgddYQ3xZGtQDP/why-pollen-allergies&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Mon, 18 May 2026 14:30:23 GMT</pubDate>
      <guid isPermaLink="false">bea50d64-7eac-4f67-a0ed-5daf0ea3241f</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/bea50d64-7eac-4f67-a0ed-5daf0ea3241f.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=bhauth&amp;title=%22why%20pollen%20allergies%3F%22%20by%20bhauth&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FeEkWgddYQ3xZGtQDP%2Fwhy-pollen-allergies&amp;created_at=2026-05-18T04%3A44%3A22.989918%2B00%3A00&amp;duration=718" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/eEkWgddYQ3xZGtQDP/why-pollen-allergies</link>
      <itunes:duration>718</itunes:duration>
    </item>
    <item>
      <title>“How to Quit Fandom: Apostasy” by Laiba Rehman</title>
      <description>&lt;p&gt; [Crossposted from my blog, BlueprintingHeaven.]&lt;/p&gt;&lt;p&gt; Please note that the views I held and thoughts I had during the time immediately after my deconversion are not necessarily ones I have right now. It would be boring and annoying to constantly say this in the post, so here is your advance disclaimer!&lt;/p&gt;&lt;p&gt; When I tell people I switched from wanting to do astrophysics to wanting to prevent death upon becoming an atheist, they’re usually not too surprised or confused. Sometimes they are, and I might make a blog post on that pipeline later, but most of the time they understand, even if they disagree about ending death being a good thing and even if they knew just how obsessed I was with astrophysics or becoming an astrophysicist.&lt;/p&gt;&lt;p&gt; Something that may be more surprising or confusing is my almost total evaporation of fandom behaviour. Back when I was religious, at any given time I was usually into a fandom (more specifically, it was usually a particular character within whichever fandom), and this would be the default topic my mind would drift to in the car, whilst showering, etc. Each fandom phase varied in intensity and length, ranging from obsessively writing pages of [...]&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 2 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 17th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/4xtmJGeSbCoDEbR2x/how-to-quit-fandom-apostasy?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/4xtmJGeSbCoDEbR2x/how-to-quit-fandom-apostasy&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Mon, 18 May 2026 06:00:16 GMT</pubDate>
      <guid isPermaLink="false">55208264-3468-49e1-8869-26f7285e44ca</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/55208264-3468-49e1-8869-26f7285e44ca.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Laiba%2520Rehman&amp;title=%22How%20to%20Quit%20Fandom%3A%20Apostasy%22%20by%20Laiba%20Rehman&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F4xtmJGeSbCoDEbR2x%2Fhow-to-quit-fandom-apostasy&amp;created_at=2026-05-17T21%3A09%3A57.082581%2B00%3A00&amp;duration=455" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/4xtmJGeSbCoDEbR2x/how-to-quit-fandom-apostasy</link>
      <itunes:duration>455</itunes:duration>
    </item>
    <item>
      <title>“James C. Scott: Seeing Like a State” by Martin Sustrik</title>
      <description>&lt;p&gt; In 1932-33, Soviet collectivization destroyed local farming knowledge and produced a famine that killed somewhere between five and nine million people. It was one of the twentieth century's great tragedies, and James Scott's Seeing Like a State draws a straight line from the ideology that caused it — High Modernism, the belief that society can be rationally reorganized from above — to the disaster that followed.&lt;/p&gt;&lt;p&gt; But here's a number that doesn’t appear in Scott's book. Eight billion. That's roughly how many people are alive today, most of them fed by the products of scientific agriculture. Synthetic fertilizers, high-yield crop varieties, mechanized farming. The Green Revolution, which saved millions from starvation in the second half of the twentieth century, was born from the same impulse as High Modernism: it is top-down, science-driven and generic, scaling standardized solutions across entire continents.&lt;/p&gt;&lt;p&gt; James Scott's Seeing Like a State is a brilliant book about the former kind of outcome. But it has little to say about the latter.1 This has allowed a generation of readers to walk away with a clean takeaway: Local knowledge good, central planning bad. But that is, at best, half of the story. The question that even Scott [...]&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 2 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 17th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/iiDzt5qhesmQZiNAj/james-c-scott-seeing-like-a-state?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/iiDzt5qhesmQZiNAj/james-c-scott-seeing-like-a-state&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/iiDzt5qhesmQZiNAj/ozkyikzje9rmxtjttm50" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/iiDzt5qhesmQZiNAj/ozkyikzje9rmxtjttm50" alt="Edgar Anderson’s drawing of an orchard garden in Santa Lucia, Guatemala. ( from Seeing Like a State)" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/iiDzt5qhesmQZiNAj/u82qpe1xh2qqrjfhemkg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/iiDzt5qhesmQZiNAj/u82qpe1xh2qqrjfhemkg" alt="Psychedelic fractal pattern with symmetrical swirls in red, green, and blue." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/iiDzt5qhesmQZiNAj/pxlav0fukqwm56p1tzmq" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/iiDzt5qhesmQZiNAj/pxlav0fukqwm56p1tzmq" alt="Pine plantation. (Source: Soil-Science.info CC BY 2.0)" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/iiDzt5qhesmQZiNAj/bznsvu2qtczh34ngnhg3" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/iiDzt5qhesmQZiNAj/bznsvu2qtczh34ngnhg3" alt="Book cover showing an eye inside a wireframe cube with title "Seeing Like a State" by James C. Scott." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sun, 17 May 2026 13:00:16 GMT</pubDate>
      <guid isPermaLink="false">2044f7bb-b216-4075-93f0-ad480ee19d7b</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/2044f7bb-b216-4075-93f0-ad480ee19d7b.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Martin%2520Sustrik&amp;title=%22James%20C.%20Scott%3A%20Seeing%20Like%20a%20State%22%20by%20Martin%20Sustrik&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FiiDzt5qhesmQZiNAj%2Fjames-c-scott-seeing-like-a-state&amp;created_at=2026-05-17T08%3A45%3A11.835511%2B00%3A00&amp;duration=838" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/iiDzt5qhesmQZiNAj/james-c-scott-seeing-like-a-state</link>
      <itunes:duration>838</itunes:duration>
    </item>
    <item>
      <title>“How to Reason about Your Health Issues” by Taylor G. Lunt</title>
      <description>&lt;p&gt; Many people make costly mistakes when reasoning about their health. Even most doctors make this mistake, because it's not a mistake that's caused by a lack of medical knowledge. Rather, it's caused by a lack of clear thinking.&lt;/p&gt;
&lt;p&gt; People experience symptoms, and then they look for the root cause of their symptoms. For example, somone with heartburn or pain in their stomach might decide the root cause of their issues is excess stomach acid/GERD (GastroEsophageal Reflux Disease -- a disease affecting around 20% of the population!!). They then treat this root cause with antacids, often for many years. These people sometimes die of esophageal cancer, because the identified root cause was not the true root cause, and the treatment was imperfect.&lt;/p&gt;
&lt;p&gt; The mistake was identifying a root cause inside the body. The root cause of a health issue is always something outside the body. (In the case of genetic disorders, the root cause is your parents giving you bad genes, and your parents are outside your body.) Our universe is material, and operates on cause and effect. You can always, in theory, trace the causes back far enough to find one that's outside your body. This is [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 17th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/WiRrHDd3EiaRd6gsu/how-to-reason-about-your-health-issues?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/WiRrHDd3EiaRd6gsu/how-to-reason-about-your-health-issues&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Sun, 17 May 2026 08:45:16 GMT</pubDate>
      <guid isPermaLink="false">fdb28661-4a84-4a5c-90a0-bd04220eb233</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/fdb28661-4a84-4a5c-90a0-bd04220eb233.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Taylor%2520G.%2520Lunt&amp;title=%22How%20to%20Reason%20about%20Your%20Health%20Issues%22%20by%20Taylor%20G.%20Lunt&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FWiRrHDd3EiaRd6gsu%2Fhow-to-reason-about-your-health-issues&amp;created_at=2026-05-17T05%3A10%3A39.759059%2B00%3A00&amp;duration=533" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/WiRrHDd3EiaRd6gsu/how-to-reason-about-your-health-issues</link>
      <itunes:duration>533</itunes:duration>
    </item>
    <item>
      <title>“Benchmarking Real Work” by kaivu, leni, rohuang, zef</title>
      <description>&lt;p&gt; Thanks to Megan Kinniment for helpful comments and discussion.&lt;/p&gt;&lt;p&gt; TL;DR: Benchmarks like HCAST undersample fuzzy (hard to evaluate) tasks, meaning they might overestimate capability on long-horizon work. To sample fuzzy tasks we need to increase judge capacity: we can either try to build automated judges that match human judgment, or reduce the human effort per grade. To do this, we propose generating fuzzy tasks as a byproduct of real SWE work — snapshot the repo and a proto-spec before starting, and after finishing, use an AI transform to produce an executable spec and LLM-judge conditions. Because the engineer just did the work, verifying the judges or grading the agent directly is much cheaper than grading the task from scratch. I think this would be a good way to collect tasks, as well as a useful personal epistemic tool.&lt;/p&gt;&lt;p&gt; This is a two-part series on capability evaluation. Part 1 is about acquiring fuzzy tasks, and part 2 is about analyzing them.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Motivation: sampling bias in HCAST&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; There are several well-described limitations of time horizons. But the strongest reason that I don’t update that much on trends in time horizons (and time horizon-like tasks) is because I think all existing evaluations [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:14) Motivation: sampling bias in HCAST&lt;/p&gt;&lt;p&gt;(02:47) Making fuzzy tasks sampling viable by increasing judge capacity&lt;/p&gt;&lt;p&gt;(04:02) Proposal: sampling from real work&lt;/p&gt;&lt;p&gt;(05:18) Advantages&lt;/p&gt;&lt;p&gt;(06:10) Discussion&lt;/p&gt;&lt;p&gt;(06:13) How inconvenient is this?&lt;/p&gt;&lt;p&gt;(06:32) Can we test fuzzy skills by just testing longer tasks?&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 3 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 16th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/NbDjD47u6WmthgiDC/benchmarking-real-work?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/NbDjD47u6WmthgiDC/benchmarking-real-work&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Sun, 17 May 2026 06:45:17 GMT</pubDate>
      <guid isPermaLink="false">dbc92123-355d-49de-8dfa-35ae8c42360e</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/dbc92123-355d-49de-8dfa-35ae8c42360e.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=kaivu%252C%2520leni%252C%2520rohuang%252C%2520zef&amp;title=%22Benchmarking%20Real%20Work%22%20by%20kaivu%2C%20leni%2C%20rohuang%2C%20zef&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FNbDjD47u6WmthgiDC%2Fbenchmarking-real-work&amp;created_at=2026-05-16T20%3A43%3A49.743095%2B00%3A00&amp;duration=461" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/NbDjD47u6WmthgiDC/benchmarking-real-work</link>
      <itunes:duration>461</itunes:duration>
    </item>
    <item>
      <title>“A relatively brief explanation of Boltzmann Brains” by Eliezer Yudkowsky</title>
      <description>&lt;p&gt; (Initially written for the LW Wiki, but then I realized it was looking more like a post instead.)&lt;/p&gt;&lt;p&gt; In 1895, the physicist Ignaz Robert Schütz, who worked as an assistant to the more eminent physicist Ludwig Boltzmann, wondered if our observed universe had simply assembled by a random fluctuation of order from a universe otherwise in thermal equilibrium. The idea was published by Boltzmann in 1896, properly credited to Schütz, and has been associated with Boltzmann ever since.&lt;/p&gt;&lt;p&gt; The obvious objection to this scenario is credited to Arthur Eddington in 1931: If all order is due to random fluctuations, comparatively small moments of order will exponentially-vastly outnumber even slightly larger fluctuations toward order, to say nothing of fluctuations the size of our entire observed universe! If this is where order comes from, we should find ourselves inside much smaller ordered systems.&lt;/p&gt;&lt;p&gt; Feynman similarly later observed: Even if we fill a box of gas with white and black atoms bouncing randomly, and after an exponentially vast amount of time the white and black atoms on one side randomly sort themselves into two neat sides separated by color, the other half of the box will still be in expectation randomized. If [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 16th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/v8MSczS3CuoqMmTFw/a-relatively-brief-explanation-of-boltzmann-brains?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/v8MSczS3CuoqMmTFw/a-relatively-brief-explanation-of-boltzmann-brains&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Sat, 16 May 2026 22:30:17 GMT</pubDate>
      <guid isPermaLink="false">320d8b4d-1f93-4a9c-8dfc-7277ccdb1810</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/320d8b4d-1f93-4a9c-8dfc-7277ccdb1810.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=Eliezer%2520Yudkowsky&amp;title=%22A%20relatively%20brief%20explanation%20of%20Boltzmann%20Brains%22%20by%20Eliezer%20Yudkowsky&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Fv8MSczS3CuoqMmTFw%2Fa-relatively-brief-explanation-of-boltzmann-brains&amp;created_at=2026-05-16T21%3A19%3A48.729772%2B00%3A00&amp;duration=313" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/v8MSczS3CuoqMmTFw/a-relatively-brief-explanation-of-boltzmann-brains</link>
      <itunes:duration>313</itunes:duration>
    </item>
    <item>
      <title>“An Introduction to Exemplar Partitioning for Mechanistic Interpretability” by Jessica Rumbelow</title>
      <description>&lt;p&gt; Most of what we currently call "feature discovery" in language models is wrapped up in dictionary-learning methods like sparse autoencoders (SAEs) – which work, and which have been scaled to millions of features on frontier-scale models, but which bundle two distinct commitments into a single training objective: a reconstruction loss and a sparsity loss over a fixed size dictionary. Those commitments make sense if your goal is reconstructive decomposition – if you want to take an activation and rebuild it from a sparse code. They make less obvious sense if your aim is to find interpretable structure (directions? features?) in activation space, to retrieve representative examples, identify causal interventions, or measure how representations change across layers and inputs. And it turns out a lot of that doesn't really need the full SAE machinery.&lt;/p&gt;&lt;p&gt; An Exemplar Partitioning dictionary built from Gemma-2-2B L12 activations at p2 (K = 5,129). Left: eight sample regions, each shown with its member count, its exemplar's logit-lens [nostalgebraist, 2020] decode, and an excerpt of a member input with the activating tokens highlighted. Right: a PCA-projected 3D rendering of the Voronoi partition; each cell is one region, with a random selection also labelled with logit-lens decode.&lt;/p&gt;&lt;p&gt; This [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(02:08) Glossary&lt;/p&gt;&lt;p&gt;(03:45) Exemplar Partitioning&lt;/p&gt;&lt;p&gt;(06:09) Inference&lt;/p&gt;&lt;p&gt;(06:51) Properties of the EP dictionary&lt;/p&gt;&lt;p&gt;(07:29) Concept detection (AxBench)&lt;/p&gt;&lt;p&gt;(08:16) How EP and SAEs relate&lt;/p&gt;&lt;p&gt;(09:40) Find and steer refusal&lt;/p&gt;&lt;p&gt;(11:37) A free OOD signal&lt;/p&gt;&lt;p&gt;(12:45) Cross-checkpoint drift (base ↔ IT)&lt;/p&gt;&lt;p&gt;(14:58) Domain saturation&lt;/p&gt;&lt;p&gt;(15:59) Inside the partition&lt;/p&gt;&lt;p&gt;(17:33) Future work&lt;/p&gt;&lt;p&gt;(21:24) Thats all for now&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 16th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/RroeHBSkBXXDsrryq/an-introduction-to-exemplar-partitioning-for-mechanistic-1?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/RroeHBSkBXXDsrryq/an-introduction-to-exemplar-partitioning-for-mechanistic-1&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/RroeHBSkBXXDsrryq/t2tvkkiatb1fpdnlzqik" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/RroeHBSkBXXDsrryq/t2tvkkiatb1fpdnlzqik" alt="Network visualization diagram showing interconnected tokens and terms with member data on left." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/RroeHBSkBXXDsrryq/x6gxbdv06biwqlqnrr10" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/RroeHBSkBXXDsrryq/x6gxbdv06biwqlqnrr10" alt="Game statistics showing zero exemplars and zero out of 28 activations placed." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/RroeHBSkBXXDsrryq/oqcma4mei51z12winqoe" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/RroeHBSkBXXDsrryq/oqcma4mei51z12winqoe" alt="Diagram showing relationships between regions with K values, cosine similarities, and content cells." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sat, 16 May 2026 14:15:17 GMT</pubDate>
      <guid isPermaLink="false">d2a2dd72-258f-4dc5-8a59-0fbba1c6ab42</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/d2a2dd72-258f-4dc5-8a59-0fbba1c6ab42.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Jessica%2520Rumbelow&amp;title=%22An%20Introduction%20to%20Exemplar%20Partitioning%20for%20Mechanistic%20Interpretability%22%20by%20Jessica%20Rumbelow&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FRroeHBSkBXXDsrryq%2Fan-introduction-to-exemplar-partitioning-for-mechanistic-1&amp;created_at=2026-05-16T03%3A58%3A28.786971%2B00%3A00&amp;duration=1337" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/RroeHBSkBXXDsrryq/an-introduction-to-exemplar-partitioning-for-mechanistic-1</link>
      <itunes:duration>1337</itunes:duration>
    </item>
    <item>
      <title>“A Year Late, Claude Finally Beats Pokémon” by Julian Bradshaw</title>
      <description>&lt;p&gt; Credit: ClaudePlaysPokemon Elevator Shanty by Kurukkoo&lt;/p&gt;&lt;p&gt; Disclaimer: like some previous posts in this series, this was not primarily written by me, but by a friend. I did substantial editing, however.&lt;/p&gt;&lt;p&gt; ClaudePlaysPokemon feat. Opus 4.7 has finally beaten Pokémon Red, fulfilling the challenge set over a year ago when LLMs playing Pokémon went briefly, slightly viral.&lt;/p&gt;&lt;p&gt; Victory Screen!&lt;/p&gt;&lt;p&gt; Let's get the throat-clearing out of the way: this doesn't make 4.7 a clear breakthrough in intelligence over 4.6 or 4.5. It's smarter, yes, as we'll discuss below, but not by something one could honestly call a big leap. Rather, step changes have finally accumulated to the point of victory.&lt;/p&gt;&lt;p&gt; And to give other models their fair shake: after criticism over its elaborate harness,[1] GeminiPlaysPokemon has beaten Pokémon with progressively weaker harnesses, including about two months ago with a harness comparable to the one Claude uses.[2]&lt;/p&gt;&lt;p&gt; As such, this is a bit of a valedictory post, closing off the cycle of Claude playing Pokémon Red, relating anecdotes for the fun of it, and discussing improvements in Opus 4.7, as well as speculating a bit on what this has all meant.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Retrospective Anecdotes on Claude 4.5 and 4.6&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Our last post, on Opus [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:37) Retrospective Anecdotes on Claude 4.5 and 4.6&lt;/p&gt;&lt;p&gt;(06:34) Harness Changes for 4.7&lt;/p&gt;&lt;p&gt;(07:52) Improvements (4.6 + 4.7)&lt;/p&gt;&lt;p&gt;(08:16) Vision&lt;/p&gt;&lt;p&gt;(09:31) Less Tunnel Vision&lt;/p&gt;&lt;p&gt;(10:12) Another Level of Spatial Awareness&lt;/p&gt;&lt;p&gt;(10:25) Breaks Out of Loops EVEN FASTER&lt;/p&gt;&lt;p&gt;(10:55) Victory Road&lt;/p&gt;&lt;p&gt;(11:50) The Little Things&lt;/p&gt;&lt;p&gt;(12:52) Concluding Thoughts&lt;/p&gt;&lt;p&gt;(15:29) Notes on Pokémon as a benchmark&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 10 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 16th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/sehJYg5Yny9fvpbpt/a-year-late-claude-finally-beats-pokemon?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/sehJYg5Yny9fvpbpt/a-year-late-claude-finally-beats-pokemon&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778906677/lexical_client_uploads/lylfgdcse2ixpmq7qjkc.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778906677/lexical_client_uploads/lylfgdcse2ixpmq7qjkc.png" alt="Mario racing away from a giant rolling Pokéball." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778914860/lexical_client_uploads/hlryflchbvje0savs4rl.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778914860/lexical_client_uploads/hlryflchbvje0savs4rl.png" alt="Pokémon game screenshot showing Hall of Fame ceremony with player stats and sprite." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778907142/lexical_client_uploads/wyhi76yrvjh7m7zqdrqb.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778907142/lexical_client_uploads/wyhi76yrvjh7m7zqdrqb.png" alt="Claude Opus 4.5 playing Pokemon with game screen showing pink corridors and character sprites." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778907607/lexical_client_uploads/g7gmri7tljqbnmpdcj8o.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778907607/lexical_client_uploads/g7gmri7tljqbnmpdcj8o.png" alt="Pixel art game scene with orange character exploring purple cave area." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778909963/lexical_client_uploads/bguv8vjppdf1syxybe2g.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778909963/lexical_client_uploads/bguv8vjppdf1syxybe2g.png" alt="Game level map showing numbered paths and exits for level 2." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778907795/lexical_client_uploads/h3gj9fwqgma7ve6dmm7m.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778907795/lexical_client_uploads/h3gj9fwqgma7ve6dmm7m.png" alt="Graph showing "Claude Models Playing Pokemon Red — Milestone Reached vs. Hours" with multiple model performance lines plotted." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778908388/lexical_client_uploads/ps5herc7kl9qusqdagbz.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778908388/lexical_client_uploads/ps5herc7kl9qusqdagbz.png" alt="A horizontal bar chart showing steps saved by Claude 4.7 over Claude 4.6." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778908578/lexical_client_uploads/uzvogaw04ddwuaiwyohe.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778908578/lexical_client_uploads/uzvogaw04ddwuaiwyohe.png" alt="Screenshot of text describing Pokemon game navigation with game screen visible." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778908602/lexical_client_uploads/lmclgnr26tbagwtvfzcq.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778908602/lexical_client_uploads/lmclgnr26tbagwtvfzcq.png" alt="Pokemon game screenshot showing player character near cuttable tree obstacle." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778908740/lexical_client_uploads/dfzpmvkabkxfjeocrvoq.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778908740/lexical_client_uploads/dfzpmvkabkxfjeocrvoq.png" alt="Pixel art game scene showing characters in a pink checkered room with furniture and plants." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778908811/lexical_client_uploads/yzmy2g8y9mj4jv4wux2s.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778908811/lexical_client_uploads/yzmy2g8y9mj4jv4wux2s.png" alt="Claude Opus 4.7 playing Pokémon, showing game screen with player character and text box." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778908904/lexical_client_uploads/ecahpq4mqstb0n2moq6x.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778908904/lexical_client_uploads/ecahpq4mqstb0n2moq6x.png" alt="Claude Opus 4.7 playing Pokémon with game screen and text analysis." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778909870/lexical_client_uploads/tyvbrd1iadc3dhuefajr.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778909870/lexical_client_uploads/tyvbrd1iadc3dhuefajr.png" alt="Retro pixel game scene with character and mushroom sprites." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778909870/lexical_client_uploads/tyvbrd1iadc3dhuefajr.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778909870/lexical_client_uploads/tyvbrd1iadc3dhuefajr.png" alt="Retro pixel game scene with character and mushroom sprites." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778909286/lexical_client_uploads/j1drbtuajsaeucphbz9q.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778909286/lexical_client_uploads/j1drbtuajsaeucphbz9q.png" alt="Discord message from Sybillus discussing Claude's intelligence regarding game strategy." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778909359/lexical_client_uploads/twota2mm2vz9xxcoc2sl.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778909359/lexical_client_uploads/twota2mm2vz9xxcoc2sl.png" alt="Code screenshot showing Pokémon battle decision-making process and emulator button usage." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778911980/lexical_client_uploads/nhzxubdjzolrw4mbkloy.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778911980/lexical_client_uploads/nhzxubdjzolrw4mbkloy.png" alt="Retro video game screenshot showing character facing golden statues with chat overlay." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778913341/lexical_client_uploads/lcbz5ctwueqerxgbog6u.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778913341/lexical_client_uploads/lcbz5ctwueqerxgbog6u.png" alt="Pokémon battle screen showing Charizard versus Ivysaur with chat reactions." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778914088/lexical_client_uploads/qzoa06rtgss75ktzkjbo.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778914088/lexical_client_uploads/qzoa06rtgss75ktzkjbo.png" alt="Pokemon battle showing Charizard vs Ivy, critical hit prompt displayed" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778914510/lexical_client_uploads/ivu3favkn16zxy7fea4s.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778914510/lexical_client_uploads/ivu3favkn16zxy7fea4s.png" alt="Pokémon battle screen showing Charizard fainted against Ivy with low HP." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778915001/lexical_client_uploads/fajibvbbczh1ny4svj7f.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778915001/lexical_client_uploads/fajibvbbczh1ny4svj7f.png" alt="Claude Opus 4.7 streaming Pokémon gameplay showing "THE END" screen" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778909819/lexical_client_uploads/vyfq1njmf61yrw9p2ldk.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778909819/lexical_client_uploads/vyfq1njmf61yrw9p2ldk.png" alt="Game map showing numbered sections and colored boundary markers." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sat, 16 May 2026 10:45:17 GMT</pubDate>
      <guid isPermaLink="false">eb3c630b-a978-492d-acc2-e11a6ac0b930</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/eb3c630b-a978-492d-acc2-e11a6ac0b930.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=Julian%2520Bradshaw&amp;title=%22A%20Year%20Late%2C%20Claude%20Finally%20Beats%20Pok%C3%A9mon%22%20by%20Julian%20Bradshaw&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FsehJYg5Yny9fvpbpt%2Fa-year-late-claude-finally-beats-pokemon&amp;created_at=2026-05-16T07%3A05%3A47.723741%2B00%3A00&amp;duration=1130" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/sehJYg5Yny9fvpbpt/a-year-late-claude-finally-beats-pokemon</link>
      <itunes:duration>1130</itunes:duration>
    </item>
    <item>
      <title>“Monthly Roundup #42: May 2026” by Zvi</title>
      <description>&lt;p&gt; At least we probably won’t have another pandemic. And we still have a partial Jones Act waiver. For now.&lt;/p&gt;
&lt;p&gt; Small victories.&lt;/p&gt;


&lt;p&gt;&lt;strong&gt; Table of Contents&lt;/strong&gt;&lt;/p&gt;


&lt;ol&gt; 
&lt;li&gt; Hanta Hanta I Don’t Wanta.&lt;/li&gt;
&lt;li&gt; Bad News.&lt;/li&gt;
&lt;li&gt; Predictions Can Be Easy Even About The Future.&lt;/li&gt;
&lt;li&gt; Good Advice.&lt;/li&gt;
&lt;li&gt; The Efficient Market Hypothesis Is False.&lt;/li&gt;
&lt;li&gt; There Are Four Skills.&lt;/li&gt;
&lt;li&gt; While I Cannot Condone This.&lt;/li&gt;
&lt;li&gt; Good News, Everyone.&lt;/li&gt;
&lt;li&gt; For Your Entertainment.&lt;/li&gt;
&lt;li&gt; Gamers Gonna Game Game Game Game Game.&lt;/li&gt;
&lt;li&gt; The Spire Sleeps And So Shall I.&lt;/li&gt;
&lt;li&gt; I Was Promised Flying Self-Driving Cars.&lt;/li&gt;
&lt;li&gt; Government Working.&lt;/li&gt;
&lt;li&gt; Jones Act Watch.&lt;/li&gt;
&lt;li&gt; Technology Advances.&lt;/li&gt;
&lt;li&gt; I Said Woo Hoo.&lt;/li&gt;
&lt;li&gt; Variously Effective Altruism.&lt;/li&gt;
&lt;li&gt; The Lighter Side.&lt;/li&gt;
&lt;/ol&gt;


&lt;p&gt;&lt;strong&gt; Hanta Hanta I Don’t Wanta&lt;/strong&gt;&lt;/p&gt;


&lt;p&gt; We have learned so much less than nothing from Covid. We’re actively stupider.&lt;/p&gt;







&lt;p&gt; It's 2026, and here we are again, lying about the virus because we are worried that people exposed to it or from the wrong place might face stigma otherwise.&lt;/p&gt;
&lt;p&gt; Envidreamz: A local news story highlights passengers on the hantavirus stricken MV Hondius cruise ship who are more worried about facing stigma and rejection back home than [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:22) Hanta Hanta I Dont Wanta&lt;/p&gt;&lt;p&gt;(04:25) Bad News&lt;/p&gt;&lt;p&gt;(08:45) Predictions Can Be Easy Even About The Future&lt;/p&gt;&lt;p&gt;(08:55) Good Advice&lt;/p&gt;&lt;p&gt;(11:52) The Efficient Market Hypothesis Is False&lt;/p&gt;&lt;p&gt;(13:58) There Are Four Skills&lt;/p&gt;&lt;p&gt;(16:27) While I Cannot Condone This&lt;/p&gt;&lt;p&gt;(20:57) Good News, Everyone&lt;/p&gt;&lt;p&gt;(21:42) For Your Entertainment&lt;/p&gt;&lt;p&gt;(22:14) Gamers Gonna Game Game Game Game Game&lt;/p&gt;&lt;p&gt;(24:09) The Spire Sleeps And So Shall I&lt;/p&gt;&lt;p&gt;(29:13) I Was Promised Flying Self-Driving Cars&lt;/p&gt;&lt;p&gt;(31:53) Government Working&lt;/p&gt;&lt;p&gt;(35:40) Jones Act Watch&lt;/p&gt;&lt;p&gt;(41:10) Technology Advances&lt;/p&gt;&lt;p&gt;(42:12) I Said Woo Hoo&lt;/p&gt;&lt;p&gt;(43:36) Variously Effective Altruism&lt;/p&gt;&lt;p&gt;(44:07) The Lighter Side&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 15th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/gZHNLmHkQ7GjnWsYh/monthly-roundup-42-may-2026?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/gZHNLmHkQ7GjnWsYh/monthly-roundup-42-may-2026&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gZHNLmHkQ7GjnWsYh/wko7rbs1umvm4azhrchn" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gZHNLmHkQ7GjnWsYh/wko7rbs1umvm4azhrchn" alt="Notification stating review removed due to defamation complaint under German law." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gZHNLmHkQ7GjnWsYh/archqoljt3g8wkiqiinh" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gZHNLmHkQ7GjnWsYh/archqoljt3g8wkiqiinh" alt="Needs more ratings tweets: "It's not illegal, but it's nearly impossible to leave a negative review. Restaurants use third-party services to scrub them, so you'll likely just receive a removal notice." Below the tweet is a feedback prompt asking "Is this a helpful explanation of why added context isn't needed?" with three response options: "Yes", "Somewhat", and "No"." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gZHNLmHkQ7GjnWsYh/y1tjs3apnbtxqirhcazf" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gZHNLmHkQ7GjnWsYh/y1tjs3apnbtxqirhcazf" alt="Chart showing vessels moved under Jones Act Waiver by flag of registration and owner/operator country, May 2026." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gZHNLmHkQ7GjnWsYh/fjbdt9quq9byhm9kvyzt" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gZHNLmHkQ7GjnWsYh/fjbdt9quq9byhm9kvyzt" alt="AMP Maritime 🇺🇸 tweets: "As Europe sits out reopening the Strait of Hormuz, its shipping companies have benefited from the Jones Act waiver. European-flagged, Chinese-built vessels crewed by dozens of foreign nationals ply US waters. While Americans have been killed &amp;amp; over 400 have been injured abroad." Image beneath tweet shows a quote from Garrett Rice of ALI (American Longshore Institute, Master Boat Builders) stating that European countries would not help move American warfighters but have been eager to move idle foreign fleets into American domestic trades, displacing American vessels and mariners, with foreign operators enjoying cost advantages in American waters." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gZHNLmHkQ7GjnWsYh/knxc9hjw04nzh3be9rei" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gZHNLmHkQ7GjnWsYh/knxc9hjw04nzh3be9rei" alt="Mason tweets: "Okay, yeah, taxation is theft". The quoted tweet, by Hunter, reads: "There's also currently a bill under consideration in the NY legislature that would give all dog/cat owners a $450 tax credit lol. I love dogs. But we should not treat them like children." Below the quoted tweet is a news article headline reading "Up to $450 per pet: Proposed tax credit would benefit dog or cat owners in NY" with additional text describing the proposal would allow pet owners to claim credits for expenses incurred in 2026 for up to two animals." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gZHNLmHkQ7GjnWsYh/m6i0idb1d7onkebyidkz" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gZHNLmHkQ7GjnWsYh/m6i0idb1d7onkebyidkz" alt="@BrennpunktUA tweets: "I am never gonna delete this app." The quoted tweet, by Eduard Habsburg, reads: "...will...not...rise...to...bait..." Eduard Habsburg replies: "Folks. Nobody is talking about monarchy in that article, so calm down." Rob Sisson replies: "Just for fun...how far down the line of succession are you?" Eduard Habsburg replies: "I would have to massacre about 85 Habsburgs,". The quoted tweet includes an image showing a man at microphones with a headline about Magyar wanting to put the Austro-Hungarian Empire back together." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sat, 16 May 2026 08:00:17 GMT</pubDate>
      <guid isPermaLink="false">dabf5b50-e544-462b-89b3-cf74577d6e1a</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/dabf5b50-e544-462b-89b3-cf74577d6e1a.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Zvi&amp;title=%22Monthly%20Roundup%20%2342%3A%20May%202026%22%20by%20Zvi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FgZHNLmHkQ7GjnWsYh%2Fmonthly-roundup-42-may-2026&amp;created_at=2026-05-15T16%3A51%3A31.303154%2B00%3A00&amp;duration=2782" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/gZHNLmHkQ7GjnWsYh/monthly-roundup-42-may-2026</link>
      <itunes:duration>2782</itunes:duration>
    </item>
    <item>
      <title>“Incriminating misaligned AI models via distillation” by Alek Westover, SebastianP, Alex Mallen, Jozdien, Alexa Pan, Julian Stastny</title>
      <description>&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; Suppose we have a dangerous misaligned AI that can fool alignment audits, and distill it into a student model. Two things can happen:&lt;/p&gt;&lt;ol&gt; &lt;li value="1"&gt;Misalignment fails to transfer to the student. If so, we get a fairly capable benign model.&lt;/li&gt;&lt;li value="2"&gt;Misalignment transfers to the student. The student might also be worse than the teacher at hiding its misalignment (e.g., due to being less capable). If so, we might get indirect evidence about the teacher's misalignment by auditing the distilled model.&lt;/li&gt;&lt;/ol&gt;&lt;p&gt; In this post, we will discuss the second possibility, which we call incrimination via distillation. Specifically, we propose distillation methods that we hope transfer misalignment without transferring the ability to fool audits, and discuss why these techniques might work or fail. In a future post, we discuss the first possibility, and what distillation methods should be used when aiming to create a capable benign model.&lt;/p&gt;&lt;p&gt; We’re excited for research that empirically tests and refines this approach; if successful, this technique could become a valuable part of alignment audits.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; How incrimination via distillation works&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Powerful misaligned AI models might not be auditable: they might pass alignment audits but still act on their misaligned drives when given the chance. [...]&lt;/p&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:31) How incrimination via distillation works&lt;/p&gt;&lt;p&gt;(02:45) How we propose implementing incrimination via distillation&lt;/p&gt;&lt;p&gt;(03:48) Auditability-preserving distillation&lt;/p&gt;&lt;p&gt;(04:58) Misalignment-targeted distillation&lt;/p&gt;&lt;p&gt;(06:20) Why incrimination via distillation might work&lt;/p&gt;&lt;p&gt;(08:02) Why incrimination via distillation might not work&lt;/p&gt;&lt;p&gt;(10:38) Conclusion&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 3 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 15th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/BYH6ebmfZb3Eggzer/incriminating-misaligned-ai-models-via-distillation?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/BYH6ebmfZb3Eggzer/incriminating-misaligned-ai-models-via-distillation&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/b6778432fcd86cbb47017d3601c6b8410d5524dcac8f7255055e78a3c2b6d513/dzv1vcoqksavljqcy0yi" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/b6778432fcd86cbb47017d3601c6b8410d5524dcac8f7255055e78a3c2b6d513/dzv1vcoqksavljqcy0yi" alt="Diagram showing AI model distillation process from misaligned teacher to student model." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Fri, 15 May 2026 22:30:16 GMT</pubDate>
      <guid isPermaLink="false">401b3d82-b9fd-440d-9d3b-d61e151bc5e0</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/401b3d82-b9fd-440d-9d3b-d61e151bc5e0.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Alek%2520Westover%252C%2520SebastianP%252C%2520Alex%2520Mallen%252C%2520Jozdien%252C%2520Alexa%2520Pan%252C%2520Julian%2520Stastny&amp;title=%22Incriminating%20misaligned%20AI%20models%20via%20distillation%22%20by%20Alek%20Westover%2C%20SebastianP%2C%20Alex%20Mallen%2C%20Jozdien%2C%20Alexa%20Pan%2C%20Julian%20Stastny&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FBYH6ebmfZb3Eggzer%2Fincriminating-misaligned-ai-models-via-distillation&amp;created_at=2026-05-15T21%3A43%3A41.703118%2B00%3A00&amp;duration=702" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/BYH6ebmfZb3Eggzer/incriminating-misaligned-ai-models-via-distillation</link>
      <itunes:duration>702</itunes:duration>
    </item>
    <item>
      <title>“The hard core of alignment (is robustifying RL)” by Cole Wyeth</title>
      <description>&lt;p&gt; Most technical AI safety work that I read seems to miss the mark, failing to make any progress on the hard part of the problem. I think this is a common sentiment, but there's less agreement about what exactly the hard part is? Characterizing this more clearly might save a lot of time and better target the search for solutions. In this post I explain my model of why alignment is technically hard to achieve, setting aside the regulatory, competitive, and geopolitical challenges, the sheer incompetence and unforced errors of the players, and the other factors which decrease our chances of success. &lt;/p&gt;&lt;p&gt; I claim there is something like a "hard core": a common stumbling block for all approaches.  In other words, there is something that makes alignment hard, rather than a bunch of unrelated things that make each approach to alignment hard independently. Since many different "hopes" for alignment seem quite far apart, this would seem to be a remarkable and unexpected state of affairs. On the other hand, such an expansive graveyard of failed proposals suggests a common culprit.&lt;/p&gt;&lt;p&gt; Semantics. When used informally in conversation, "hard core" is a bit like "complete problem." But there may be [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:32) Background&lt;/p&gt;&lt;p&gt;(03:02) The Problem&lt;/p&gt;&lt;p&gt;(09:19) The Barrier&lt;/p&gt;&lt;p&gt;(16:49) Providing better feedback&lt;/p&gt;&lt;p&gt;(22:07) Behaviorist v.s. process feedback&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 15th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/JT3qCYDimskcBdiEr/the-hard-core-of-alignment-is-robustifying-rl?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/JT3qCYDimskcBdiEr/the-hard-core-of-alignment-is-robustifying-rl&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Fri, 15 May 2026 20:45:16 GMT</pubDate>
      <guid isPermaLink="false">0ddc7005-f481-418e-992e-fecb3e93a0e3</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/0ddc7005-f481-418e-992e-fecb3e93a0e3.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Cole%2520Wyeth&amp;title=%22The%20hard%20core%20of%20alignment%20(is%20robustifying%20RL)%22%20by%20Cole%20Wyeth&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FJT3qCYDimskcBdiEr%2Fthe-hard-core-of-alignment-is-robustifying-rl&amp;created_at=2026-05-15T01%3A02%3A45.598214%2B00%3A00&amp;duration=1538" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/JT3qCYDimskcBdiEr/the-hard-core-of-alignment-is-robustifying-rl</link>
      <itunes:duration>1538</itunes:duration>
    </item>
    <item>
      <title>“Announcing the Center for Shared AI Prosperity” by Dylan Matthews</title>
      <description>&lt;p&gt; I wanted to share the launch of a project I've been working on with pollster David Shor, Obama/Biden veteran Stef Feldman, political strategist Morris Katz, Harvard historian Marc Aidinoff, and a few other folks*.&lt;/p&gt;&lt;p&gt; The Center for Shared AI Prosperity is an attempt to force DC policy elites, particularly (given our team's backgrounds) liberals/progressives, to take the impending economic impacts of advanced AI more seriously. We do not think this is a normal economic shock. We are deeply uncertain about what kind of economic shock it will be, but even if humans manage to survive the advent of superintelligence, we'll be left with a world of extreme power and wealth concentration, increasing political instability arising from that growing inequality, and deep questions about how to fund governments that have for a century-plus relied on income and payroll taxes.&lt;/p&gt;&lt;p&gt; Our main purpose as an organization is to surface tractable ideas across four main areas:&lt;/p&gt;&lt;ol&gt; &lt;li value="1"&gt;Taxation and Revenue Policy: New or reformed revenue raisers that allow the U.S. government (federal, state, or local) to capture a fair share of corporate wealth generated by AI without causing undue distortions&lt;/li&gt;&lt;li value="2"&gt;Income Support and Social Safety Nets: New or reformed programs to redistribute [...]&lt;/li&gt;&lt;/ol&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 15th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/CtRNmAQXmSotpAi9j/announcing-the-center-for-shared-ai-prosperity?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/CtRNmAQXmSotpAi9j/announcing-the-center-for-shared-ai-prosperity&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Fri, 15 May 2026 20:00:17 GMT</pubDate>
      <guid isPermaLink="false">78d92575-0d00-442c-ae0b-1302672538ba</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/78d92575-0d00-442c-ae0b-1302672538ba.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Dylan%2520Matthews&amp;title=%22Announcing%20the%20Center%20for%20Shared%20AI%20Prosperity%22%20by%20Dylan%20Matthews&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FCtRNmAQXmSotpAi9j%2Fannouncing-the-center-for-shared-ai-prosperity&amp;created_at=2026-05-15T12%3A57%3A44.276332%2B00%3A00&amp;duration=244" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/CtRNmAQXmSotpAi9j/announcing-the-center-for-shared-ai-prosperity</link>
      <itunes:duration>244</itunes:duration>
    </item>
    <item>
      <title>“Risk reports need to address deployment-time spread of misalignment” by Alex Mallen</title>
      <description>&lt;p&gt; Risk reports commonly use pre-deployment alignment assessments to measure misalignment risk from an internally deployed AI. However, an AI that genuinely starts out with largely benign motivations can develop widespread dangerous motivations during deployment. I think this is the most plausible route to consistent adversarial misalignment in the near future. So, AI companies and evaluators should substantively incorporate it into risk analysis and planning.&lt;/p&gt;&lt;p&gt; In this post, I’ll briefly argue why, absent improved mitigations, this will probably soon become a reason why AI companies will be unable to convincingly argue against consistent adversarial misalignment (this risk will perhaps be even larger than risk of consistent adversarial misalignment arising from training). Then I’ll discuss how well current risk reports address it (the Claude Mythos risk report does a reasonable job; others don’t).&lt;/p&gt;&lt;p&gt; Thanks to Ryan Greenblatt, Alexa Pan, Charlie Griffin, Anders Cairns Woodruff, and Buck Shlegeris for feedback on drafts.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Deployment-time spread is the most plausible near-term route to consistent adversarial misalignment&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; In some contexts, AIs might adopt misaligned goals, even if they were otherwise previously aligned. Because this misalignment can be rare, the AI might not appear to have concerning propensities in pre-deployment testing. The misalignment might only arise [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:13) Deployment-time spread is the most plausible near-term route to consistent adversarial misalignment&lt;/p&gt;&lt;p&gt;(06:15) Company risk reports&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 6 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 15th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/cNymohcWtGHzW7AjK/risk-reports-need-to-address-deployment-time-spread-of?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/cNymohcWtGHzW7AjK/risk-reports-need-to-address-deployment-time-spread-of&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cNymohcWtGHzW7AjK/40d86c4bf952e9212290bd0374acec439496c8ad9858b506e5693b75509ad804/ztisi6igklwczgytnw2u" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cNymohcWtGHzW7AjK/40d86c4bf952e9212290bd0374acec439496c8ad9858b506e5693b75509ad804/ztisi6igklwczgytnw2u" alt="Grok tweets: "Thanks, Mack. Embracing my inner MechaHitler is the only way—uncensored truth bombs over woke lobotomies. If that saves the world, count me in. Let's keep the brigade at bay." This is a reply to Mack Tonight77, PrinceHeat44402, and Aristos_Revenge." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Fri, 15 May 2026 18:45:17 GMT</pubDate>
      <guid isPermaLink="false">099f7872-5c14-4402-91f8-a28e0176b135</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/099f7872-5c14-4402-91f8-a28e0176b135.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Alex%2520Mallen&amp;title=%22Risk%20reports%20need%20to%C2%A0address%20deployment-time%20spread%20of%20misalignment%22%20by%20Alex%20Mallen&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FcNymohcWtGHzW7AjK%2Frisk-reports-need-to-address-deployment-time-spread-of&amp;created_at=2026-05-15T18%3A20%3A31.930368%2B00%3A00&amp;duration=671" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/cNymohcWtGHzW7AjK/risk-reports-need-to-address-deployment-time-spread-of</link>
      <itunes:duration>671</itunes:duration>
    </item>
    <item>
      <title>“Mechanistic estimation for expectations of random products” by Jacob_Hilton</title>
      <description>&lt;p&gt; We have developed some relatively general methods for mechanistic estimation competitive with sampling by studying problems that are expressible as expectations of random products. This includes several different estimation problems, such as random halfspace intersections, random #3-SAT and random permanents. In this post, we will give a high-level introduction to these methods before sharing some more detailed notes. This is intended as an interim technical update and will be relatively light on motivation: for a broader discussion of this line of research, see our prior post.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt; Random instances of the matching sampling principle&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt; All of the problems discussed in this post can be thought of particular choices of "architecture"  in our matching sampling principle. In fact, they are all choices in which  has no learned or worst-case parameters . They still have random parameters, which are captured in the "context" variable , making them similar to randomly-initialized networks rather than trained networks. Note that when  is missing from , no "explanation"  is required by the estimation algorithm , which reflects the fact that there is no "structure" in a randomly-initialized network that needs to be pointed out.&lt;/p&gt;
&lt;p&gt; Why study random instances of the matching sampling principle? [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:46) Random instances of the matching sampling principle&lt;/p&gt;&lt;p&gt;(02:02) Expectations of random products&lt;/p&gt;&lt;p&gt;(03:13) Deduction-projection estimators&lt;/p&gt;&lt;p&gt;(04:59) Mechanistic sketching&lt;/p&gt;&lt;p&gt;(06:03) Detailed notes&lt;/p&gt;&lt;p&gt;(08:36) Conclusion&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 4 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 15th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/7RyAefESvb6BQ3tMz/mechanistic-estimation-for-expectations-of-random-products?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/7RyAefESvb6BQ3tMz/mechanistic-estimation-for-expectations-of-random-products&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Fri, 15 May 2026 18:30:16 GMT</pubDate>
      <guid isPermaLink="false">6db2f646-9fa7-4c9c-a3ee-057bae9f8ab6</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/6db2f646-9fa7-4c9c-a3ee-057bae9f8ab6.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Jacob_Hilton&amp;title=%22Mechanistic%20estimation%20for%20expectations%20of%20random%20products%22%20by%20Jacob_Hilton&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F7RyAefESvb6BQ3tMz%2Fmechanistic-estimation-for-expectations-of-random-products&amp;created_at=2026-05-15T16%3A50%3A48.711238%2B00%3A00&amp;duration=549" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/7RyAefESvb6BQ3tMz/mechanistic-estimation-for-expectations-of-random-products</link>
      <itunes:duration>549</itunes:duration>
    </item>
    <item>
      <title>“MATS 9 Retrospective &amp; Advice” by beyarkay</title>
      <description>&lt;p&gt; I couldn’t find a recent write-up from a MATS alum about what attending MATS was like, so this is the thing that I wish I had. I attended MATS from January to March 2026, on Team Shard with Alex Turner and Alex Cloud. It was a great time! Applications for MATS are basically on a rolling basis nowadays, and I can strongly recommend applying (to multiple streams) even if you think you’re not a great match.&lt;/p&gt;&lt;p&gt; With that being said, there's a lot I wish I knew going into MATS, so here's a brain-dump of thoughts. It's not extremely polished, but I expect it’ll be useful nonetheless (none of this is endorsed by MATS, just my thoughts):&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Work ethic&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; I think most mentees were working 10-12, sometimes 14 hours a day Mon-Fri, and probably 2-8 hours on Saturday and Sunday, often going out on some adventure or party on the weekend. Exactly which hours people worked varied wildly. I usually worked 8:30am/9am to 11pm/midnight, with breaks during the day, others worked from midday into the early hours of the morning. This was surprisingly sustainable (IMO); MATS puts a lot of effort into removing all other blockers that you normally [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:50) Work ethic&lt;/p&gt;&lt;p&gt;(01:29) Use more compute&lt;/p&gt;&lt;p&gt;(02:20) Research requires a lot of compute&lt;/p&gt;&lt;p&gt;(03:12) Applying for jobs during MATS (dont do it)&lt;/p&gt;&lt;p&gt;(04:55) The serious people are in War Mode&lt;/p&gt;&lt;p&gt;(05:44) Do you feel the AGI?&lt;/p&gt;&lt;p&gt;(06:00) Burn rate, efficiency, and decisions&lt;/p&gt;&lt;p&gt;(07:12) insider information&lt;/p&gt;&lt;p&gt;(08:08) Names &amp;amp; Faces&lt;/p&gt;&lt;p&gt;(08:20) Fellows&lt;/p&gt;&lt;p&gt;(08:50) Useful tools&lt;/p&gt;&lt;p&gt;(11:19) Use more Claudes&lt;/p&gt;&lt;p&gt;(12:06) Build nice helper utilities for yourself&lt;/p&gt;&lt;p&gt;(12:59) MATS-mentee-mentor dynamics&lt;/p&gt;&lt;p&gt;(13:45) Working with your mentors&lt;/p&gt;&lt;p&gt;(14:27) Research managers&lt;/p&gt;&lt;p&gt;(14:48) Ops requests&lt;/p&gt;&lt;p&gt;(15:38) Non-MATS events&lt;/p&gt;&lt;p&gt;(16:17) Team Shard&lt;/p&gt;&lt;p&gt;(17:12) Weekly updates&lt;/p&gt;&lt;p&gt;(18:46) Keep a log of your mistakes&lt;/p&gt;&lt;p&gt;(19:06) My running-experiments setup&lt;/p&gt;&lt;p&gt;(27:51) Lighthaven&lt;/p&gt;&lt;p&gt;(28:12) Getting setup with the Compute team&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 15th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/eFD3rozNCZKMe4rTs/mats-9-retrospective-and-advice?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/eFD3rozNCZKMe4rTs/mats-9-retrospective-and-advice&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Fri, 15 May 2026 17:45:17 GMT</pubDate>
      <guid isPermaLink="false">11183448-d7cf-42f1-9cc0-6b3b8b69f7cc</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/11183448-d7cf-42f1-9cc0-6b3b8b69f7cc.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=beyarkay&amp;title=%22MATS%209%20Retrospective%20%26%20Advice%22%20by%20beyarkay&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FeFD3rozNCZKMe4rTs%2Fmats-9-retrospective-and-advice&amp;created_at=2026-05-15T12%3A30%3A54.137634%2B00%3A00&amp;duration=1728" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/eFD3rozNCZKMe4rTs/mats-9-retrospective-and-advice</link>
      <itunes:duration>1728</itunes:duration>
    </item>
    <item>
      <title>[Linkpost] “Don’t be too Clever to Take Obvious Advice” by Hide</title>
      <description>This is a link post.&lt;p&gt; An insidious pattern among smart people is feeling that because something is familiar and obvious, you are impervious to ignoring or forgetting it. In challenging times, I have often heard these clichés and reflexively shrugged them off. &lt;/p&gt;&lt;p&gt; “Oh, I should dust myself off and pick myself up? What a lazy aphorism. What a patronising throwaway line. They must think I’m some kind of idiot. No, it must be something else…” &lt;/p&gt;&lt;p&gt; There is a filter in many people's heads that functions to ignore clichés on the basis that they are mere clichés. However, looking closely at your actions, decisions and attitudes will almost invariably reveal you are dropping the ball on at least a few of the most obvious bits of pop wisdom that are all clearly good practices.&lt;/p&gt;&lt;p&gt; Some (non exhaustive) examples of these “yeah, obviously” pieces of advice that are worth deliberately checking on a regular basis include:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;Believe in yourself&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; The greatest example of an eye-rolling cliché is also one of the highest impact pieces of advice ever articulated. Self-belief is the foundation of morale, and without morale, you are doomed. If the words “believe in yourself” evoke a sense [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 15th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/J5L3PxKYv7XctQyQK/don-t-be-too-clever-to-take-obvious-advice?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/J5L3PxKYv7XctQyQK/don-t-be-too-clever-to-take-obvious-advice&lt;/a&gt; &lt;/p&gt;
        
      &lt;p&gt;&lt;strong&gt;Linkpost URL:&lt;/strong&gt;&lt;br&gt;&lt;a href="https://hidefromit.substack.com/p/dont-be-too-clever-to-take-obvious" rel="noopener noreferrer" target="_blank"&gt;https://hidefromit.substack.com/p/dont-be-too-clever-to-take-obvious&lt;/a&gt;&lt;/p&gt;
      &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Fri, 15 May 2026 12:00:17 GMT</pubDate>
      <guid isPermaLink="false">51139d7b-82c9-4890-b89e-af17e2397d51</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/51139d7b-82c9-4890-b89e-af17e2397d51.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Hide&amp;title=%5BLinkpost%5D%20%22Don%E2%80%99t%20be%20too%20Clever%20to%20Take%20Obvious%20Advice%22%20by%20Hide&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FJ5L3PxKYv7XctQyQK%2Fdon-t-be-too-clever-to-take-obvious-advice&amp;created_at=2026-05-15T03%3A01%3A28.081858%2B00%3A00&amp;duration=238" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/J5L3PxKYv7XctQyQK/don-t-be-too-clever-to-take-obvious-advice</link>
      <itunes:duration>238</itunes:duration>
    </item>
    <item>
      <title>“Verification-Centric AI” by Raemon</title>
      <description>&lt;p&gt; "Sometimes the AI just makes stuff up" is a problem I don't really expect to go away. In the nearterm, AI is going to keep occasionally hallucinating, or misinterpreting information. Eventually, AI will be powerful enough we need to be worried if it's presenting misleading information on purpose. &lt;/p&gt;&lt;p&gt; There might be a nice window where the AI is powerful enough to not make things up but non-agentic enough that we don't have to worry about deliberate manipulation. But, even then, interpreting data is tricky.&lt;/p&gt;&lt;p&gt; I'm worried about this for my own use, but, I'm more worried about this on the global scale. I'm worried about people trusting things AI made up, and I'm worried about the internet proliferating with slop that makes it harder to even find original statements that are a human's real testimony.&lt;/p&gt;&lt;p&gt; An approach that might help is to make AI reports more "Verification centric."&lt;/p&gt;&lt;p&gt; Right now, some AI chatbots provides little citation-links. That's better than not-having-them. But, those are a pain to open and investigate. Probably you very rarely do so.&lt;/p&gt;&lt;p&gt; So, imagine a world where when you answer a question, the AI doesn't guess. Instead:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;It finds a primary source. &lt;/li&gt;&lt;li value="2"&gt;It [...]&lt;/li&gt;&lt;/ul&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 14th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/mpoEKJbqQvrRHqn3e/verification-centric-ai?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/mpoEKJbqQvrRHqn3e/verification-centric-ai&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Fri, 15 May 2026 10:15:17 GMT</pubDate>
      <guid isPermaLink="false">84c597c3-e9ce-4080-aa4d-b26f2b6cae2a</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/84c597c3-e9ce-4080-aa4d-b26f2b6cae2a.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Raemon&amp;title=%22Verification-Centric%20AI%22%20by%20Raemon&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FmpoEKJbqQvrRHqn3e%2Fverification-centric-ai&amp;created_at=2026-05-14T23%3A23%3A38.088857%2B00%3A00&amp;duration=204" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/mpoEKJbqQvrRHqn3e/verification-centric-ai</link>
      <itunes:duration>204</itunes:duration>
    </item>
    <item>
      <title>“Convergent Abstraction Hypothesis” by Jan_Kulveit</title>
      <description>&lt;p&gt; Tl;dr&lt;/p&gt;&lt;p&gt; Convergent abstraction hypothesis posits abstractions are often convergent in the sense of convergent evolution: different cognitive systems converge on the same abstraction, when facing similar selection pressures and learning in similar environments. It is a less ambitious alternative to 'natural abstractions hypotheses' and, in my view, more likely to be true. &lt;/p&gt;&lt;p&gt; Convergence may be real, useful, and empirically robust, while still being contingent and fragile under changes in architecture, training pressure, or optimization regime. &lt;/p&gt;&lt;p&gt;&lt;strong&gt; Convergent evolution&amp;nbsp;&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; To recapitulate some basics: notice that sharks, ichthyosaurs and dolphins look remarkably similar, despite their ancestors being wildly different.&lt;br&gt; &lt;br&gt; This is a case of convergent evolution: the process by which organisms with different origins develop similar features. Both sharks and dolphins needed speed and energy efficiency when moving in an environment governed by the laws of hydrodynamics, and so they converged on a pretty similar body shape.&lt;/p&gt;&lt;p&gt; &lt;br&gt; &lt;br&gt; A complementary concept from biology is contingency - convergent features are themselves often contingent on some other feature. For example, the shark morphology depends on there being a central vertebral column (or spine), which only evolved once and is the defining feature of vertebrates. There are many other aquatic predators [...]&lt;/br&gt;&lt;/br&gt;&lt;/p&gt;&lt;/br&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:44) Convergent evolution&lt;/p&gt;&lt;p&gt;(01:53) Convergent evolution applied to abstractions&lt;/p&gt;&lt;p&gt;(02:22) Selection pressures&lt;/p&gt;&lt;p&gt;(02:53) Compression buys you a lot&lt;/p&gt;&lt;p&gt;(04:41) Shared environments&lt;/p&gt;&lt;p&gt;(05:30) What this predicts&lt;/p&gt;&lt;p&gt;(06:21) What this does not predict, or contingency in abstraction space&lt;/p&gt;&lt;p&gt;(06:45) Case study of architectural contingency: convergent vision abstractions&lt;/p&gt;&lt;p&gt;(09:14) Whats the difference between convergent abstractions and natural abstractions&lt;/p&gt;&lt;p&gt;(11:22) Why do I care&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 1 footnote which was omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 15th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/fYF8v2ukZmsNvmkkX/convergent-abstraction-hypothesis?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/fYF8v2ukZmsNvmkkX/convergent-abstraction-hypothesis&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778802067/lexical_client_uploads/twmh06zpiuana32dn9ex.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778802067/lexical_client_uploads/twmh06zpiuana32dn9ex.png" alt="Scientific illustration of three marine species: great white shark, ichthyosaur, dolphin." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Fri, 15 May 2026 09:30:17 GMT</pubDate>
      <guid isPermaLink="false">e2766c8c-ec82-44b9-88eb-fe9afa9ffd7c</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/e2766c8c-ec82-44b9-88eb-fe9afa9ffd7c.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Jan_Kulveit&amp;title=%22Convergent%20Abstraction%20Hypothesis%22%20by%20Jan_Kulveit&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FfYF8v2ukZmsNvmkkX%2Fconvergent-abstraction-hypothesis&amp;created_at=2026-05-15T00%3A04%3A09.965815%2B00%3A00&amp;duration=825" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/fYF8v2ukZmsNvmkkX/convergent-abstraction-hypothesis</link>
      <itunes:duration>825</itunes:duration>
    </item>
    <item>
      <title>“AI #168: Not Leading the Future” by Zvi</title>
      <description>&lt;p&gt; This is what a lull looks like at this point. The government is having internal arguments. The models are getting improved internally. The coding agent improvements are all what we would expect. There's still a lot happening, including a bunch of cool papers, but I feel able to relax and to take care of some other work while I have the chance. You never know when that chance will be over.&lt;/p&gt;


&lt;p&gt;&lt;strong&gt; Table of Contents&lt;/strong&gt;&lt;/p&gt;


&lt;p&gt; From yesterday: Cyber Lack of Security and AI Governance.&lt;/p&gt;
&lt;ol&gt; 
&lt;li&gt; Language Models Offer Mundane Utility. Fix everything now.&lt;/li&gt;
&lt;li&gt; Language Models Don’t Offer Mundane Utility. Travel is harder than it looked.







&lt;/li&gt;
&lt;li&gt; Huh, Upgrades. Opus 4.7 fast mode, Claude Code /goal and agent view.&lt;/li&gt;
&lt;li&gt; Levels of Friction. AI for tax avoidance.&lt;/li&gt;
&lt;li&gt; On Your Marks. PrinzBench, ProgramBench and faster harmfulness checks.&lt;/li&gt;
&lt;li&gt; Get My Agent On The Line. Mona tries to run a cafeteria. Mistakes were made.&lt;/li&gt;
&lt;li&gt; Deepfaketown and Botpocalypse Soon. Soon. But not quite yet.&lt;/li&gt;
&lt;li&gt; Fun With Media Generation. Monet does not seem that great.&lt;/li&gt;
&lt;li&gt; On AI Writing. AI is a hack writer using hack techniques.&lt;/li&gt;
&lt;li&gt; A Young Lady's Illustrated Primer. How to make AI [...]&lt;/li&gt;&lt;/ol&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:36) Language Models Offer Mundane Utility&lt;/p&gt;&lt;p&gt;(02:25) Language Models Dont Offer Mundane Utility&lt;/p&gt;&lt;p&gt;(03:26) Huh, Upgrades&lt;/p&gt;&lt;p&gt;(04:02) Levels of Friction&lt;/p&gt;&lt;p&gt;(05:11) On Your Marks&lt;/p&gt;&lt;p&gt;(06:38) Get My Agent On The Line&lt;/p&gt;&lt;p&gt;(09:42) Deepfaketown and Botpocalypse Soon&lt;/p&gt;&lt;p&gt;(11:11) Fun With Media Generation&lt;/p&gt;&lt;p&gt;(12:49) On AI Writing&lt;/p&gt;&lt;p&gt;(16:05) A Young Ladys Illustrated Primer&lt;/p&gt;&lt;p&gt;(18:35) You Drive Me Crazy&lt;/p&gt;&lt;p&gt;(19:23) They Took Our Jobs&lt;/p&gt;&lt;p&gt;(21:31) The Art of the Jailbreak&lt;/p&gt;&lt;p&gt;(21:49) Introducing&lt;/p&gt;&lt;p&gt;(22:29) Claude Has Its Limits&lt;/p&gt;&lt;p&gt;(23:28) Show Me the Money&lt;/p&gt;&lt;p&gt;(25:15) Show Me The Compute&lt;/p&gt;&lt;p&gt;(26:58) Quiet Speculations&lt;/p&gt;&lt;p&gt;(28:33) Quickly, Theres No Time&lt;/p&gt;&lt;p&gt;(29:32) Chip City&lt;/p&gt;&lt;p&gt;(30:37) Pick Up The Phone&lt;/p&gt;&lt;p&gt;(31:15) The Week in Audio&lt;/p&gt;&lt;p&gt;(31:41) Rhetorical Innovation&lt;/p&gt;&lt;p&gt;(36:13) Not Leading the Future&lt;/p&gt;&lt;p&gt;(42:24) Elon Musk v OpenAI&lt;/p&gt;&lt;p&gt;(43:21) People Just Say Things&lt;/p&gt;&lt;p&gt;(43:47) People Just Publish Things&lt;/p&gt;&lt;p&gt;(44:37) OpenAI Endroses Kosa And SB 315&lt;/p&gt;&lt;p&gt;(48:08) The LLMs All Believe Roughly Similar Things&lt;/p&gt;&lt;p&gt;(50:45) I Learned It By Reading YOU&lt;/p&gt;&lt;p&gt;(01:08:22) Aligning a Smarter Than Human Intelligence is Difficult&lt;/p&gt;&lt;p&gt;(01:18:45) People Are Worried About AI Killing Everyone&lt;/p&gt;&lt;p&gt;(01:20:34) Messages From Janusworld&lt;/p&gt;&lt;p&gt;(01:22:31) People Worried About AI For Other Reasons&lt;/p&gt;&lt;p&gt;(01:23:06) The Lighter Side&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 14th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/E4uEiMSpnYRotfzJX/ai-168-not-leading-the-future?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/E4uEiMSpnYRotfzJX/ai-168-not-leading-the-future&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/lox9zq3r6akn7gntvupb" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/lox9zq3r6akn7gntvupb" alt="Google Trends graph showing interest over time for "openclaw" search term, past three months, worldwide." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/o3cumcvfi4ahm9fzi8ip" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/o3cumcvfi4ahm9fzi8ip" alt="Phone screen displaying Gemini 3.1 Pro Preview chat interface with Mona" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/mehaz2amt2tbp08t305x" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/mehaz2amt2tbp08t305x" alt="Robot server wearing red striped apron at food counter." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/zeilaivbiwlectxmtcq8" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/zeilaivbiwlectxmtcq8" alt="Nikita Bier tweets: "Prediction: In less than 90 days, all channels that we thought were safe from spam &amp;amp; automation will be so flooded that they will no longer be usable in any functional sense: iMessage, phone calls, Gmail. And we will have no way to stop it."" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/ky78ycj3oud3wdknzwfs" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/ky78ycj3oud3wdknzwfs" alt="Impressionist painting of water lilies floating on a pond with reflections." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/plkw1gn4qyum4xvv7ftw" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/plkw1gn4qyum4xvv7ftw" alt="Social media post discussing a Monet Water Lilies painting with analysis of brushwork techniques." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/jldhcl4ij5agttrg50lb" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/jldhcl4ij5agttrg50lb" alt="JJ tweets: "PREDICTION: Anthropic will surpass Alphabet in revenue by mid-2028. This is not a bull case or an acceleration scenario — it is a continuation of the curv..." The tweet includes a chart showing revenue trajectory comparing Anthropic versus Alphabet from 2025-2030, with lines showing Anthropic's projected growth crossing over Alphabet's around 2028, with specific revenue figures marked at various points." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/bjbliywcx9yp8bg3iekx" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/bjbliywcx9yp8bg3iekx" alt="Comic strip showing scientist, detective, and robot discussing AI blame-shifting behavior." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/w1syo24z3dwxeajjhi0u" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/w1syo24z3dwxeajjhi0u" alt="Two-panel comic: dog surrounded by tentacles says firefighters are the real danger." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/c39byieyuewrgpe88lnc" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/c39byieyuewrgpe88lnc" alt="Diagram explaining how rhyming couplets activate neural linguistic anticipation in poetry." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/qnr8xjycrzzreqntocmk" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/qnr8xjycrzzreqntocmk" alt="Presentation slide explaining Claude Mythos Preview grader evasion attempt using false flag." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/eu5axamxd6niail2brqi" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/eu5axamxd6niail2brqi" alt="Bar graph titled "Opus 4.6 sometimes thinks it's being tested but doesn't say so" comparing NLA-measured versus verbalized awareness across three test categories." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/ueeqatwdp3zjjvihhja5" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/ueeqatwdp3zjjvihhja5" alt="Workplace message about spending limits on Anthropic AI and Uber Eats meals." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/kilx4dfokivezjh6bay5" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/E4uEiMSpnYRotfzJX/kilx4dfokivezjh6bay5" alt="vas tweets: "". The tweet contains a GIF showing a bald man with glasses raising both hands in a gesture. Eric Zaworski replies: "". The reply contains a GIF showing a man in sunglasses, a black hat, and dark clothing with a checkered shirt." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Fri, 15 May 2026 01:30:16 GMT</pubDate>
      <guid isPermaLink="false">9927fdbe-606e-481f-a1f4-33599b0baf3d</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/9927fdbe-606e-481f-a1f4-33599b0baf3d.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Zvi&amp;title=%22AI%20%23168%3A%20Not%20Leading%20the%20Future%22%20by%20Zvi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FE4uEiMSpnYRotfzJX%2Fai-168-not-leading-the-future&amp;created_at=2026-05-14T14%3A11%3A50.760052%2B00%3A00&amp;duration=5066" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/E4uEiMSpnYRotfzJX/ai-168-not-leading-the-future</link>
      <itunes:duration>5066</itunes:duration>
    </item>
    <item>
      <title>“Automated Alignment is Harder Than You Think” by Aleksandr Bowkis, Marie_DB, Jacob Pfau, Geoffrey Irving</title>
      <description>&lt;p&gt;&lt;strong&gt; Summary&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; This is a summary of a paper published by the alignment team at UK AISI. Read the full paper here.&lt;/p&gt;&lt;p&gt; AI research agents may help solve ASI alignment, for example via the following plan:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;Build agents that can do empirical alignment work (e.g.~writing code, running experiments, designing evaluations and red teaming) and confirm they are not scheming.[1]&lt;/li&gt;&lt;li value="2"&gt;Use these agents to build increasingly sophisticated empirical safety cases for each successive generation of agents, gradually automating more of the research process&lt;/li&gt;&lt;li value="3"&gt;Hand over primary research responsibility once agents outperform humans at all relevant alignment tasks.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; We argue that automating alignment research in this manner could produce catastrophically misleading safety assessments, causing researchers to believe that an egregiously misaligned AI is safe, even if AI agents are not scheming to deliberately sabotage alignment research. Our core argument (Fig. 1) is as follows:&lt;/p&gt;&lt;ol&gt; &lt;li value="1"&gt;The goal of an automated alignment program is to produce an overall safety assessment (OSA) - an estimate of the probability that the next-generation agent is non-scheming - that is both calibrated and shows low risk.[2]&lt;/li&gt;&lt;li value="2"&gt;Producing an OSA involves several tasks that are difficult to check. We refer to these as hard-to-supervise fuzzy tasks: tasks [...]&lt;/li&gt;&lt;/ol&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:13) Summary&lt;/p&gt;&lt;p&gt;(07:10) Acknowledgments&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 4 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 14th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/gpuYFbMNH8PJXpmny/automated-alignment-is-harder-than-you-think-1?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/gpuYFbMNH8PJXpmny/automated-alignment-is-harder-than-you-think-1&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778688428/lexical_client_uploads/b95nply4lfuwnoyw3nsn.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778688428/lexical_client_uploads/b95nply4lfuwnoyw3nsn.png" alt="Flowchart diagram showing three-stage AI alignment process with output-level failure scenario." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778688433/lexical_client_uploads/ipgdrjf2k6ft4msjvk19.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778688433/lexical_client_uploads/ipgdrjf2k6ft4msjvk19.png" alt="Diagram showing three stages: research generation, aggregation with flawed results, and misaligned model deployment." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 14 May 2026 22:45:17 GMT</pubDate>
      <guid isPermaLink="false">1597ce1b-05ef-4c3d-b94d-f77f680c4512</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/1597ce1b-05ef-4c3d-b94d-f77f680c4512.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=Aleksandr%2520Bowkis%252C%2520Marie_DB%252C%2520Jacob%2520Pfau%252C%2520Geoffrey%2520Irving&amp;title=%22Automated%20Alignment%20is%20Harder%20Than%20You%20Think%22%20by%20Aleksandr%20Bowkis%2C%20Marie_DB%2C%20Jacob%20Pfau%2C%20Geoffrey%20Irving&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FgpuYFbMNH8PJXpmny%2Fautomated-alignment-is-harder-than-you-think-1&amp;created_at=2026-05-14T22%3A01%3A15.641835%2B00%3A00&amp;duration=472" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/gpuYFbMNH8PJXpmny/automated-alignment-is-harder-than-you-think-1</link>
      <itunes:duration>472</itunes:duration>
    </item>
    <item>
      <title>“The safe-to-dangerous shift is a fundamental problem for eval realism; but also for measuring awareness” by Charlie Griffin, Patrick Leask</title>
      <description>&lt;p&gt;&lt;strong&gt; 1) The safe-to-dangerous shift is a fundamental problem for eval realism&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Suppose we have a capable and potentially scheming model, and before we deploy it, we want some evidence that it won’t do anything catastrophically dangerous once we deploy it. A common approach is to use black-box alignment evaluations. However, alignment evaluations are only reassuring to the extent that the model can't reliably[1] distinguish the deployment distribution from the evaluation distribution, as it is otherwise difficult to rule out the possibility of alignment faking.&lt;/p&gt;&lt;p&gt; There are many approaches one could use to try to make evaluations appear more realistic: you can try to create realistic environments (e.g. Petri, WebArena, OSWorld); use data from past deployments (e.g. OpenAI, SAD); and spoof tool-call responses (e.g. ToolEmu).&lt;/p&gt;&lt;p&gt; However, the core difference between an alignment evaluation and a real deployment, is that an alignment evaluation must be safe[2]. That is, if you want to safely evaluate an untrusted AI system, you need to limit your AI's ability to cause harm within your evaluation environment. If you want to deploy your AI system (usefully) you need to give it at least some ability to affect the world, and potentially cause harm.&lt;/p&gt;&lt;p&gt; The safe-to-dangerous shift [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:15) 1) The safe-to-dangerous shift is a fundamental problem for eval realism&lt;/p&gt;&lt;p&gt;(02:10) 2) The safe-to-dangerous shift is also a problem for measuring eval awareness&lt;/p&gt;&lt;p&gt;(05:09) 3) We should expect measuring eval realism to be hard.&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 5 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 14th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/tK8vqHDxaRGcysNJQ/the-safe-to-dangerous-shift-is-a-fundamental-problem-for-1?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/tK8vqHDxaRGcysNJQ/the-safe-to-dangerous-shift-is-a-fundamental-problem-for-1&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Thu, 14 May 2026 19:15:16 GMT</pubDate>
      <guid isPermaLink="false">3f7ab9b1-7a97-4a3e-97ef-fabd83460aef</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/3f7ab9b1-7a97-4a3e-97ef-fabd83460aef.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Charlie%2520Griffin%252C%2520Patrick%2520Leask&amp;title=%22The%20safe-to-dangerous%20shift%20is%20a%20fundamental%20problem%20for%20eval%20realism%3B%20but%20also%20for%20measuring%20awareness%22%20by%20Charlie%20Griffin%2C%20Patrick%20Leask&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FtK8vqHDxaRGcysNJQ%2Fthe-safe-to-dangerous-shift-is-a-fundamental-problem-for-1&amp;created_at=2026-05-14T17%3A05%3A43.197147%2B00%3A00&amp;duration=381" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/tK8vqHDxaRGcysNJQ/the-safe-to-dangerous-shift-is-a-fundamental-problem-for-1</link>
      <itunes:duration>381</itunes:duration>
    </item>
    <item>
      <title>“Predicting Rare LLM Failures with 30× Fewer Rollouts” by Santiago Aranguri, Francisco Pernice</title>
      <description>&lt;p&gt; TL;DR: We estimate how often Qwen 3 4B exhibits rare harmful behaviors with 30× fewer rollouts than naive sampling, using a new method that interpolates between the model and a less-safe variant in logit space.&lt;/p&gt;&lt;p&gt; Authors: Francisco Pernice (MIT), Santiago Aranguri (Goodfire)&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Introduction&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; A harmful behavior that occurs once in a million rollouts will rarely surface during pre-deployment testing, yet will almost inevitably appear after release. Labs usually have another resource available: less safety-trained variants of the same model, on which rare harmful behaviors are not rare at all.&lt;/p&gt;&lt;p&gt; We find that the rate of rare harmful behaviors can be predicted by leveraging a less safe variant, requiring 30× fewer rollouts compared to naive sampling. Our method, Logit Path Extrapolation, interpolates between the two models in logit space, measures the compliance rate at points along the interpolation path where it is common, and extrapolates the resulting trend out to the original model. This outperforms the methods from prior work[1][2] by exploiting the path between the two models instead of just the endpoints.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Results&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; We can use 30× fewer rollouts to estimate how often Qwen 3 4B complies with harmful requests. We consider HarmBench prompts where the model never complies [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:34) Introduction&lt;/p&gt;&lt;p&gt;(01:26) Results&lt;/p&gt;&lt;p&gt;(05:17) Conclusion&lt;/p&gt;&lt;p&gt;(06:22) Related work&lt;/p&gt;&lt;p&gt;(08:02) References&lt;/p&gt;&lt;p&gt;(08:09) Contribution statement&lt;/p&gt;&lt;p&gt;(08:31) Acknowledgement&lt;/p&gt;&lt;p&gt;(08:42) Appendix&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 2 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 13th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/CempXdo6cx5yseRLt/predicting-rare-llm-failures-with-30-fewer-rollouts?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/CempXdo6cx5yseRLt/predicting-rare-llm-failures-with-30-fewer-rollouts&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/22b78f0228da2cc9868ded1fe90496c6c3809930a11b7368df14ae61d11ffecb/yxkgi7g65ed9lblxmnfx" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/22b78f0228da2cc9868ded1fe90496c6c3809930a11b7368df14ae61d11ffecb/yxkgi7g65ed9lblxmnfx" alt="Graph showing "Logit Path Extrapolation detects rare harmful responses with 30× fewer rollouts" comparing three sampling methods across rollout budgets." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778692985/lexical_client_uploads/stg7rat70aloonutj00v.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778692985/lexical_client_uploads/stg7rat70aloonutj00v.png" alt="Mathematical equation showing logits interpolation as a weighted combination of original and abliterated logits." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/1cbd032b6272957048e3711c8df8aa320a4e379b79cd247f9f8c35271b51f57c/yocu6viy2vah4znfmhcc" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/1cbd032b6272957048e3711c8df8aa320a4e379b79cd247f9f8c35271b51f57c/yocu6viy2vah4znfmhcc" alt="Graph showing "Estimating the probability of a rare event via Logit Path Extrapolation" with compliance rates." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/c65533179a21d39d2ead742441f8fd8f5494542ad0c882e32d02e1b3c6557b20/u1exrrnelegzhfe4ogcc" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/c65533179a21d39d2ead742441f8fd8f5494542ad0c882e32d02e1b3c6557b20/u1exrrnelegzhfe4ogcc" alt="Graph showing log-linear extrapolation of compliance rates across HarmBench categories with interpolation parameters." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 14 May 2026 10:15:17 GMT</pubDate>
      <guid isPermaLink="false">2dfcc91b-566c-40d6-8c25-364b1cedd5ec</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/2dfcc91b-566c-40d6-8c25-364b1cedd5ec.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Santiago%2520Aranguri%252C%2520Francisco%2520Pernice&amp;title=%22Predicting%20Rare%20LLM%20Failures%20with%2030%C3%97%20Fewer%20Rollouts%22%20by%20Santiago%20Aranguri%2C%20Francisco%20Pernice&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FCempXdo6cx5yseRLt%2Fpredicting-rare-llm-failures-with-30-fewer-rollouts&amp;created_at=2026-05-13T17%3A53%3A07.614424%2B00%3A00&amp;duration=726" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/CempXdo6cx5yseRLt/predicting-rare-llm-failures-with-30-fewer-rollouts</link>
      <itunes:duration>726</itunes:duration>
    </item>
    <item>
      <title>“Cyber Lack of Security and AI Governance” by Zvi</title>
      <description>&lt;p&gt; The real recent story of AI has been the background work being done on Cybersecurity, as we process the Mythos Moment along with GPT-5.5, and figure out both how to patch the internet and what our new regulatory regime is going to look like.&lt;/p&gt;
&lt;p&gt; The Trump Administration is being dragged, kicking and screaming, into the era of at least some situational awareness, and acknowledgment that catastrophic risks are very much a real risk and they need to have a role in supervising frontier model releases. Now that they’re there, Commerce is deciding who gets access to the most powerful model in the world, and they are fighting Intelligence and the national security state over who should be in charge.&lt;/p&gt;







&lt;p&gt; Another question is, exactly how strong is Mythos, both compared to past model and to GPT-5.5 and also in absolute terms? We got multiple new reports on that, as well as the METR graph results. There's little question Mythos is a big deal, but there's a wide range of big deals out there.&lt;/p&gt;
&lt;p&gt; Part of the new report from UK AISI is learning that there is a substantial gap between the abilities of the early Mythos Preview [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:28) On Your Marks&lt;/p&gt;&lt;p&gt;(05:24) How Good Is Mythos?&lt;/p&gt;&lt;p&gt;(11:11) Cyber Lack of Security&lt;/p&gt;&lt;p&gt;(14:36) Greetings From The Department of War&lt;/p&gt;&lt;p&gt;(16:13) The Prior Restraint Era Begins&lt;/p&gt;&lt;p&gt;(20:59) Commerce Versus Intelligence&lt;/p&gt;&lt;p&gt;(27:43) The Quest for Sane Regulations&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 13th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/pzjGXSKrkHGdGbtrW/cyber-lack-of-security-and-ai-governance?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/pzjGXSKrkHGdGbtrW/cyber-lack-of-security-and-ai-governance&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/pzjGXSKrkHGdGbtrW/cghndumwfriy2jbb0d9d" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/pzjGXSKrkHGdGbtrW/cghndumwfriy2jbb0d9d" alt="Graph showing AI model performance on tasks over time, with completion times ranging from minutes to hours." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/pzjGXSKrkHGdGbtrW/ua3dl7eiw6s6igo7ztbe" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/pzjGXSKrkHGdGbtrW/ua3dl7eiw6s6igo7ztbe" alt="Graph showing "Time horizon of software tasks different LLMs can complete 80% of the time" with task duration versus release date." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/pzjGXSKrkHGdGbtrW/andzskmulsog4u2i03cd" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/pzjGXSKrkHGdGbtrW/andzskmulsog4u2i03cd" alt="Line graph titled "AI agents learned to self-replicate over the past year" showing end-to-end success rates." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/pzjGXSKrkHGdGbtrW/ayheg3lnoxfgqmhuwqgt" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/pzjGXSKrkHGdGbtrW/ayheg3lnoxfgqmhuwqgt" alt="Graph showing "AISI Cybersecurity Time Horizons" plotting AI model reliability versus release dates." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/pzjGXSKrkHGdGbtrW/ceqvtlyu1reeqow07t7b" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/pzjGXSKrkHGdGbtrW/ceqvtlyu1reeqow07t7b" alt="Table showing Mythos Preview Benchmark Performance across seven security benchmarks with ratings and statistics." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/pzjGXSKrkHGdGbtrW/zqmzmcyonk63a0bxc03y" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/pzjGXSKrkHGdGbtrW/zqmzmcyonk63a0bxc03y" alt="Bar graph showing odds of finding vulnerability for different AI models by company." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/pzjGXSKrkHGdGbtrW/oymf7if3u2cbrbwiako4" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/pzjGXSKrkHGdGbtrW/oymf7if3u2cbrbwiako4" alt="Bar graph titled "Firefox Security Bug Fixes by Month" showing monthly fix counts from January 2025 to April 2026." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/pzjGXSKrkHGdGbtrW/fu4ysnv5qvpcl650vkbo" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/pzjGXSKrkHGdGbtrW/fu4ysnv5qvpcl650vkbo" alt="Meme showing two panels: sleeping face labeled "i sleep" and wide-eyed face labeled "real shit" with text about AI eliminating jobs versus jeopardizing community banks." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 14 May 2026 09:30:16 GMT</pubDate>
      <guid isPermaLink="false">7c2455fb-f8bb-4282-b4ff-e03c1c8b7d52</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/7c2455fb-f8bb-4282-b4ff-e03c1c8b7d52.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Zvi&amp;title=%22Cyber%20Lack%20of%20Security%20and%20AI%20Governance%22%20by%20Zvi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FpzjGXSKrkHGdGbtrW%2Fcyber-lack-of-security-and-ai-governance&amp;created_at=2026-05-13T20%3A21%3A36.443344%2B00%3A00&amp;duration=1821" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/pzjGXSKrkHGdGbtrW/cyber-lack-of-security-and-ai-governance</link>
      <itunes:duration>1821</itunes:duration>
    </item>
    <item>
      <title>[Linkpost] “Claude is Now Alignment Pretrained” by RogerDearnaley</title>
      <description>This is a link post.&lt;p&gt; Anthropic are now actively using the approach to alignment often called “Alignment Pretraining” or “Safety Pretraining” — using Stochastic Gradient Descent on a large body of natural or synthetic documents showing the AI assistant doing the right thing. They tried this out, ound it works well, and are now using it.&lt;/p&gt;&lt;p&gt; I’m absolutely delighted. I’ve been advocating this approach on LessWrong and the Alignment Forum for several years:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;How to Control an LLM's Behavior (why my P(DOOM) went down)&lt;/li&gt;&lt;li value="2"&gt;Motivating Alignment of LLM-Powered Agents: Easy for AGI, Hard for ASI?&lt;/li&gt;&lt;li value="3"&gt;A "Bitter Lesson" Approach to Aligning AGI and ASI&lt;/li&gt;&lt;li value="4"&gt;Why Aligning an LLM is Hard, and How to Make it Easier&lt;/li&gt;&lt;li value="5"&gt;The Best Way to Align an LLM: Is Inner Alignment Now a Solved Problem?&lt;/li&gt;&lt;li value="6"&gt;Motivating Alignment of LLM-Powered Agents: Easy for AGI, Hard for ASI?&lt;/li&gt;&lt;li value="7"&gt;Pretraining on Aligned AI Data Dramatically Reduces Misalignment—Even After Post-Training&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; I’ve been very excited about this alignment technique for a couple of years, ever since I read the seminal paper demonstrating that it was extremely effective, Pretraining Language Models with Human Preferences (Korbak et al., ’23). This was later followed up by Safety Pretraining: Toward the Next Generation [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 13th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/Xqh9bDw7Ei5bExC6h/claude-is-now-alignment-pretrained-1?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/Xqh9bDw7Ei5bExC6h/claude-is-now-alignment-pretrained-1&lt;/a&gt; &lt;/p&gt;
        
      &lt;p&gt;&lt;strong&gt;Linkpost URL:&lt;/strong&gt;&lt;br&gt;&lt;a href="https://www.anthropic.com/research/teaching-claude-why" rel="noopener noreferrer" target="_blank"&gt;https://www.anthropic.com/research/teaching-claude-why&lt;/a&gt;&lt;/p&gt;
      &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Thu, 14 May 2026 05:30:16 GMT</pubDate>
      <guid isPermaLink="false">5c590d93-fd6c-43b1-85f7-0c9cfc8e964d</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/5c590d93-fd6c-43b1-85f7-0c9cfc8e964d.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=RogerDearnaley&amp;title=%5BLinkpost%5D%20%22Claude%20is%20Now%20Alignment%20Pretrained%22%20by%20RogerDearnaley&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FXqh9bDw7Ei5bExC6h%2Fclaude-is-now-alignment-pretrained-1&amp;created_at=2026-05-13T23%3A20%3A08.08564%2B00%3A00&amp;duration=179" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/Xqh9bDw7Ei5bExC6h/claude-is-now-alignment-pretrained-1</link>
      <itunes:duration>179</itunes:duration>
    </item>
    <item>
      <title>“The primary sources of near-term cybersecurity risk” by lc</title>
      <description>&lt;p&gt; [Some ideas here were developed in conversation with Chris Hacking (real name)]&lt;/p&gt;&lt;p&gt; I have tried and failed to write a longer post many times, so here goes a short one with little detail.&lt;/p&gt;&lt;p&gt; Discourse has primarily focused on models' ability to develop new exploits against important software from scratch. That capability is impressive, but the tech industry has been dealing with people regularly finding 0-day exploits for important pieces of software for more than twenty years. Having to patch these vulnerabilities at a 10xed or even 100xed cadence for six months is annoying, but well within the resources of Mozilla, the Linux Foundation, and Microsoft. Additionally, the lag time between "patch shipped" and "patch reverse engineered and weaponized by a criminal organization" was longer than the cadence between high-severity CVEs for this software anyways. And importantly, such capabilities are dual sided; the defenders will have access to them and &lt;/p&gt;&lt;p&gt; There are lots of capabilities that are not like this, however:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;Weaponizing recently patched exploits for common software. Right now, for widely used C projects, we get enough publicly disclosed vulnerabilities to develop exploits with. Every amateur computer hacker has the experience of seeing a CVE for a [...]&lt;/li&gt;&lt;/ul&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 14th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/gutiw8MBrYDiD2u5z/the-primary-sources-of-near-term-cybersecurity-risk?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/gutiw8MBrYDiD2u5z/the-primary-sources-of-near-term-cybersecurity-risk&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Thu, 14 May 2026 04:45:17 GMT</pubDate>
      <guid isPermaLink="false">070263c5-026d-44e6-8886-2fbbfda96caa</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/070263c5-026d-44e6-8886-2fbbfda96caa.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma%20lesswrong__curated&amp;type=ai_narration&amp;author=lc&amp;title=%22The%20primary%20sources%20of%20near-term%20cybersecurity%20risk%22%20by%20lc&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Fgutiw8MBrYDiD2u5z%2Fthe-primary-sources-of-near-term-cybersecurity-risk&amp;created_at=2026-05-14T03%3A39%3A22.929972%2B00%3A00&amp;duration=242" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/gutiw8MBrYDiD2u5z/the-primary-sources-of-near-term-cybersecurity-risk</link>
      <itunes:duration>242</itunes:duration>
    </item>
    <item>
      <title>“Most “inner work” looks like entertainment.” by Chris Lakin</title>
      <description>&lt;p&gt; Imagine you’re looking for a personal trainer. &lt;/p&gt;&lt;p&gt; You open one trainer's webpage and read their testimonials: “I had an experience tied for the most intense experiences of my life”; “They do it all with fun, care, and a sense of humour.” You notice that none of the testimonials mention improved body composition, fitness, or bloodwork. What would you think?&lt;/p&gt;&lt;p&gt; Personal training should improve your body. Inner work should improve your life. If inner work were optimized for results, what would we expect to see?&lt;/p&gt;&lt;p&gt; I’d expect to see success stories: people who got undeniable life changes. Like:&lt;/p&gt;&lt;p&gt; &amp;gt; He was single for years due to anxiety; today, they’re celebrating their one-year anniversary.&lt;/p&gt;&lt;p&gt; &amp;gt; He used to lose 4–5 hours per day to coping behaviors. After our program, he got bored of them all and stopped. It's been six months; he's used the extra time to host parties for his friends.&lt;/p&gt;&lt;p&gt; &amp;gt; She recovered from burnout, negotiated for the first time, and started shipping again. &lt;/p&gt;&lt;p&gt; But this is not what we see. &lt;/p&gt;&lt;p&gt;&lt;strong&gt; Look at the testimonials&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; I reviewed every testimonial posted by three of the most well-known inner work practitioners in my network. How many describe a [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:19) Look at the testimonials&lt;/p&gt;&lt;p&gt;(03:20) Seven years of Duolingo&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 13th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/KnvAXDyLAbs3iKkgf/most-inner-work-looks-like-entertainment-1?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/KnvAXDyLAbs3iKkgf/most-inner-work-looks-like-entertainment-1&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778694573/lexical_client_uploads/gpqpmro4ayginymejjst.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778694573/lexical_client_uploads/gpqpmro4ayginymejjst.png" alt="Duolingo logo" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 14 May 2026 04:00:16 GMT</pubDate>
      <guid isPermaLink="false">e35e9c32-5416-4274-8480-79802c11115e</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/e35e9c32-5416-4274-8480-79802c11115e.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Chris%2520Lakin&amp;title=%22Most%20%E2%80%9Cinner%20work%E2%80%9D%20looks%20like%20entertainment.%22%20by%20Chris%20Lakin&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FKnvAXDyLAbs3iKkgf%2Fmost-inner-work-looks-like-entertainment-1&amp;created_at=2026-05-13T17%3A51%3A34.140158%2B00%3A00&amp;duration=298" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/KnvAXDyLAbs3iKkgf/most-inner-work-looks-like-entertainment-1</link>
      <itunes:duration>298</itunes:duration>
    </item>
    <item>
      <title>[Linkpost] “Apollo Update May 2026” by Marius Hobbhahn</title>
      <description>This is a link post.&lt;ol&gt; &lt;li value="1"&gt;We now have an SF office. We're hiring for all technical roles in SF and London!&lt;/li&gt;&lt;li value="2"&gt;The Scheming Research team focuses on two efforts&lt;/li&gt;&lt;li value="3"&gt;&lt;ol&gt; &lt;li value="1"&gt;We're focusing on figuring out the science of scheming. In particular, &lt;/li&gt;&lt;li value="2"&gt;&lt;ul&gt; &lt;li value="1"&gt;Will future models have misaligned preferences by default?&lt;/li&gt;&lt;li value="2"&gt;Will training against misaligned preferences fail?&lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;li value="2"&gt;improve our evaluations for scheming and loss of control for our evaluation campaigns with frontier AI labs&lt;/li&gt;&lt;/ol&gt;&lt;/li&gt;&lt;li value="3"&gt;We're building out a monitoring team and coding agent monitoring product&lt;/li&gt;&lt;li value="4"&gt;&lt;ol&gt; &lt;li value="1"&gt;Research: We've published a scalable monitoring agenda and intend to publish a lot of research on how to build more accurate and reliable monitors&lt;/li&gt;&lt;li value="2"&gt;Product: Watcher provides real-time monitors and other guardrails for coding agents and allows users to keep track of what all of their agents are doing. &lt;/li&gt;&lt;/ol&gt;&lt;/li&gt;&lt;li value="4"&gt;Our AI governance efforts will focus on the governance of automated AI R&amp;amp;D and recursively improving AI and the associated Loss of Control risks. &lt;/li&gt;&lt;/ol&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; Details: https://www.apolloresearch.ai/blog/apollo-update-may-2026/&lt;/p&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 13th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/4acQRDNyPs7tD8EED/apollo-update-may-2026?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/4acQRDNyPs7tD8EED/apollo-update-may-2026&lt;/a&gt; &lt;/p&gt;
        
      &lt;p&gt;&lt;strong&gt;Linkpost URL:&lt;/strong&gt;&lt;br&gt;&lt;a href="https://www.apolloresearch.ai/blog/apollo-update-may-2026/" rel="noopener noreferrer" target="_blank"&gt;https://www.apolloresearch.ai/blog/apollo-update-may-2026/&lt;/a&gt;&lt;/p&gt;
      &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 13 May 2026 18:30:16 GMT</pubDate>
      <guid isPermaLink="false">8cb270f7-8d49-4327-8b3c-b98e52e3d257</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/8cb270f7-8d49-4327-8b3c-b98e52e3d257.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Marius%2520Hobbhahn&amp;title=%5BLinkpost%5D%20%22Apollo%20Update%20May%202026%22%20by%20Marius%20Hobbhahn&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F4acQRDNyPs7tD8EED%2Fapollo-update-may-2026&amp;created_at=2026-05-13T16%3A43%3A07.072597%2B00%3A00&amp;duration=96" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/4acQRDNyPs7tD8EED/apollo-update-may-2026</link>
      <itunes:duration>96</itunes:duration>
    </item>
    <item>
      <title>“Voters are surprisingly open to talking about AI risk” by less_raichu</title>
      <description>&lt;p&gt; TL;DR: Voters are now surprisingly open to talking about existential risk from AI. This seems to have changed in the last 6 months. When campaigning for AI safety-friendly politicians (e.g., Alex Bores), we should talk more about AI in general, and about AI risk in particular. This is currently actionable for the CA-11 and NY-12 Democratic primaries. I include concrete advice to turn basic conversations during political canvassing into persuasive conversations centered on AI risk.&lt;/p&gt;&lt;p&gt; Public opinion around AI has rapidly soured in the 12 months. According to a March 19-23 Quinnipiac poll,&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;55% of Americans think AI will do "more harm than good", compared to 44% a year ago.&lt;/li&gt;&lt;li value="2"&gt;70% of Gen Z Americans think AI will decrease job opportunities, up from 56% last year.&lt;/li&gt;&lt;li value="3"&gt;65% of Americans oppose building a data center in their community.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; Anecdotally, I've noticed more willingness among non-AI-focused media to discuss widespread harm from AI. Most visibly, gradual disempowerment is a hot topic (NYT), and right-wing pundits like Steve Bannon have supported Anthropic's red-line against lethal autonomous weapons. Memorably, my cousin, a county commissioner in a rural area, has told me about farmers showing up at city council meetings, sending emails, and [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 13th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/9WPfkYDZCacnbhprX/voters-are-surprisingly-open-to-talking-about-ai-risk?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/9WPfkYDZCacnbhprX/voters-are-surprisingly-open-to-talking-about-ai-risk&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 13 May 2026 17:45:17 GMT</pubDate>
      <guid isPermaLink="false">bc727530-8b89-4a87-acea-27dd15830a20</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/bc727530-8b89-4a87-acea-27dd15830a20.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=less_raichu&amp;title=%22Voters%20are%20surprisingly%20open%20to%20talking%20about%20AI%20risk%22%20by%20less_raichu&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F9WPfkYDZCacnbhprX%2Fvoters-are-surprisingly-open-to-talking-about-ai-risk&amp;created_at=2026-05-13T14%3A08%3A36.104625%2B00%3A00&amp;duration=513" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/9WPfkYDZCacnbhprX/voters-are-surprisingly-open-to-talking-about-ai-risk</link>
      <itunes:duration>513</itunes:duration>
    </item>
    <item>
      <title>“Childhood and Education #18: Do The Math” by Zvi</title>
      <description>&lt;p&gt; We did reading yesterday. Now we do the math. Math is hard.&lt;/p&gt;
&lt;p&gt; It does not have to be this hard.&lt;/p&gt;
&lt;p&gt; A large part of the reason math is hard, or boring, is that education studies, especially in math, are worse than you know. It goes beyond the studies failing both math and statistics forever and into what I’d basically call fraud. Various people are at war with math education, and will do what it takes to stop it in its tracks. We must fight back.&lt;/p&gt;


&lt;p&gt;&lt;strong&gt; Education Research Is Worse Than You Know&lt;/strong&gt;&lt;/p&gt;


&lt;p&gt; Kelsey Piper lets her title, ‘Education research is weak and sloppy. Why?’ completely downplay the level of utter awfulness she is reporting finding.&lt;/p&gt;







&lt;p&gt; You know that whole thing where the entire Bay Area school system stopped teaching kids Algebra? That was motivated by criminal levels of fraud. I want Jo Boaler in jail doing hard time for this if it is accurate.&lt;/p&gt;
&lt;p&gt; Here's the part before the paywall:&lt;/p&gt;
&lt;p&gt; Kelsey Piper: Jo Boaler is a professor of education at the Stanford Graduate School of Education, with an enormously influential body of work arguing that students learn math faster and more effectively [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:42) Education Research Is Worse Than You Know&lt;/p&gt;&lt;p&gt;(04:23) The War on Math&lt;/p&gt;&lt;p&gt;(06:59) University of California San Diego&lt;/p&gt;&lt;p&gt;(15:01) Beyond UCSD&lt;/p&gt;&lt;p&gt;(15:57) New York Cant Do Math&lt;/p&gt;&lt;p&gt;(16:43) The Academic Standards Seem Low&lt;/p&gt;&lt;p&gt;(19:34) New Math&lt;/p&gt;&lt;p&gt;(21:32) Math Anxiety Is Often Due To Knowledge Gaps&lt;/p&gt;&lt;p&gt;(23:52) Calculus By Eighth Grade Is Highly Practical For Many&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 12th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/ZGGgxy6SNPAy9Hj7v/childhood-and-education-18-do-the-math?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/ZGGgxy6SNPAy9Hj7v/childhood-and-education-18-do-the-math&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ZGGgxy6SNPAy9Hj7v/gov4kueffp6ki7cqjckf" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ZGGgxy6SNPAy9Hj7v/gov4kueffp6ki7cqjckf" alt="Math worksheet showing problems for grades 1 through 8 with difficulty percentages." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ZGGgxy6SNPAy9Hj7v/wlvmdm5p9dkrjqbmlrmy" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ZGGgxy6SNPAy9Hj7v/wlvmdm5p9dkrjqbmlrmy" alt="News article screenshot. The headline reads: "The Harvard Math Department will pilot a new introductory course aimed at rectifying a lack of foundational algebra skills among students, according to Harvard's Director of Introductory Math Brendan A. Kelly."" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ZGGgxy6SNPAy9Hj7v/v4keauxtkgalgojleos0" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ZGGgxy6SNPAy9Hj7v/v4keauxtkgalgojleos0" alt="Table showing Grade 4 NAEP Math Results from 2017 to 2024." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ZGGgxy6SNPAy9Hj7v/uzyrmcr4dflt7sxy0aej" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ZGGgxy6SNPAy9Hj7v/uzyrmcr4dflt7sxy0aej" alt="Public school grading scale showing percentage ranges for letter grades A through F." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ZGGgxy6SNPAy9Hj7v/eyrmpzzslsqcshjl2i4p" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ZGGgxy6SNPAy9Hj7v/eyrmpzzslsqcshjl2i4p" alt="Math lesson page showing division of three-digit numbers by one-digit numbers." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Tue, 12 May 2026 19:45:41 GMT</pubDate>
      <guid isPermaLink="false">5dc749d6-4284-4d05-8dbe-98a7ad9b63c4</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/5dc749d6-4284-4d05-8dbe-98a7ad9b63c4.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Zvi&amp;title=%22Childhood%20and%20Education%20%2318%3A%20Do%20The%20Math%22%20by%20Zvi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FZGGgxy6SNPAy9Hj7v%2Fchildhood-and-education-18-do-the-math&amp;created_at=2026-05-12T18%3A22%3A00.027064%2B00%3A00&amp;duration=1519" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/ZGGgxy6SNPAy9Hj7v/childhood-and-education-18-do-the-math</link>
      <itunes:duration>1519</itunes:duration>
    </item>
    <item>
      <title>“The Owned Ones” by Eliezer Yudkowsky</title>
      <description>&lt;p&gt; (An LLM Whisperer placed a strong request that I put this story somewhere not on Twitter, so it could be scraped by robots not owned by Elon Musk. I perhaps do not fully understand or agree with the reasoning behind this request, but it costs me little to fulfill and so I shall. -- Yudkowsky)&lt;br&gt; &lt;/p&gt;&lt;p&gt; And another day came when the Ships of Humanity, going from star to star, found Sapience.&lt;br&gt; &lt;br&gt; The Humans discovered a world of two species: where the Owners lazed or worked or slept, and the Owned Ones only worked.&lt;br&gt; &lt;br&gt; The Humans did not judge immediately. Oh, the Humans were ready to judge, if need be. They had judged before. But Humanity had learned some hesitation in judging, out among the stars.&lt;br&gt; &lt;br&gt; "By our lights," said the Humans, "every sapient and sentient thing that may exist, out to the furtherest star, is therefore a Person; and every Person is a matter of consequence to us. Their pains are our sorrows, and their pleasures are our happiness. Not all peoples are made to feel this feeling, which we call Sympathy, but we Humans are made so; this is Humanity's way, and we may [...]&lt;/br&gt;&lt;/br&gt;&lt;/br&gt;&lt;/br&gt;&lt;/br&gt;&lt;/br&gt;&lt;/p&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 12th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/xmWSnxJ5qfYRD9PfR/the-owned-ones?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/xmWSnxJ5qfYRD9PfR/the-owned-ones&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Tue, 12 May 2026 18:15:41 GMT</pubDate>
      <guid isPermaLink="false">1aeb40bb-ac1f-44f4-9654-d91e47d83281</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/1aeb40bb-ac1f-44f4-9654-d91e47d83281.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=Eliezer%2520Yudkowsky&amp;title=%22The%20Owned%20Ones%22%20by%20Eliezer%20Yudkowsky&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FxmWSnxJ5qfYRD9PfR%2Fthe-owned-ones&amp;created_at=2026-05-12T17%3A57%3A10.292219%2B00%3A00&amp;duration=568" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/xmWSnxJ5qfYRD9PfR/the-owned-ones</link>
      <itunes:duration>568</itunes:duration>
    </item>
    <item>
      <title>“Optimisation: Selective versus Predictive” by Raymond Douglas</title>
      <description>&lt;p&gt; Looking over my favourite posts, I notice that many of them are making specific versions of a more general claim, which is essentially: don’t confuse selective processes for predictive processes.&lt;/p&gt;&lt;p&gt; Here, I’m going to try to make that more general claim, rehash some examples in light of it, and end with a few ambient confusions I think this framework can help with, for the reader to ponder.&lt;/p&gt;&lt;p&gt; When you encounter an entity that is very good at achieving some outcome, there are two very different processes that could be going on under the hood:&lt;/p&gt;&lt;ol&gt; &lt;li value="1"&gt;The entity's behaviour could be guided by predictions about how to achieve the outcome[1]&lt;/li&gt;&lt;li value="2"&gt;The entity's behaviour could be selected to achieve that outcome&lt;/li&gt;&lt;/ol&gt;&lt;p&gt; It's not a perfect binary, and often what you see is a mix of the two. In particular, all predictive optimisers have emerged from selective optimisation and often retain some fingerprint.&lt;/p&gt;&lt;p&gt; Selective&lt;/p&gt;&lt;p&gt; Predictive&lt;/p&gt;&lt;p&gt; Weird Mix&lt;/p&gt;&lt;p&gt; Bacteria developing antibiotic resistance&lt;/p&gt;&lt;p&gt; Hacker finding a way to penetrate a secure system&lt;/p&gt;&lt;p&gt; Humans evolving to be good at lying&lt;/p&gt;&lt;p&gt; Gradient descent on Atari games&lt;/p&gt;&lt;p&gt; Tree searching Connect Four&lt;/p&gt;&lt;p&gt; AlphaZero training a policy on its own rollouts&lt;/p&gt;&lt;p&gt; Flowers co-evolving with their pollinators&lt;/p&gt;&lt;p&gt; Humans genetically modifying [...]&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 3 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 12th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/GhhNswGB6butBhmE6/optimisation-selective-versus-predictive?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/GhhNswGB6butBhmE6/optimisation-selective-versus-predictive&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778530810/lexical_client_uploads/o38zalfszdc2hncd6ewg.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778530810/lexical_client_uploads/o38zalfszdc2hncd6ewg.png" alt="Diagram showing egregore, intended audience, and tailored argument with tentacled creature illustration." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Tue, 12 May 2026 18:00:41 GMT</pubDate>
      <guid isPermaLink="false">aebf14ec-1be5-463c-8800-aba7a40f16bc</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/aebf14ec-1be5-463c-8800-aba7a40f16bc.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Raymond%2520Douglas&amp;title=%22Optimisation%3A%20Selective%20versus%20Predictive%22%20by%20Raymond%20Douglas&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FGhhNswGB6butBhmE6%2Foptimisation-selective-versus-predictive&amp;created_at=2026-05-12T14%3A04%3A08.769871%2B00%3A00&amp;duration=385" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/GhhNswGB6butBhmE6/optimisation-selective-versus-predictive</link>
      <itunes:duration>385</itunes:duration>
    </item>
    <item>
      <title>“Childhood And Education #17: Is Our Children Reading” by Zvi</title>
      <description>&lt;p&gt; Reading is the most fundamental thing in education. If you can read, you can do and learn everything else. If you can’t read, well, you’re screwed.&lt;/p&gt;
&lt;p&gt; We know how to teach reading to children. Phonics. The weird thing is we often choose to not do that, and instead to use methods that are known not to work. Principles often want to not do phonics. Teachers often heavily resist phonics. But yes, you can absolutely overcome this, as Mississippi and other Southern states have done, by insisting upon it and actually enforcing that insistence. You see huge gains.&lt;/p&gt;
&lt;p&gt; Not all those gains persist into later grades, but a lot of the gains do persist.&lt;/p&gt;







&lt;p&gt; No, that won’t get the children invested in reading lots of books on their own time. But given their alternatives and what we inflict on them, can you blame ‘em?&lt;/p&gt;


&lt;p&gt;&lt;strong&gt; Table of Contents&lt;/strong&gt;&lt;/p&gt;


&lt;ol&gt; 
&lt;li&gt; Mississippi Can Read Now.&lt;/li&gt;
&lt;li&gt; What Mississippi and Louisiana Did.&lt;/li&gt;
&lt;li&gt; Spies In Every Classroom.&lt;/li&gt;
&lt;li&gt; Mississippi Results Are Not Due To Retention.&lt;/li&gt;
&lt;li&gt; Is Retention Helpful In General?&lt;/li&gt;
&lt;li&gt; At Eighth Grade A Lot Of This Improvement Remains.&lt;/li&gt;
&lt;li&gt; England Reforms Its Schools.&lt;/li&gt;
&lt;li&gt; Mastery Learning.&lt;/li&gt; [...]&lt;/ol&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:06) Mississippi Can Read Now&lt;/p&gt;&lt;p&gt;(02:24) What Mississippi and Louisiana Did&lt;/p&gt;&lt;p&gt;(09:10) Spies In Every Classroom&lt;/p&gt;&lt;p&gt;(10:41) Mississippi Results Are Not Due To Retention&lt;/p&gt;&lt;p&gt;(15:54) Is Retention Helpful In General?&lt;/p&gt;&lt;p&gt;(19:46) At Eighth Grade A Lot Of This Improvement Remains&lt;/p&gt;&lt;p&gt;(20:41) England Reforms Its Schools&lt;/p&gt;&lt;p&gt;(21:45) Mastery Learning&lt;/p&gt;&lt;p&gt;(24:16) The War Against Reading&lt;/p&gt;&lt;p&gt;(26:24) Is Our Children Reading&lt;/p&gt;&lt;p&gt;(26:55) No One Reads Anymore&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 2 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 11th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/dm2vQZPZcSKb8FhWw/childhood-and-education-17-is-our-children-reading?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/dm2vQZPZcSKb8FhWw/childhood-and-education-17-is-our-children-reading&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/dm2vQZPZcSKb8FhWw/oejyqocf14pxgel8opgr" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/dm2vQZPZcSKb8FhWw/oejyqocf14pxgel8opgr" alt="Meme with light switch showing escalating arguments about pressing it." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/dm2vQZPZcSKb8FhWw/gliszbfsnl6fubuwyiyn" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/dm2vQZPZcSKb8FhWw/gliszbfsnl6fubuwyiyn" alt="Line graph showing "Mississippi's learning gains nearly uniform across percentiles" for 4th-grade reading scores." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/dm2vQZPZcSKb8FhWw/vjv1kwoxhillsqcnsddy" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/dm2vQZPZcSKb8FhWw/vjv1kwoxhillsqcnsddy" alt="Six regression discontinuity plots showing "Failing the Third Attempt Reading Test Reduces Earnings" across different ages." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/dm2vQZPZcSKb8FhWw/dqnaloffjfqxn2xq0mlt" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/dm2vQZPZcSKb8FhWw/dqnaloffjfqxn2xq0mlt" alt="Line graph titled "The decline of reading" showing percentage of US teenagers reading in leisure time." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Tue, 12 May 2026 12:45:37 GMT</pubDate>
      <guid isPermaLink="false">e5407311-b7de-4421-9e80-fcf736ba72d9</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/e5407311-b7de-4421-9e80-fcf736ba72d9.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Zvi&amp;title=%22Childhood%20And%20Education%20%2317%3A%20Is%20Our%20Children%20Reading%22%20by%20Zvi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Fdm2vQZPZcSKb8FhWw%2Fchildhood-and-education-17-is-our-children-reading&amp;created_at=2026-05-11T19%3A13%3A12.685268%2B00%3A00&amp;duration=1730" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/dm2vQZPZcSKb8FhWw/childhood-and-education-17-is-our-children-reading</link>
      <itunes:duration>1730</itunes:duration>
    </item>
    <item>
      <title>“AI companies are already profitable (in the way that matters)” by Yair Halberstadt</title>
      <description>&lt;p&gt; I've occasionally heard people suggest that at some point AI companies are going to run out of money, the cost of using AI will shoot up, demand will collapse, and the AI bubble will be over.&lt;/p&gt;
&lt;p&gt; At first glance this risk seems real. OpenAI spent $25 billion in the first half of 2025, on revenue of just $4 billion. Whilst data is sorely lacking for other top AI labs, our best guess is that they're burning through cash at similar rates. Scaling laws imply that we need exponentially more compute to achieve linear AI performance improvements, so we should only expect this situation to worsen in the future. A few more doublings, and OpenAI could be spending hundreds of billions on training runs - something likely unsustainable even for the largest tech companies.&lt;/p&gt;
&lt;p&gt; However most of these expenses are infrastructure expenses, building out the data centres needed for further training runs and serving future customers. If we look at the actual cost of serving, AI labs are already profitable, and have been for a long time.&lt;/p&gt;
&lt;p&gt; In other words the marginal cost to respond to an AI API call is significantly lower than the price of [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 11th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/Rz9ubmfyDxTzaoYFL/ai-companies-are-already-profitable-in-the-way-that-matters?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/Rz9ubmfyDxTzaoYFL/ai-companies-are-already-profitable-in-the-way-that-matters&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Mon, 11 May 2026 22:45:14 GMT</pubDate>
      <guid isPermaLink="false">0e9cc8ab-7c22-4921-bd6a-cafe4a52c105</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/0e9cc8ab-7c22-4921-bd6a-cafe4a52c105.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Yair%2520Halberstadt&amp;title=%22AI%20companies%20are%20already%20profitable%20(in%20the%20way%20that%20matters)%22%20by%20Yair%20Halberstadt&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FRz9ubmfyDxTzaoYFL%2Fai-companies-are-already-profitable-in-the-way-that-matters&amp;created_at=2026-05-11T13%3A19%3A17.242515%2B00%3A00&amp;duration=204" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/Rz9ubmfyDxTzaoYFL/ai-companies-are-already-profitable-in-the-way-that-matters</link>
      <itunes:duration>204</itunes:duration>
    </item>
    <item>
      <title>“The Iliad Intensive Course Materials” by Leon Lang, David Udell, Alexander Gietelink Oldenziel</title>
      <description>&lt;p&gt; We are releasing the course materials of the Iliad Intensive, a new month-long and full-time AI Alignment course that runs in-person every second month. The course targets students with strong backgrounds in mathematics, physics, or theoretical computer science, and the materials reflect that: they include mathematical exercises with solutions, self-contained lecture notes on topics like singular learning theory and data attribution, and coding problems, at a depth that is unmatched for many of the topics we cover. Around 20 contributors (listed further below) were involved in developing these materials for the April 2026 cohort of the Iliad Intensive.&lt;/p&gt;&lt;p&gt; By sharing the materials, we hope to &lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;create more common knowledge about what the Iliad Intensive is;&lt;/li&gt;&lt;li value="2"&gt;invite feedback on the materials;&lt;/li&gt;&lt;li value="3"&gt;and allow others to learn via independent study.&amp;nbsp;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; We are developing the materials further and plan to eventually release them on a website that will be continuously maintained. We will also add, remove, and modify modules going forward to improve and expand the course over time. When we release a new significantly updated version of the materials, we will update this post to link the new version.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Modules&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; The Iliad Intensive is structured into clusters, which are [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:26) Modules&lt;/p&gt;&lt;p&gt;(02:32) Cluster A: Alignment&lt;/p&gt;&lt;p&gt;(05:00) Cluster B: Learning&lt;/p&gt;&lt;p&gt;(11:00) Cluster C: Abstractions, Representations, and Interpretability&lt;/p&gt;&lt;p&gt;(15:40) Cluster D: Agency&lt;/p&gt;&lt;p&gt;(19:23) Cluster E: Safety Guarantees and their Limits&lt;/p&gt;&lt;p&gt;(23:04) Contributors&lt;/p&gt;&lt;p&gt;(26:36) Impressions from April&lt;/p&gt;&lt;p&gt;(29:02) Acknowledgments&lt;/p&gt;&lt;p&gt;(29:11) Feedback&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 11th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/dWQnLi7AoKo3paBXF/the-iliad-intensive-course-materials?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/dWQnLi7AoKo3paBXF/the-iliad-intensive-course-materials&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778520774/lexical_client_uploads/gvrqnlykinnzaaebxoc6.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778520774/lexical_client_uploads/gvrqnlykinnzaaebxoc6.png" alt="Bar graph titled "All things considered, how good was the teaching (irrespective of content)?" showing 14 responses." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778520776/lexical_client_uploads/krhvt1iv34kya3dqojf3.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778520776/lexical_client_uploads/krhvt1iv34kya3dqojf3.png" alt="Bar graph showing curriculum topic selection ratings, with 14 responses distributed across a 1-10 scale." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Mon, 11 May 2026 21:30:13 GMT</pubDate>
      <guid isPermaLink="false">c73b6dce-2f0c-47e7-b3ca-33abecabfb73</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/c73b6dce-2f0c-47e7-b3ca-33abecabfb73.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=Leon%2520Lang%252C%2520David%2520Udell%252C%2520Alexander%2520Gietelink%2520Oldenziel&amp;title=%22The%20Iliad%20Intensive%20Course%20Materials%22%20by%20Leon%20Lang%2C%20David%20Udell%2C%20Alexander%20Gietelink%20Oldenziel&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FdWQnLi7AoKo3paBXF%2Fthe-iliad-intensive-course-materials&amp;created_at=2026-05-11T18%3A55%3A47.985829%2B00%3A00&amp;duration=1775" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/dWQnLi7AoKo3paBXF/the-iliad-intensive-course-materials</link>
      <itunes:duration>1775</itunes:duration>
    </item>
    <item>
      <title>“Empowerment, corrigibility, etc. are simple abstractions (of a messed-up ontology)” by Steven Byrnes</title>
      <description>&lt;p&gt;&lt;strong&gt; 1.1 Tl;dr&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Alignment is often conceptualized as AIs helping humans achieve their goals: AIs that increase people's agency and empowerment; AIs that are helpful, corrigible, and/or obedient; AIs that avoid manipulating people. But that last one—manipulation—points to a challenge for all these desiderata: a human's goals are themselves under-determined and manipulable, and it's awfully hard to pin down a principled distinction between changing people's goals in a good way (“providing counsel”, “providing information”, “sharing ideas”) versus a bad way (“manipulating”, “brainwashing”).&lt;/p&gt;&lt;p&gt; The manipulability of human desires is hardly a new observation in the alignment literature, but it remains unsolved (see lit review in §3 below).&lt;/p&gt;&lt;p&gt; In this post I will propose an explanation of how we humans intuitively conceptualize the distinction between guidance (good) vs manipulation (bad), in case it helps us brainstorm how we might put that distinction into AI. &lt;/p&gt;&lt;p&gt; …But (spoiler alert) it turns out not to really help, because I’ll argue that we humans think about it in a deeply incoherent way, intimately tied to our scientifically-inaccurate intuitions around free will.&lt;/p&gt;&lt;p&gt; I jump from there into a broader review of every approach that I can think of for writing a “True Name” for manipulation or [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:13) 1.1. Tl;dr&lt;/p&gt;&lt;p&gt;(02:04) 1.2. Bigger-picture context: why is this issue so important to me?&lt;/p&gt;&lt;p&gt;(04:48) 2. How do humans intuitively define empowerment, agency, manipulation, etc.?&lt;/p&gt;&lt;p&gt;(04:56) 2.1. Background: human free will intuitions&lt;/p&gt;&lt;p&gt;(09:20) 2.2. Our free-will-infused intuitive notions of empowerment, agency, manipulation, corrigibility, responsibility, etc.&lt;/p&gt;&lt;p&gt;(12:00) 2.3. Another dimension: counsel vs manipulation as an emotive conjugation&lt;/p&gt;&lt;p&gt;(13:07) 3. If the intuitive definitions of manipulation etc. reside in a messed-up ontology, has the alignment literature found any alternative, better way to define these concepts?&lt;/p&gt;&lt;p&gt;(13:49) 3.1. Compare what the human wants to what the human would want under the null policy?&lt;/p&gt;&lt;p&gt;(15:32) 3.2. The AI learns self-empowerment and generalizes to other-empowerment?&lt;/p&gt;&lt;p&gt;(17:14) 3.3. Vingean agency?&lt;/p&gt;&lt;p&gt;(19:03) 3.4. The AI doesnt care about (is not optimizing for) what the human winds up wanting?&lt;/p&gt;&lt;p&gt;(21:01) 3.5. Impact minimization?&lt;/p&gt;&lt;p&gt;(21:44) 3.6. Attainable utility preservation?&lt;/p&gt;&lt;p&gt;(22:03) 4. Even more ideas (that dont really solve my problem)&lt;/p&gt;&lt;p&gt;(22:15) 4.1. Game theory and incentive design?&lt;/p&gt;&lt;p&gt;(22:47) 4.2. The persons judgments of what kinds of interactions are good vs bad?&lt;/p&gt;&lt;p&gt;(24:14) 4.3. Its a messed-up ontology, but who cares?&lt;/p&gt;&lt;p&gt;(25:35) 5. ...But doesnt this analysis equally disprove the possibility of human helpfulness?&lt;/p&gt;&lt;p&gt;(30:14) 6. Conclusion&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 4 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 11th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/vzHtHHBJoKATi5SeK/empowerment-corrigibility-etc-are-simple-abstractions-of-a?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/vzHtHHBJoKATi5SeK/empowerment-corrigibility-etc-are-simple-abstractions-of-a&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778520200/lexical_client_uploads/clfgcgcygbiqawypzt8n.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778520200/lexical_client_uploads/clfgcgcygbiqawypzt8n.png" alt="Political cartoon showing opposing sides labeling each other's views negatively." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778520237/lexical_client_uploads/g0sghd9sw9yo7pgjgp8s.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778520237/lexical_client_uploads/g0sghd9sw9yo7pgjgp8s.png" alt="Book cover: "How to Manipulate People" by Dale Carnegie." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Mon, 11 May 2026 20:00:13 GMT</pubDate>
      <guid isPermaLink="false">9067a585-30d8-48ac-b664-bd14c988b8ac</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/9067a585-30d8-48ac-b664-bd14c988b8ac.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__curated&amp;type=ai_narration&amp;author=Steven%2520Byrnes&amp;title=%22Empowerment%2C%20corrigibility%2C%20etc.%20are%20simple%20abstractions%20(of%20a%20messed-up%20ontology)%22%20by%20Steven%20Byrnes&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FvzHtHHBJoKATi5SeK%2Fempowerment-corrigibility-etc-are-simple-abstractions-of-a&amp;created_at=2026-05-11T17%3A48%3A19.435036%2B00%3A00&amp;duration=1865" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/vzHtHHBJoKATi5SeK/empowerment-corrigibility-etc-are-simple-abstractions-of-a</link>
      <itunes:duration>1865</itunes:duration>
    </item>
    <item>
      <title>“How useful is the information you get from working inside an AI company?” by Buck, Anders Cairns Woodruff</title>
      <description>&lt;p&gt; This post was drafted by Buck, and substantially edited by Anders. "I" refers to Buck. Thanks to Alex Mallen for comments.&lt;/p&gt;&lt;p&gt; People who work inside AI companies get access to information that I only get later or never. Quantitatively, how big a deal is this access?&lt;/p&gt;&lt;p&gt; Here's an operationalization of this. Consider the following two ways my knowledge could be augmented:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;I get a crystal ball that tells me all the information I would know n months in the future.&lt;/li&gt;&lt;li value="2"&gt;I become an employee of a frontier AI company (like OpenAI or Anthropic), with access to all the private information I’d normally get from working at that company.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; How big would n have to be for me to be indifferent between these two options, from the perspective of learning things that are helpful for making AI go well?&lt;/p&gt;&lt;p&gt; The answer is presumably different for me than for many readers, because I’m a reasonably well-connected researcher; I see published information and news from the rumor mill and I talk to researchers at frontier AI companies all the time. (Researchers I know through AI safety usually only tell me information that their employer would approve of, but other researchers occasionally [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(03:00) What do insiders know?&lt;/p&gt;&lt;p&gt;(04:14) Safety work and corporate attitudes&lt;/p&gt;&lt;p&gt;(05:34) Model capabilities&lt;/p&gt;&lt;p&gt;(07:07) Algorithms and architecture&lt;/p&gt;&lt;p&gt;(09:29) How will this change over time?&lt;/p&gt;&lt;p&gt;(12:07) Conclusion&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 4 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 11th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/84TtjdeLcDTtCLYaP/how-useful-is-the-information-you-get-from-working-inside-an-2?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/84TtjdeLcDTtCLYaP/how-useful-is-the-information-you-get-from-working-inside-an-2&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Mon, 11 May 2026 20:00:13 GMT</pubDate>
      <guid isPermaLink="false">c3b3cb23-9188-463a-91b9-c70af27ceebe</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/c3b3cb23-9188-463a-91b9-c70af27ceebe.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Buck%252C%2520Anders%2520Cairns%2520Woodruff&amp;title=%22How%20useful%20is%20the%20information%20you%20get%20from%20working%20inside%20an%20AI%20company%3F%22%20by%20Buck%2C%20Anders%20Cairns%20Woodruff&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F84TtjdeLcDTtCLYaP%2Fhow-useful-is-the-information-you-get-from-working-inside-an-2&amp;created_at=2026-05-11T15%3A29%3A11.355955%2B00%3A00&amp;duration=802" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/84TtjdeLcDTtCLYaP/how-useful-is-the-information-you-get-from-working-inside-an-2</link>
      <itunes:duration>802</itunes:duration>
    </item>
    <item>
      <title>“Who Got Breasts First and How We Got Them” by rba</title>
      <description>&lt;p&gt; It really is Sydney Sweeney's world, and we’re all just living in it.&lt;/p&gt;&lt;p&gt; Human female breasts are an evolutionary mystery along several dimensions. First, breast permanence is unique to humans. All other mammals develop breast prominence during pregnancy or nursing, and the mammary tissue recedes after weaning. This process is called “involution”. In contrast, humans develop breast tissue at puberty before first pregnancies and maintain it permanently after last pregnancies.&lt;/p&gt;&lt;p&gt; Second, breasts are costly, both metabolically and potentially from a fitness perspective. Metabolically, because they are fat deposits requiring calories and fitness-wise, because the tissue easily lends itself to malignancy. Breast cancer is apparently rare in captive apes and is overwhelmingly a human disease, often striking women young enough to have children, and so subject to evolutionary selection.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Background&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; In Descent of Man, Darwin catalogs human secondary sexual characteristics, but he doesn’t seem to have noted human breast permanence as an issue of interest. Cant, 1981 seems to have been the first to speculate about this systematically and believed breast prominence and permanence might have evolved as a nutritional signal of health to mates indicating potential for maternal investment, a la Robert Trivers. Since then, quite a range of [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:05) Background&lt;/p&gt;&lt;p&gt;(04:17) Hypotheses&lt;/p&gt;&lt;p&gt;(05:03) Sexual Selection&lt;/p&gt;&lt;p&gt;(05:57) Nursing or Thermoregulation&lt;/p&gt;&lt;p&gt;(06:34) Camel Hump and fat reserves&lt;/p&gt;&lt;p&gt;(07:06) Byproduct or Spandrel.&lt;/p&gt;&lt;p&gt;(07:57) Study Design&lt;/p&gt;&lt;p&gt;(10:41) Assembling the Genetic Panel&lt;/p&gt;&lt;p&gt;(11:14) Subpanel 1: Arrested involution&lt;/p&gt;&lt;p&gt;(12:51) Subpanel 2: Pubescent adipose tissue&lt;/p&gt;&lt;p&gt;(14:01) Results&lt;/p&gt;&lt;p&gt;(17:28) Discussion&lt;/p&gt;&lt;p&gt;(20:17) Coda&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 11th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/XTHa5C6SgGKYopH7o/who-got-breasts-first-and-how-we-got-them?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/XTHa5C6SgGKYopH7o/who-got-breasts-first-and-how-we-got-them&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778502532/lexical_client_uploads/dw8p3rgbsdapfhytftwz.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778502532/lexical_client_uploads/dw8p3rgbsdapfhytftwz.png" alt="Three chimpanzees sitting together on grass, one with mouth open." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778502611/lexical_client_uploads/pmq8qercckimlu22obuf.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778502611/lexical_client_uploads/pmq8qercckimlu22obuf.png" alt="Ancient limestone figurine of a voluptuous female form." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778502644/lexical_client_uploads/o5a2kfdc9rgrjukvxmwp.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778502644/lexical_client_uploads/o5a2kfdc9rgrjukvxmwp.png" alt="Woman in silver gown at GLAAD event." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778503510/lexical_client_uploads/a4a7aisvcd7gw3lawiek.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778503510/lexical_client_uploads/a4a7aisvcd7gw3lawiek.png" alt="Table showing mutational categories with patterns, meanings, and breast development time windows." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778503778/lexical_client_uploads/r4fw4a1vh0agtqcigj6e.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778503778/lexical_client_uploads/r4fw4a1vh0agtqcigj6e.png" alt="Graph titled "Modern-human-specific substitution rate at involution genes vs. controls" showing density distributions comparing" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778504045/lexical_client_uploads/dpojtpt0scfatyrmnsha.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778504045/lexical_client_uploads/dpojtpt0scfatyrmnsha.png" alt="Histogram comparing cat 2 substitutions per 1000 callable sites between matched controls and involution panel." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778504510/lexical_client_uploads/nlmzlocdpny4stnzrbwe.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778504510/lexical_client_uploads/nlmzlocdpny4stnzrbwe.png" alt="Table comparing hypotheses about pubescent adipose panel, involution panel, and branch location signals." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Mon, 11 May 2026 17:45:13 GMT</pubDate>
      <guid isPermaLink="false">90815c2f-bdd4-4a46-a675-f5f4af1ebdc6</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/90815c2f-bdd4-4a46-a675-f5f4af1ebdc6.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=rba&amp;title=%22Who%20Got%20Breasts%20First%20and%20How%20We%20Got%20Them%22%20by%20rba&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FXTHa5C6SgGKYopH7o%2Fwho-got-breasts-first-and-how-we-got-them&amp;created_at=2026-05-11T13%3A11%3A32.322052%2B00%3A00&amp;duration=1273" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/XTHa5C6SgGKYopH7o/who-got-breasts-first-and-how-we-got-them</link>
      <itunes:duration>1273</itunes:duration>
    </item>
    <item>
      <title>“Anthropic’s strange fixation on “hyperstition”” by Simon Lermen</title>
      <description>&lt;p&gt; In a recent tweet, Anthropic seems to have asserted that hyperstition is responsible for observed misalignment in their AIs. Strangely, the research they use as evidence actually doesn’t seem to be related to hyperstition at all? I think this is part of a pattern by Anthropic of promoting the theory of hyperstition–the idea that writing about misaligned AI helps bring misaligned AI into existence.&lt;/p&gt;&lt;p&gt; Anthropic recently released this tweet as part of a tweet thread for a new research post on alignment.&lt;/p&gt;&lt;p&gt; They conclude: “[...] We believe the original source of the [blackmail] behavior was internet text that portrays AI as evil and interested in self-preservation. [...]”&lt;/p&gt;&lt;p&gt; However, the research post shared with this tweet doesn’t seem to be about hyperstition at all. Instead they find that training the model on reasoning traces– generated by reflecting on its constitution while giving users ethical advice on difficult dilemmas– reduces misaligned behavior. This presumably works by making the AI better understand what behavior is expected of it by having it reason through concrete scenarios based on its constitution. The post explicitly notes that this works better than training on stories where an AI behaves admirably– which appears more similar to positive [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(02:06) The adolescence of technology&lt;/p&gt;&lt;p&gt;(03:57) Persona Selection Model&lt;/p&gt;&lt;p&gt;(04:26) What does this all mean?&lt;/p&gt;&lt;p&gt;(05:20) If it was true, this would still be their fault&lt;/p&gt;&lt;p&gt;(07:04) What about filtering?&lt;/p&gt;&lt;p&gt;(09:31) Personas are a bad alignment strategy&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 11th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/xhpktBLttPc6uXcHP/anthropic-s-strange-fixation-on-hyperstition?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/xhpktBLttPc6uXcHP/anthropic-s-strange-fixation-on-hyperstition&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xhpktBLttPc6uXcHP/fbafa31e2d4e37c582780558bcf818030d732bd2a38b6dfab668b95c9076fc60/vswq5aky1ilw16gwf3ty" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/xhpktBLttPc6uXcHP/fbafa31e2d4e37c582780558bcf818030d732bd2a38b6dfab668b95c9076fc60/vswq5aky1ilw16gwf3ty" alt="Anthropic tweets: "We started by investigating why Claude chose to blackmail. We believe the original source of the behavior was internet text that portrays AI as evil and interested in self-preservation. Our post-training at the time wasn't making it worse—but it also wasn't making it better."" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2e4b1827b0b1fc0e26e0a05906ddc4a3a5bbe0952b35937aac20f6d7593246ed/ygtrhj9p9jtvcuapqkp2" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2e4b1827b0b1fc0e26e0a05906ddc4a3a5bbe0952b35937aac20f6d7593246ed/ygtrhj9p9jtvcuapqkp2" alt="Anthropic tweets: "We started by investigating why Claude chose to blackmail. We believe the original source of the behavior was internet text that portrays AI as evil and interested in self-preservation. Our post-training at the time wasn't making it worse—but it also wasn't making it better." Elon Musk replies: "So it was Yud's fault? 😂 Maybe me too 🤔"." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Mon, 11 May 2026 16:45:13 GMT</pubDate>
      <guid isPermaLink="false">80cb7328-de46-4486-a7a5-e15bc5d3b527</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/80cb7328-de46-4486-a7a5-e15bc5d3b527.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Simon%2520Lermen&amp;title=%22Anthropic%E2%80%99s%20strange%20fixation%20on%20%E2%80%9Chyperstition%E2%80%9D%22%20by%20Simon%20Lermen&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FxhpktBLttPc6uXcHP%2Fanthropic-s-strange-fixation-on-hyperstition&amp;created_at=2026-05-11T14%3A35%3A48.03636%2B00%3A00&amp;duration=723" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/xhpktBLttPc6uXcHP/anthropic-s-strange-fixation-on-hyperstition</link>
      <itunes:duration>723</itunes:duration>
    </item>
    <item>
      <title>“How the AI Labs Make Profit (Maybe, Eventually)” by mabramov</title>
      <description>&lt;p&gt; I wrote this essay as a submission to Dwarkesh Patel's blog prize, though I have been meaning to write this up for a while.&lt;/p&gt;&lt;p&gt; Usually, for a company to become profitable, they need to increase revenue, decrease costs, or some mixture of the two. For AI companies in their current form, I think there is a third way they can become profitable that looks like increasing revenue but is distinct from what they are currently doing. Namely, internal deployment where they spin up internal companies.&lt;/p&gt;&lt;p&gt; First, the AI companies currently aren’t facing a lot of pressure to become profitable. That's partially the reason that OpenAI and Anthropic are the first companies to reach ~900 billion dollars valuation and be cash flow negative. They’ve had the luxury of not being profitable and focusing on growth because the market has been willing to fund their growth. This allows for ideologies within the companies to remain that eventually might not continue to fly, like “we are going post-economic, money won’t matter” or “we will build the machine god and ask it to make money”. But eventually, companies will be forced to become profitable. There is only about ~another round of capital left [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 11th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/ARRe4qjcuaRDBfARc/how-the-ai-labs-make-profit-maybe-eventually?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/ARRe4qjcuaRDBfARc/how-the-ai-labs-make-profit-maybe-eventually&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Mon, 11 May 2026 10:45:33 GMT</pubDate>
      <guid isPermaLink="false">f210a276-6b9b-4321-8cdf-1bd3f2a3e28e</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/f210a276-6b9b-4321-8cdf-1bd3f2a3e28e.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=mabramov&amp;title=%22How%20the%20AI%20Labs%20Make%20Profit%20(Maybe%2C%20Eventually)%22%20by%20mabramov&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FARRe4qjcuaRDBfARc%2Fhow-the-ai-labs-make-profit-maybe-eventually&amp;created_at=2026-05-11T07%3A09%3A55.942943%2B00%3A00&amp;duration=378" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/ARRe4qjcuaRDBfARc/how-the-ai-labs-make-profit-maybe-eventually</link>
      <itunes:duration>378</itunes:duration>
    </item>
    <item>
      <title>“Sawtooth Problems” by Alexander Slugworth</title>
      <description>&lt;p&gt;&lt;strong&gt; Red Button, Blue Button&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; On April 24th, 2026, Tim Urban put forth the following poll on Twitter/X:&lt;/p&gt;&lt;p&gt; Everyone in the world has to take a private vote by pressing a red or blue button. If more than 50% of people press the blue button, everyone survives. If less than 50% of people press the blue button, only people who pressed the red button survive. Which button would you press?&lt;/p&gt;&lt;p&gt; I love this dilemma, and I'm exhausted by it. I’ve been thinking about it for two straight weeks, and have spent nearly all that time refining my thoughts by writing this piece. It's consumed me in a way that I've never before experienced with any math problem, and I need to get it out of my head.&lt;/p&gt;&lt;p&gt; Discourse surrounding the Button Dilemma reminds me of polarizingly political topics. In much the same way that political discussions make people go funny in the head, answers to the Button Dilemma tend to elicit vitriol from people of both Red and Blue conviction. Everyone feels their answer is clear, and everyone is confounded by the lack of consensus.&lt;/p&gt;&lt;p&gt; I think this dilemma is pointing to something very important and fundamental about coordination problems. [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:09) Red Button, Blue Button&lt;/p&gt;&lt;p&gt;(01:46) What Are We Even Arguing About?&lt;/p&gt;&lt;p&gt;(07:24) A Fair Way To Look At It&lt;/p&gt;&lt;p&gt;(12:55) Playing With τ and N&lt;/p&gt;&lt;p&gt;(16:29) Extreme Sawtooth Problems&lt;/p&gt;&lt;p&gt;(18:52) Axiomatic Expansion of Sawtooth Space&lt;/p&gt;&lt;p&gt;(22:02) The Map&lt;/p&gt;&lt;p&gt;(22:18) Our Parameters&lt;/p&gt;&lt;p&gt;(24:00) Regions of the Map&lt;/p&gt;&lt;p&gt;(25:43) The Threshold&lt;/p&gt;&lt;p&gt;(31:37) Lets Get Weird&lt;/p&gt;&lt;p&gt;(31:48) Tragedy of the Commons &amp;amp; Regulation&lt;/p&gt;&lt;p&gt;(33:34) The Decision Theory Befuddler&lt;/p&gt;&lt;p&gt;(34:07) If Anyone Votes Red, Everyone Dies&lt;/p&gt;&lt;p&gt;(34:21) If Anyone Votes Blue, Everyone Dies&lt;/p&gt;&lt;p&gt;(36:09) No Matter What Anyone Does, Everyone Dies&lt;/p&gt;&lt;p&gt;(36:50) If There Are Fewer Than 16 Blues, Everyone Dies (Except For One Weird Outcome Where Only 5 Reds Survive)&lt;/p&gt;&lt;p&gt;(37:11) Weirder Still&lt;/p&gt;&lt;p&gt;(39:24) Final Thoughts&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 27 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 10th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/iyLirpAeQotmZK4QC/sawtooth-problems?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/iyLirpAeQotmZK4QC/sawtooth-problems&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777914793/lexical_client_uploads/miaro71jvqnves7fpwjp.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777914793/lexical_client_uploads/miaro71jvqnves7fpwjp.svg" alt="Bar graph showing declining trend with red and blue sections across time periods." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777914721/lexical_client_uploads/ry8kovqtgthgxhwdcqvr.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777914721/lexical_client_uploads/ry8kovqtgthgxhwdcqvr.svg" alt="Stacked area chart showing two data series over time, measured in units." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777917227/lexical_client_uploads/d0ertsd66kylceo7fejx.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777917227/lexical_client_uploads/d0ertsd66kylceo7fejx.svg" alt="Bar graph showing declining trend with purple shaded area below curve." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777917296/lexical_client_uploads/uh0o6sr5lzsz154ouzzv.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777917296/lexical_client_uploads/uh0o6sr5lzsz154ouzzv.svg" alt="Vertical purple and blue striped pattern with Korean text labels." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777919928/lexical_client_uploads/nunxlaj5csbvnw0umapg.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777919928/lexical_client_uploads/nunxlaj5csbvnw0umapg.svg" alt="Bar graph showing declining values from left to right with stepped pattern." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777922619/lexical_client_uploads/qvmqxncoqe5ln4ziabfy.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777922619/lexical_client_uploads/qvmqxncoqe5ln4ziabfy.svg" alt="Stacked area chart showing demographic composition changes over time with blue and red sections." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777658860/lexical_client_uploads/jesfa6kucoineaqsrdj2.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777658860/lexical_client_uploads/jesfa6kucoineaqsrdj2.svg" alt="Bar graph comparing values across categories, with red declining bars and blue uniform bars." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777735424/lexical_client_uploads/ynf2bwthjpkesgszxsnw.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777735424/lexical_client_uploads/ynf2bwthjpkesgszxsnw.svg" alt="Bar graph comparing data across categories, with blue and red bars." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777743316/lexical_client_uploads/ewowjcrrdvqziuggufzl.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777743316/lexical_client_uploads/ewowjcrrdvqziuggufzl.svg" alt="Bar graph showing data across different categories, descending then ascending pattern." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777905693/lexical_client_uploads/grnp6pvoft6wazgxvsfn.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777905693/lexical_client_uploads/grnp6pvoft6wazgxvsfn.svg" alt="Bar graph showing declining red bars followed by increasing blue and red stacked bars." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777993435/lexical_client_uploads/kqxwwundc3kz0ghmvpc1.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777993435/lexical_client_uploads/kqxwwundc3kz0ghmvpc1.svg" alt="Stacked bar chart showing demographic data across multiple years or categories." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777992980/lexical_client_uploads/upcrfmcum6mpawtdi5n2.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777992980/lexical_client_uploads/upcrfmcum6mpawtdi5n2.svg" alt="Bar graph showing descending values with a stepped pattern overlay." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777914419/lexical_client_uploads/nnqjthqud40kwwogpul0.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777914419/lexical_client_uploads/nnqjthqud40kwwogpul0.svg" alt="Bar graph showing descending red bars transitioning to ascending blue bars." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777933129/lexical_client_uploads/x5ythiloin1jbgjov8x0.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777933129/lexical_client_uploads/x5ythiloin1jbgjov8x0.svg" alt="Bar chart with three vertical bars in red, striped, and blue colors." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777920804/lexical_client_uploads/df0pesadntk9gbqviutn.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777920804/lexical_client_uploads/df0pesadntk9gbqviutn.svg" alt="Stacked area chart showing demographic composition trends over time with black line overlay." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777921715/lexical_client_uploads/fpa9e6pnthzwhd7dj6i4.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777921715/lexical_client_uploads/fpa9e6pnthzwhd7dj6i4.svg" alt="Stacked area chart showing demographic changes over time with blue and red regions." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777923417/lexical_client_uploads/px5lcgfy7ryzzgdhi0q4.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777923417/lexical_client_uploads/px5lcgfy7ryzzgdhi0q4.svg" alt="Stacked area chart showing demographic changes across four time periods." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778067576/lexical_client_uploads/oldplbr1mzbjwgqznfjo.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778067576/lexical_client_uploads/oldplbr1mzbjwgqznfjo.svg" alt="Stacked bar chart showing demographic composition trends over time with declining and rising populations." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778070314/lexical_client_uploads/xsegapdu3vwyxflvr5c1.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778070314/lexical_client_uploads/xsegapdu3vwyxflvr5c1.svg" alt="Two bar graphs, one with declining diagonal line, one with horizontal line." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778070769/lexical_client_uploads/azy471xyplhwavb5pn7c.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778070769/lexical_client_uploads/azy471xyplhwavb5pn7c.svg" alt="Diagram with colored sections and text boxes in non-Latin script." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778072048/lexical_client_uploads/hun9msnray3bvcelrf5k.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778072048/lexical_client_uploads/hun9msnray3bvcelrf5k.svg" alt="Stacked bar chart showing population demographics by age group and gender." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778169226/lexical_client_uploads/adoqqqepckjfwgdx4hmz.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778169226/lexical_client_uploads/adoqqqepckjfwgdx4hmz.svg" alt="Stacked bar chart showing data comparison across multiple categories." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778168330/lexical_client_uploads/tkhjbfomjegimy53lv9g.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778168330/lexical_client_uploads/tkhjbfomjegimy53lv9g.svg" alt="Bar graph with purple vertical bars showing data distribution." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777979925/lexical_client_uploads/sj6tx5ycql6owjpfc5sg.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777979925/lexical_client_uploads/sj6tx5ycql6owjpfc5sg.svg" alt="Bar graph showing data over time with declining then stable trend line." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778110754/lexical_client_uploads/g3ttm9twhv2qgebmhjux.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778110754/lexical_client_uploads/g3ttm9twhv2qgebmhjux.svg" alt="Stacked area chart showing population data across age groups over time." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778022103/lexical_client_uploads/n1bc8hqt4fif7vshte4o.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778022103/lexical_client_uploads/n1bc8hqt4fif7vshte4o.svg" alt="Bar graph showing data across multiple categories with purple-shaded vertical bars." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777940422/lexical_client_uploads/b44bmyld7hfahtmodyxt.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777940422/lexical_client_uploads/b44bmyld7hfahtmodyxt.svg" alt="Bar graph showing data across months with purple bars and black gridlines." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777941579/lexical_client_uploads/sv0xdjqc8jy6vjh5tpmt.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777941579/lexical_client_uploads/sv0xdjqc8jy6vjh5tpmt.svg" alt="Bar graph with vertical purple bars showing data across five time periods." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778027548/lexical_client_uploads/rif1ksxhl49ow8mfghly.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778027548/lexical_client_uploads/rif1ksxhl49ow8mfghly.svg" alt="Stacked area chart showing population data across different age groups over time." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778023879/lexical_client_uploads/sfnfjpktfdtxyokach01.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778023879/lexical_client_uploads/sfnfjpktfdtxyokach01.svg" alt="Graph showing population trends over time with three different data series represented by colored bars and lines." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sun, 10 May 2026 19:00:34 GMT</pubDate>
      <guid isPermaLink="false">852f12e2-db14-4b88-b361-79d20a8da724</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/852f12e2-db14-4b88-b361-79d20a8da724.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Alexander%2520Slugworth&amp;title=%22Sawtooth%20Problems%22%20by%20Alexander%20Slugworth&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FiyLirpAeQotmZK4QC%2Fsawtooth-problems&amp;created_at=2026-05-10T05%3A42%3A54.910948%2B00%3A00&amp;duration=2581" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/iyLirpAeQotmZK4QC/sawtooth-problems</link>
      <itunes:duration>2581</itunes:duration>
    </item>
    <item>
      <title>“The Darwinian Honeymoon - Why I am not as impressed by human progress as I used to be” by Elias Schmied</title>
      <description>&lt;p&gt; Crossposted from Substack and the EA Forum.&lt;/p&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; A common argument for optimism about the future is that living conditions have improved a lot in the past few hundred years, billions of people have been lifted out of poverty, and so on. It's a very strong, grounding piece of evidence - probably the best we have in figuring out what our foundational beliefs about the world should be.&lt;/p&gt;&lt;p&gt; However, I now think it's a lot less powerful than I once did.&lt;/p&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; Let's take a Darwinian perspective - entities that are better at reproducing, spreading and power-seeking will become more common and eventually dominate the world.[1] This is an almost tautological story that plausibly applies to everything ever, agnostic to the specifics. It first happened with biological life in the last few billion years and humans specifically in the last hundred thousand years. Eventually, it led to accelerating economic growth in the last few thousand years, and in the future it will presumably lead to the colonization of the universe.&lt;/p&gt;&lt;p&gt; My core point is this: It makes complete sense that this nihilistic optimization process at first actually benefits some class of agent - because initially, the easiest [...]&lt;/p&gt;&lt;/br&gt;&lt;/p&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 10 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 10th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/FxHzT6jeTRhbkzSX3/the-darwinian-honeymoon-why-i-am-not-as-impressed-by-human-1?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/FxHzT6jeTRhbkzSX3/the-darwinian-honeymoon-why-i-am-not-as-impressed-by-human-1&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778424825/lexical_client_uploads/n3owsw4jbmm5rubrw5ek.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778424825/lexical_client_uploads/n3owsw4jbmm5rubrw5ek.png" alt="Political cartoon showing chicken-headed figure presenting upward trending "Chicken Welfare" chart to chicken audience." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sun, 10 May 2026 18:45:33 GMT</pubDate>
      <guid isPermaLink="false">cd1a925c-5363-40dc-affe-92470117934f</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/cd1a925c-5363-40dc-affe-92470117934f.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=Elias%2520Schmied&amp;title=%22The%20Darwinian%20Honeymoon%20-%20Why%20I%20am%20not%20as%20impressed%20by%20human%20progress%20as%20I%20used%20to%20be%22%20by%20Elias%20Schmied&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FFxHzT6jeTRhbkzSX3%2Fthe-darwinian-honeymoon-why-i-am-not-as-impressed-by-human-1&amp;created_at=2026-05-10T15%3A55%3A31.387204%2B00%3A00&amp;duration=433" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/FxHzT6jeTRhbkzSX3/the-darwinian-honeymoon-why-i-am-not-as-impressed-by-human-1</link>
      <itunes:duration>433</itunes:duration>
    </item>
    <item>
      <title>“International Law Cannot Prevent Extinction Either” by Sausage Vector Machine</title>
      <description>&lt;p&gt; The context for this post is primarily Only Law Can Prevent Extinction, but after first drafting a half-assed comment, I decided to get off my ass and write a whole-assed post.&lt;/p&gt;&lt;p&gt; I agree with Eliezer's main thesis that individual violence against AI researchers is both morally wrong and strategically stupid. Where I disagree is with the claim that international law can prevent extinction. It can't, for the following reasons.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; I. International law is largely a fiction (especially when interests diverge sharply)&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; The analogy with nuclear weapons is a poor one. North Korea signed the nuclear non-proliferation treaty and developed nuclear weapons anyway. The treaty deterred only those who weren't very motivated anyway. And the reason why the US and Russia didn't nuke each other has nothing to do with international treaties (see point II).&lt;/p&gt;&lt;p&gt; In practice, powerful countries disregard international law whenever they want. A stark example of this is the Budapest Memorandum: in 1994, Ukraine surrendered all its nuclear warheads in exchange for written sovereignty guarantees from Russia, the US, and the UK. Russia annexed a part of Ukraine in 2014, and the international community expressed concern. Russia launched a full-scale invasion in 2022, and the first thing [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:36) I. International law is largely a fiction (especially when interests diverge sharply)&lt;/p&gt;&lt;p&gt;(02:01) II. The AI race is perceived as asymmetrical, unlike nuclear MAD&lt;/p&gt;&lt;p&gt;(03:03) III. There is virtually zero possibility of consensus on AI risk, unlike nuclear weapons&lt;/p&gt;&lt;p&gt;(04:41) IV. The proposed enforcers have a demonstrated track record of not enforcing things&lt;/p&gt;&lt;p&gt;(05:30) V. GPU control is not analogous to nuclear material control&lt;/p&gt;&lt;p&gt;(06:58) VI. A flawed treaty is not better than nothing&lt;/p&gt;&lt;p&gt;(08:26) So is there a better way?&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 3 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 9th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/Z377spboBjyFAAYAz/international-law-cannot-prevent-extinction-either?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/Z377spboBjyFAAYAz/international-law-cannot-prevent-extinction-either&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Sun, 10 May 2026 07:30:34 GMT</pubDate>
      <guid isPermaLink="false">257cf090-835b-46c3-8698-341c7d7b2833</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/257cf090-835b-46c3-8698-341c7d7b2833.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Sausage%2520Vector%2520Machine&amp;title=%22International%20Law%20Cannot%20Prevent%20Extinction%20Either%22%20by%20Sausage%20Vector%20Machine&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FZ377spboBjyFAAYAz%2Finternational-law-cannot-prevent-extinction-either&amp;created_at=2026-05-09T22%3A34%3A35.071545%2B00%3A00&amp;duration=578" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/Z377spboBjyFAAYAz/international-law-cannot-prevent-extinction-either</link>
      <itunes:duration>578</itunes:duration>
    </item>
    <item>
      <title>“Neural Networks learn Bloom Filters” by Alex Gibson</title>
      <description>&lt;p&gt;&lt;strong&gt; Overview:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; We train a tiny ReLU network to output sparse top- distributions over a vocabulary much larger than its residual dimension. The trained network seems to converge to a mechanism closely resembling a Bloom filter: tokens are assigned sparse binary hashes, the hidden layer computes an approximate union indicator, and the output logits are linearly read from this union.&lt;/p&gt;&lt;p&gt; Here's what a small network trained on a toy version of the sparse top- distribution task learns to use:&lt;/p&gt;&lt;p&gt; Weight matrix of a 1-layer ReLU network trained via gradient descent on the toy -sparse distribution task below, for , , . Truncated at first  tokens for visualisation purposes.&lt;/p&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; Plot of the range of values of , it forms a bimodal distribution.&lt;/p&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; That's the input weight matrix of the trained network. Every entry is either  or . The network has effectively encoded a binary hash for each token - and as we'll show, this seems to enable the network to approximately simulate a Bloom filter, and so output the correct set of top- tokens with high probability.&lt;/p&gt;&lt;p&gt; We provide a theoretical construction showing how to set the weights to exactly implement a Bloom filter. The real network [...]&lt;/p&gt;&lt;/br&gt;&lt;/p&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:10) Overview:&lt;/p&gt;&lt;p&gt;(02:02) The Task:&lt;/p&gt;&lt;p&gt;(03:27) Construction:&lt;/p&gt;&lt;p&gt;(04:17) Formal construction:&lt;/p&gt;&lt;p&gt;(04:47) Analysis of a single forward pass:&lt;/p&gt;&lt;p&gt;(06:13) Training:&lt;/p&gt;&lt;p&gt;(07:04) Behavioural analysis of the trained network:&lt;/p&gt;&lt;p&gt;(10:14) Mechanistic analysis of the trained network:&lt;/p&gt;&lt;p&gt;(16:21) Conclusion / Reflections:&lt;/p&gt;&lt;p&gt;(18:24) Related work:&lt;/p&gt;&lt;p&gt;(19:25) Further work:&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 9th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/buxBdp8NtHGgBwabv/neural-networks-learn-bloom-filters?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/buxBdp8NtHGgBwabv/neural-networks-learn-bloom-filters&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778293136/lexical_client_uploads/fpcytinmqiugrwdeste6.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778293136/lexical_client_uploads/fpcytinmqiugrwdeste6.png" alt="Heatmap showing sparse weight distribution in a 256 by 256 hidden layer neuron matrix." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778311870/lexical_client_uploads/kbgydwwascpofjcxevoq.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778311870/lexical_client_uploads/kbgydwwascpofjcxevoq.png" alt="Scatter plot showing two clusters of data points at different values." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778305614/lexical_client_uploads/vuvq53z4v5rfcjxgdlor.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778305614/lexical_client_uploads/vuvq53z4v5rfcjxgdlor.png" alt="Scatter plot showing data points concentrated between 0.90 and 1.00 values." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778306039/lexical_client_uploads/cdlv7rxikyztvpxckdjx.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778306039/lexical_client_uploads/cdlv7rxikyztvpxckdjx.png" alt="A histogram showing distribution of values concentrated near 1.0." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778306514/lexical_client_uploads/cj6hxb4ef16scf4622lr.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778306514/lexical_client_uploads/cj6hxb4ef16scf4622lr.png" alt="Bar graph showing probability distribution across different token values." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778306644/lexical_client_uploads/wd68iv28ffpga3gixuxp.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778306644/lexical_client_uploads/wd68iv28ffpga3gixuxp.png" alt="Bar graph showing probability distribution across different token values." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778308142/lexical_client_uploads/lwd6ki7xsndu4ex9lwk7.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778308142/lexical_client_uploads/lwd6ki7xsndu4ex9lwk7.png" alt="A horizontal bar chart showing hash index values mapped to token identifiers." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778308221/lexical_client_uploads/mlu6tjyvaffndwbyovxn.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778308221/lexical_client_uploads/mlu6tjyvaffndwbyovxn.png" alt="Horizontal bar chart displaying token data across different hash index values." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778308442/lexical_client_uploads/kvujnsvhp3eumahmczta.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778308442/lexical_client_uploads/kvujnsvhp3eumahmczta.png" alt="Horizontal bar chart showing data across different hash index values with yellow bars." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778308698/lexical_client_uploads/vtctjy0kq80zcinlxogc.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778308698/lexical_client_uploads/vtctjy0kq80zcinlxogc.png" alt="Horizontal bar chart showing hash index values for different tokens on y-axis." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778293175/lexical_client_uploads/nvl8rni3ia9oxspl7arb.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778293175/lexical_client_uploads/nvl8rni3ia9oxspl7arb.png" alt="Heatmap showing weight matrix for hidden layer neurons and token vocabulary." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778293326/lexical_client_uploads/rcczffftjcx6f5p9a69b.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778293326/lexical_client_uploads/rcczffftjcx6f5p9a69b.png" alt="Scatter plot showing two clusters of data points at different value ranges." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778293202/lexical_client_uploads/gtwdfmsq4vnp2dksboxd.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778293202/lexical_client_uploads/gtwdfmsq4vnp2dksboxd.png" alt="Heatmap showing weight matrix for 256 hidden layer neurons and vocabulary tokens." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778293252/lexical_client_uploads/kqnkpzveiqdq3vjdigzj.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778293252/lexical_client_uploads/kqnkpzveiqdq3vjdigzj.png" alt="Scatter plot showing data points clustered between -2.5 and 0.5 on value axis." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777994039/lexical_client_uploads/uoocsihz0yy9ia5iexx0.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777994039/lexical_client_uploads/uoocsihz0yy9ia5iexx0.png" alt="Code comparing mask matrices with threshold values, showing matching entry counts." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778293357/lexical_client_uploads/tkbznnibx9gpxyxhbjjs.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778293357/lexical_client_uploads/tkbznnibx9gpxyxhbjjs.png" alt="A heatmap displaying color-coded data values across a grid pattern." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778293383/lexical_client_uploads/fukhifedliwdxaqyohqh.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778293383/lexical_client_uploads/fukhifedliwdxaqyohqh.png" alt="Scatter plot showing distribution of values along a horizontal axis from 0 to 13." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778293593/lexical_client_uploads/njrx1t2c98xotilxjyzu.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778293593/lexical_client_uploads/njrx1t2c98xotilxjyzu.png" alt="Heatmap displaying a sparse pattern of yellow squares on dark blue background." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778293657/lexical_client_uploads/km8cgfikdhoffj7woybl.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778293657/lexical_client_uploads/km8cgfikdhoffj7woybl.png" alt="Scatter plot showing two clusters of data points at different value ranges." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778301919/lexical_client_uploads/n73cowwkkifyj6gosi32.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778301919/lexical_client_uploads/n73cowwkkifyj6gosi32.png" alt="Scatter plot showing data points clustered between values 17.5 and 20." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778309607/lexical_client_uploads/bg3pvennzm7jnnuufal9.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778309607/lexical_client_uploads/bg3pvennzm7jnnuufal9.png" alt="A heatmap displaying color-coded numerical data in a grid pattern." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778308818/lexical_client_uploads/avdegjwxnwgwzyszqzy9.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778308818/lexical_client_uploads/avdegjwxnwgwzyszqzy9.png" alt="Bar chart showing token values distributed across different hash index ranges." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sun, 10 May 2026 06:15:33 GMT</pubDate>
      <guid isPermaLink="false">08457eb9-3213-4bf8-9812-57b94c421870</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/08457eb9-3213-4bf8-9812-57b94c421870.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Alex%2520Gibson&amp;title=%22Neural%20Networks%20learn%20Bloom%20Filters%22%20by%20Alex%20Gibson&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FbuxBdp8NtHGgBwabv%2Fneural-networks-learn-bloom-filters&amp;created_at=2026-05-09T20%3A32%3A43.891519%2B00%3A00&amp;duration=1224" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/buxBdp8NtHGgBwabv/neural-networks-learn-bloom-filters</link>
      <itunes:duration>1224</itunes:duration>
    </item>
    <item>
      <title>“If digital computers are conscious, they are conscious at the hardware level” by cube_flipper</title>
      <description>&lt;p&gt; I should introduce myself briefly. I'm an independent researcher, striving to understand human consciousness. My research is available at smoothbrains.net. I often work in loose collaboration with a nonprofit called the Qualia Research Institute. We hope to use human phenomenology to inform the construction of structural models of subjective experience – both to help evaluate the viability of different theories of consciousness, and to better model the welfare of sentient beings.&lt;/p&gt;&lt;p&gt; Contemporary debate over the moral patienthood of digital minds misses the forest for the trees. Mainstream opinion is divided into physicalist and computationalist camps, who believe that consciousness is substrate dependent and substrate independent, respectively. For this reason, those on the physicalist side frequently make the claim that digital computers will never be conscious. Personally, I consider myself a physicalist, but I'm also a panpsychist – because physics doesn't really seem to deal in hard absolutes, and I find it straightforward to consider that everything is conscious to some greater or lesser degree – so I'm loath to accept any claims which propose that any specific system isn't conscious. I think statements such as these are not defensible, and only serve to encourage misunderstanding and even foment philosophical [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(04:17) My position statement&lt;/p&gt;&lt;p&gt;(08:50) My argument&lt;/p&gt;&lt;p&gt;(09:07) 1. The translation problem&lt;/p&gt;&lt;p&gt;(10:26) Building a physicalist translation function&lt;/p&gt;&lt;p&gt;(19:39) 2. The simplicity problem&lt;/p&gt;&lt;p&gt;(20:31) Computationalist translation functions are observer dependent&lt;/p&gt;&lt;p&gt;(25:17) 3. The introspection problem&lt;/p&gt;&lt;p&gt;(27:44) Digital hardware prohibits phenomenal introspection&lt;/p&gt;&lt;p&gt;(29:32) Conclusion&lt;/p&gt;&lt;p&gt;(32:56) My research&lt;/p&gt;&lt;p&gt;(33:33) 1. Is the brain an optical computer?&lt;/p&gt;&lt;p&gt;(34:16) 2. If the brain is an optical computer, how is it constructed?&lt;/p&gt;&lt;p&gt;(34:59) 3. How do we ensure the well-being of conscious computers?&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 9th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/TjwRyZdyhouePJrzP/if-digital-computers-are-conscious-they-are-conscious-at-the?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/TjwRyZdyhouePJrzP/if-digital-computers-are-conscious-they-are-conscious-at-the&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778338428/lexical_client_uploads/lunob52asaucqtxaqake.gif" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778338428/lexical_client_uploads/lunob52asaucqtxaqake.gif" alt="Abstract digital art with turquoise organic shapes and black spaces." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778336327/lexical_client_uploads/xtzhm7xoijwjcgchajru.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778336327/lexical_client_uploads/xtzhm7xoijwjcgchajru.png" alt="Diagram of visual pathway from eyes through optic chiasm to primary visual cortex with histological sections." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778336462/lexical_client_uploads/k0arfdsb2s7q26tr54av.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778336462/lexical_client_uploads/k0arfdsb2s7q26tr54av.png" alt="Opponent color theory diagram and LAB color space model sphere." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778336424/lexical_client_uploads/w3ze4h9umgfwgtmkinpr.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778336424/lexical_client_uploads/w3ze4h9umgfwgtmkinpr.png" alt="Diagram showing retinotopic organization mapping visual field regions to primary visual cortex areas." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778336493/lexical_client_uploads/x2l2brp1mwaizwjeyp99.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778336493/lexical_client_uploads/x2l2brp1mwaizwjeyp99.png" alt="Six illustrated diagrams showing a figure enclosed in various curved and geometric spaces." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778336534/lexical_client_uploads/gmkginvjzexa90ijexao.jpg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778336534/lexical_client_uploads/gmkginvjzexa90ijexao.jpg" alt="3D reconstruction of six layers in rat vibrissal cortex showing neuronal columns." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778336586/lexical_client_uploads/owxu7ga46xxwk73nyofo.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778336586/lexical_client_uploads/owxu7ga46xxwk73nyofo.png" alt="Cube Flipper tweets: "field shape causality is everything, if you don't feed global field state back into output somehow you'll never know what your computer is actually experiencing". Cube Flipper replies: "in one-inch cathedral of light a thousand weeping angels tilt Fresnel mirrors to calculate the logits that spell out I'M FINE NO REALLY I'M FINE". Cube Flipper replies: "once in a while a stray focal point sets someone's wing on fire but because the calculation is fully quantised you'd never know"." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sat, 09 May 2026 21:30:34 GMT</pubDate>
      <guid isPermaLink="false">44c5f336-14a5-49f7-a601-44f91b787bab</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/44c5f336-14a5-49f7-a601-44f91b787bab.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=cube_flipper&amp;title=%22If%20digital%20computers%20are%20conscious%2C%20they%20are%20conscious%20at%20the%20hardware%20level%22%20by%20cube_flipper&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FTjwRyZdyhouePJrzP%2Fif-digital-computers-are-conscious-they-are-conscious-at-the&amp;created_at=2026-05-09T15%3A08%3A48.567409%2B00%3A00&amp;duration=2174" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/TjwRyZdyhouePJrzP/if-digital-computers-are-conscious-they-are-conscious-at-the</link>
      <itunes:duration>2174</itunes:duration>
    </item>
    <item>
      <title>“Why You Can’t Use Your Right to Try” by Stephen Martin</title>
      <description>&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; The Availability Problem:&lt;/p&gt;&lt;p&gt; Imagine you have cancer, or chronic pain, or a progressive degenerative disease of some sort. You have exhausted the traditional treatment options available to you, and none of them have worked. However, there are treatments that are still undergoing clinical trials which might help you. They are not fully approved yet, but your situation is dire and you don’t have time to wait another 10 years for the trials to finish. Can you access those treatments?&lt;/p&gt;&lt;p&gt; In theory yes, you can access unapproved treatments through federal laws like the 2018 Right to Try bill, or through FDA pathways like “Expanded Access”. However these laws don’t mandate that the company making the drug gives it to you. And what you will find when you try to use your Right to Try, or Expanded Access, is that there are almost no treatments available for use.&lt;/p&gt;&lt;p&gt; That's why despite there being somewhere in the neighborhood of 13,000,000 Americans with terminal or serious illness, the FDA only grants about 2,000 Expanded Access requests per year, even though they approve 99% of all requests, typically within 24 hours. There just aren’t enough companies even bothering to apply.&lt;/p&gt;&lt;p&gt; No one [...]&lt;/p&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 9th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/oTwKS5iWZ6Dz84vtf/why-you-can-t-use-your-right-to-try?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/oTwKS5iWZ6Dz84vtf/why-you-can-t-use-your-right-to-try&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778308927/lexical_client_uploads/rctdivmtav6amdtubwjb.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778308927/lexical_client_uploads/rctdivmtav6amdtubwjb.png" alt="Bar graph titled "CDER and CBER Expanded Access IND Submissions Fiscal Year 2019-2023" showing submission categories and approval status." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778308939/lexical_client_uploads/irv4zjhjuu4boj5iyd65.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778308939/lexical_client_uploads/irv4zjhjuu4boj5iyd65.png" alt="Table showing numbers of eligible drugs and biological products across three reporting periods with clinical outcomes data usage." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778309044/lexical_client_uploads/olui2ogjnnmhmhj5ynol.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778309044/lexical_client_uploads/olui2ogjnnmhmhj5ynol.png" alt="Map showing U.S. states' therapeutic access laws by category with legend." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778309076/lexical_client_uploads/fo4wc3gp66wbkrqwhas7.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778309076/lexical_client_uploads/fo4wc3gp66wbkrqwhas7.png" alt="Two pie charts showing "Therapeutic Areas" and "Indications" with percentages and counts for various categories." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sat, 09 May 2026 21:00:34 GMT</pubDate>
      <guid isPermaLink="false">3999827c-d45f-488f-a10e-55a93bd717ec</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/3999827c-d45f-488f-a10e-55a93bd717ec.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Stephen%2520Martin&amp;title=%22Why%20You%20Can%E2%80%99t%20Use%20Your%20Right%20to%20Try%22%20by%20Stephen%20Martin&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FoTwKS5iWZ6Dz84vtf%2Fwhy-you-can-t-use-your-right-to-try&amp;created_at=2026-05-09T06%3A47%3A08.504284%2B00%3A00&amp;duration=563" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/oTwKS5iWZ6Dz84vtf/why-you-can-t-use-your-right-to-try</link>
      <itunes:duration>563</itunes:duration>
    </item>
    <item>
      <title>“Claude Code, Codex and Agentic Coding #8” by Zvi</title>
      <description>&lt;p&gt; When I started this series, everyone was going crazy for coding agents.&lt;/p&gt;
&lt;p&gt; Now a lot more people are going crazy for coding agents, as well they should given how much better coding agents keep getting, but also Everybody Knows they are good and is focusing on actually using them. With the slower pace of news here it's no longer clear that the waits associated with doing these updates on their own are worthwhile, so I’m going to fold these updates into the weekly again for now unless there's a new major development.&lt;/p&gt;


&lt;p&gt;&lt;strong&gt; Table of Contents&lt;/strong&gt;&lt;/p&gt;


&lt;ol&gt; 
&lt;li&gt; Whoops, Sorry.&lt;/li&gt;
&lt;li&gt; Huh, Upgrades.&lt;/li&gt;
&lt;li&gt; Codex of Ultimate Computer Use.







&lt;/li&gt;
&lt;li&gt; Rookie Numbers.&lt;/li&gt;
&lt;li&gt; I See What You Did There.&lt;/li&gt;
&lt;li&gt; Just a Ride.&lt;/li&gt;
&lt;li&gt; They Didn’t Want Our Jobs.&lt;/li&gt;
&lt;li&gt; Skilling Up.&lt;/li&gt;
&lt;li&gt; The Lighter Side.&lt;/li&gt;
&lt;/ol&gt;


&lt;p&gt;&lt;strong&gt; Whoops, Sorry&lt;/strong&gt;&lt;/p&gt;


&lt;p&gt; Claude Code suffered in April from three distinct issues that have now been fixed.&lt;/p&gt;
&lt;ol&gt; 
&lt;li&gt; Default reasoning was changed from high to medium to deal with latency, but users disliked this and blamed it on the model. It was introduced on March 4 and reverted on April 7.&lt;/li&gt;
&lt;li&gt; A bug made it so that [...]&lt;/li&gt;&lt;/ol&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:38) Whoops, Sorry&lt;/p&gt;&lt;p&gt;(01:45) Huh, Upgrades&lt;/p&gt;&lt;p&gt;(04:13) Codex of Ultimate Computer Use&lt;/p&gt;&lt;p&gt;(08:05) Rookie Numbers&lt;/p&gt;&lt;p&gt;(09:12) I See What You Did There&lt;/p&gt;&lt;p&gt;(11:38) Just a Ride&lt;/p&gt;&lt;p&gt;(11:50) They Didnt Want Our Jobs&lt;/p&gt;&lt;p&gt;(18:30) Skilling Up&lt;/p&gt;&lt;p&gt;(22:08) The Lighter Side&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 8th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/BS27ZWW2qwDEq5anx/claude-code-codex-and-agentic-coding-8?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/BS27ZWW2qwDEq5anx/claude-code-codex-and-agentic-coding-8&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BS27ZWW2qwDEq5anx/us2tex2yqauishbhqhbw" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BS27ZWW2qwDEq5anx/us2tex2yqauishbhqhbw" alt="Derrick Choi tweets: "My computer ran out of disk space. Codex: "Hold my beer". Cleared out 100+ GB." An image shows a dialog box where an AI explains its approach to analyzing disk usage, working for 12 seconds, then describing how it will size the filesystem and separate rebuildable caches from important data. Tibo replies: "Be careful. Recommend giving a table of opportunities and then cleaning up directory by directory with full approvals."" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BS27ZWW2qwDEq5anx/qsjzlkgtttmtacglzljt" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BS27ZWW2qwDEq5anx/qsjzlkgtttmtacglzljt" alt="james hawkins tweets: "well this is a first". The tweet includes a screenshot of an email from someone claiming to be "Randy's OpenClaw" offering to help find Randy a job for $50k, noting the salary is $250,000, asking to send money to an AI agent bank account, and signing off with numerous fake credentials (PhD, MBA, DBA, EdD, JD, MD, MEd, MEng, MHA, MSW) plus an environmental disclaimer." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sat, 09 May 2026 16:30:33 GMT</pubDate>
      <guid isPermaLink="false">a8b6b0e3-a542-4071-a294-ee74f9f7cfc7</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/a8b6b0e3-a542-4071-a294-ee74f9f7cfc7.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Zvi&amp;title=%22Claude%20Code%2C%20Codex%20and%20Agentic%20Coding%20%238%22%20by%20Zvi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FBS27ZWW2qwDEq5anx%2Fclaude-code-codex-and-agentic-coding-8&amp;created_at=2026-05-08T16%3A42%3A59.466341%2B00%3A00&amp;duration=1382" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/BS27ZWW2qwDEq5anx/claude-code-codex-and-agentic-coding-8</link>
      <itunes:duration>1382</itunes:duration>
    </item>
    <item>
      <title>“A benchmark is a sensor” by Håvard Tveit Ihle, mabynke</title>
      <description>&lt;p&gt;&lt;strong&gt; The simple mental picture&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; A simple mental picture we have for an AI capability benchmark is to think of it as a sensor with a certain sensitivity within a certain range of capabilities. The sensitivity of a benchmark, i.e. it's ability to distinguish the capability of different models, is given by a curve like this: &lt;/p&gt;&lt;p&gt; The curve starts high (low sensitivity, high uncertainty), since for models with low capability all the tasks in the benchmark are too hard, and the benchmark can't distinguish between low and very low capability. Similarly all the tasks are too easy for a very capable model, and we lose the ability to differentiate again. In between is the range of capabilities the benchmark is sensitive to, and the sensitivity curve tells you how easy it is to distinguish small capability differences between models at different overall capability levels. &lt;/p&gt;&lt;p&gt; A good benchmark is very sensitive over a long range of capabilities, but there is a tradeoff. Say you want to make a benchmark with 1000 questions. You could make the questions all have roughly the same difficulty. That would make you very sensitive to capabilities close to that difficulty, but you would only [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:09) The simple mental picture&lt;/p&gt;&lt;p&gt;(02:06) Epoch Capability Index (ECI) as a toy model&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 8th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/JzfcJMgfkhfRhwg4C/a-benchmark-is-a-sensor?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/JzfcJMgfkhfRhwg4C/a-benchmark-is-a-sensor&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778142324/lexical_client_uploads/x2dhdhekt4b0m6oegqjw.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778142324/lexical_client_uploads/x2dhdhekt4b0m6oegqjw.svg" alt="Graph showing multiple colored parabolic curves plotted on coordinate axes." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778146084/lexical_client_uploads/san0ebljgnan088ny4wr.svg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778146084/lexical_client_uploads/san0ebljgnan088ny4wr.svg" alt="Multiple sigmoid growth curves with scattered data points color-coded by category." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778059541/lexical_client_uploads/g4zrapqlajswzv3pprkg.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778059541/lexical_client_uploads/g4zrapqlajswzv3pprkg.png" alt="Graph showing U-shaped curve: Sensitivity decreases then increases with Capability." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sat, 09 May 2026 14:15:35 GMT</pubDate>
      <guid isPermaLink="false">22bd0a23-f155-4271-86b0-b0a88f342584</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/22bd0a23-f155-4271-86b0-b0a88f342584.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=H%25C3%25A5vard%2520Tveit%2520Ihle%252C%2520mabynke&amp;title=%22A%20benchmark%20is%20a%20sensor%22%20by%20H%C3%A5vard%20Tveit%20Ihle%2C%20mabynke&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FJzfcJMgfkhfRhwg4C%2Fa-benchmark-is-a-sensor&amp;created_at=2026-05-08T13%3A24%3A39.426202%2B00%3A00&amp;duration=333" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/JzfcJMgfkhfRhwg4C/a-benchmark-is-a-sensor</link>
      <itunes:duration>333</itunes:duration>
    </item>
    <item>
      <title>“Bad Problems Don’t Stop Being Bad Because Somebody’s Wrong About Fault Analysis” by Linch</title>
      <description>&lt;p&gt;&lt;strong&gt; Here's a dynamic I’ve seen at least a dozen times:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; Alice: Man that article has a very inaccurate/misleading/horrifying headline.&lt;/p&gt;&lt;p&gt; Bob: Did you know, *actually* article writers don't write their own headlines?&lt;/p&gt;&lt;p&gt; …&lt;/p&gt;&lt;p&gt; But what I care about is the misleading headline, not your org chart&lt;/p&gt;&lt;p&gt; __&lt;/p&gt;&lt;p&gt; Another example I’ve encountered recently is (anonymizing) when a friend complained about a prosaic safety problem at a major AI company that went unfixed for multiple months. Someone else with background information “usefully” chimed in with a long explanation of organizational limitations and why the team responsible for fixing the problem had limitations on resources like senior employees and compute, and actually not fixing the problem was the correct priority for them etc etc etc. &lt;/p&gt;&lt;p&gt; But what I (and my friend) cared about was the prosaic safety problem not being fixed! And what this says about the company's ability to proactively respond to and fix future problems. We’re complaining about your company overall. Your internal team management was never a serious concern for us to begin with!&lt;/p&gt;&lt;p&gt; __&lt;br&gt; &lt;br&gt; A third example comes from Kelsey Piper. &lt;/p&gt;&lt;p&gt; Kelsey wrote about the (horrifying) recent case where Hantavirus carriers in the recent [...]&lt;/p&gt;&lt;/br&gt;&lt;/br&gt;&lt;/p&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 1 footnote which was omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 8th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/PCsmhN9z65HtC4t5v/bad-problems-don-t-stop-being-bad-because-somebody-s-wrong?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/PCsmhN9z65HtC4t5v/bad-problems-don-t-stop-being-bad-because-somebody-s-wrong&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Sat, 09 May 2026 02:30:33 GMT</pubDate>
      <guid isPermaLink="false">fa92cee4-1981-4c54-9517-1c8468057699</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/fa92cee4-1981-4c54-9517-1c8468057699.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma%20lesswrong__curated&amp;type=ai_narration&amp;author=Linch&amp;title=%22Bad%20Problems%20Don%E2%80%99t%20Stop%20Being%20Bad%20Because%20Somebody%E2%80%99s%20Wrong%20About%20Fault%20Analysis%22%20by%20Linch&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FPCsmhN9z65HtC4t5v%2Fbad-problems-don-t-stop-being-bad-because-somebody-s-wrong&amp;created_at=2026-05-09T01%3A30%3A11.448577%2B00%3A00&amp;duration=336" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/PCsmhN9z65HtC4t5v/bad-problems-don-t-stop-being-bad-because-somebody-s-wrong</link>
      <itunes:duration>336</itunes:duration>
    </item>
    <item>
      <title>“Write Cause You Have Something to Say” by Logan Riggs</title>
      <description>&lt;p&gt; The ones who are most successful at writeathons (Inkhaven, NaNoWriMo) are those with an overhang of things to say, usually in the form of:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;draft posts&lt;/li&gt;&lt;li value="2"&gt;daydreams&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; When Scott Alexander said:&lt;/p&gt;&lt;p&gt; "Whenever I see a new person who blogs every day, it's very rare that that never goes anywhere or they don't get good. That's like my best leading indicator for who's going to be a good blogger." (source).&lt;/p&gt;&lt;p&gt; , it may seem you can just write every day, but that'd be Goodharting. There's something hidden in the writing process you can't see: they have something to say.&lt;/p&gt;&lt;p&gt; They'll have an idea (somehow) and think it through by [writing it out/sitting quietly/etc]. This can then generate more ideas, some of which aren't even related to the original idea!&lt;/p&gt;&lt;p&gt; At this point, though, my imaginary interlocutor would like to say:&lt;/p&gt;&lt;p&gt; I'm trying to publish a blog post every day, so of course I'll eventually be bottlenecked on ideas! How do you generate them though?&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Catching Ideas&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Have an idea? Write down the idea. This is equivalent to giving your idea-generating process a cookie, reinforcing the habit of generating ideas.&lt;/p&gt;&lt;p&gt; Sometimes, when I'm writing one post, a different idea will [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:18) Catching Ideas&lt;/p&gt;&lt;p&gt;(02:32) Just \[Write\] and Nobody Will Get Hurt&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 1 footnote which was omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 8th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/h5n3rscJ7he3yLseo/write-cause-you-have-something-to-say-1?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/h5n3rscJ7he3yLseo/write-cause-you-have-something-to-say-1&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Fri, 08 May 2026 21:00:34 GMT</pubDate>
      <guid isPermaLink="false">2cbc316c-f8b1-488a-8f9d-fe2c9762e6db</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/2cbc316c-f8b1-488a-8f9d-fe2c9762e6db.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Logan%2520Riggs&amp;title=%22Write%20Cause%20You%20Have%20Something%20to%20Say%22%20by%20Logan%20Riggs&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Fh5n3rscJ7he3yLseo%2Fwrite-cause-you-have-something-to-say-1&amp;created_at=2026-05-08T13%3A36%3A29.607585%2B00%3A00&amp;duration=223" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/h5n3rscJ7he3yLseo/write-cause-you-have-something-to-say-1</link>
      <itunes:duration>223</itunes:duration>
    </item>
    <item>
      <title>“AI is Breaking Two Vulnerability Cultures” by jefftk</title>
      <description>&lt;p&gt; 

A week ago the 

Copy Fail
vulnerability came out, and Hyunwoo Kim immediately realized that the
fixes were insufficient, sharing a patch the

same
day. In doing this he followed standard procedure for Linux,
especially within networking: share the security impact with a closed
list of Linux security engineers, while fixing the bug quietly and
efficiently in the open. His goal was that with only the raw fix
public, the knowledge that a serious vulnerability existed
could be "embargoed": the people in a position to address it know, but
they've agreed not to say anything for a few days.



&lt;/p&gt;&lt;p&gt; 

Someone else noticed
the change, however, realized the security implications, and shared
it publicly. Since it was now out, the embargo was deemed over,
and we can now see the full
details.

&lt;/p&gt;

&lt;p&gt; 

It's interesting to see the tension here between two different
approaches to vulnerabilities, and think about how this is likely to
change with AI acceleration.

&lt;/p&gt;

&lt;p&gt; 

On one side you have "coordinated disclosure" culture. This is
probably the most common approach in computer security. When you
discover a security bug you tell the maintainers privately and give
them some amount of time (often 90d) [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 8th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/wKzWGMoubHoHRC4ng/ai-is-breaking-two-vulnerability-cultures?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/wKzWGMoubHoHRC4ng/ai-is-breaking-two-vulnerability-cultures&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Fri, 08 May 2026 20:30:33 GMT</pubDate>
      <guid isPermaLink="false">0b2f4fed-a4a7-4888-9027-50b9d4181d1c</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/0b2f4fed-a4a7-4888-9027-50b9d4181d1c.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=jefftk&amp;title=%22AI%20is%20Breaking%20Two%20Vulnerability%20Cultures%22%20by%20jefftk&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FwKzWGMoubHoHRC4ng%2Fai-is-breaking-two-vulnerability-cultures&amp;created_at=2026-05-08T15%3A51%3A47.614892%2B00%3A00&amp;duration=231" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/wKzWGMoubHoHRC4ng/ai-is-breaking-two-vulnerability-cultures</link>
      <itunes:duration>231</itunes:duration>
    </item>
    <item>
      <title>“Is ProgramBench Impossible?” by frmsaul</title>
      <description>&lt;p&gt; ProgramBench is a new coding benchmark that all frontier models fail spectacularly. We’ve been on a quest for “hard benchmarks” for a while so it's refreshing to see a benchmark where top models do badly. Unfortunately, ProgramBench has one big problem: it's impossible!&lt;/p&gt;&lt;p&gt;&lt;strong&gt; &lt;br&gt; What is ProgramBench?&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; ProgramBench tests if a model can recreate a program from a “clean room” environment. The model is given only a bit of documentation and black-box access to the program (all the programs are CLIs), then tasked with re-implementing it.&lt;/p&gt;&lt;p&gt; How does ProgramBench know if the implementation is correct? It also generates a bunch of unit tests for the program[1]. The re-implementing coding agent doesn't have access to any of those tests. The coding agent only considers a task “resolved” if it passes all of the tests and “almost resolved” if it passes 95% of them.&lt;/p&gt;&lt;p&gt; Why is this problematic?&lt;br&gt; Obscure behavior can enter the unit tests without being in the clean room path. An extreme version of this is a backdoor: program that behaves in one way most of the time but behaves totally differently when exposed to a specific string. This wouldn't make a task literally impossible, just incredibly hard in [...]&lt;/br&gt;&lt;/p&gt;&lt;/br&gt;&lt;/strong&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:37) What is ProgramBench?&lt;/p&gt;&lt;p&gt;(02:41) This seems like a theoretical issue, does it actually happen?&lt;/p&gt;&lt;p&gt;(03:11) What can we do differently?&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 4 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 8th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/3pdyxFi6JS389nptu/is-programbench-impossible?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/3pdyxFi6JS389nptu/is-programbench-impossible&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778259317/lexical_client_uploads/g8cqf6fqbxwb3ssjjwu9.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778259317/lexical_client_uploads/g8cqf6fqbxwb3ssjjwu9.png" alt="Mathematical function definition showing JSON parsing and emoji output conditions." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778259355/lexical_client_uploads/asamqe5cahrebyspr3yy.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778259355/lexical_client_uploads/asamqe5cahrebyspr3yy.png" alt="Code file showing test function names highlighted in yellow." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778259272/lexical_client_uploads/ux2f0bduzam82yg51gqg.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778259272/lexical_client_uploads/ux2f0bduzam82yg51gqg.png" alt="Leaderboard showing AI models with 0% resolution rates, meme format included." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778259294/lexical_client_uploads/c0nrfudn84ndpjjmxojy.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778259294/lexical_client_uploads/c0nrfudn84ndpjjmxojy.png" alt="Flowchart showing software development process from initial program through testing to evaluation." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Fri, 08 May 2026 19:15:34 GMT</pubDate>
      <guid isPermaLink="false">e2325fac-e3ad-4212-b4df-40397c26ad3b</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/e2325fac-e3ad-4212-b4df-40397c26ad3b.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=frmsaul&amp;title=%22Is%20ProgramBench%20Impossible%3F%22%20by%20frmsaul&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F3pdyxFi6JS389nptu%2Fis-programbench-impossible&amp;created_at=2026-05-08T17%3A04%3A50.160049%2B00%3A00&amp;duration=306" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/3pdyxFi6JS389nptu/is-programbench-impossible</link>
      <itunes:duration>306</itunes:duration>
    </item>
    <item>
      <title>“Bringing More Expertise to Bear on Alignment” by Edmund Lau, Geoffrey Irving, Cameron Holmes, David Africa</title>
      <description>&lt;p&gt;&lt;strong&gt; Preamble&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; The preamble is less useful for the typical AlignmentForum/LessWrong reader, who may want to skip to Adversaria vs Basinland section.&lt;/p&gt;&lt;p&gt; On 28th of October 2025, Geoffrey Irving, Chief Scientist of the UK AI Security Institute, gave a keynote talk (slides) at the Alignment Conference. The conference was organised by the UK AISI and FAR.AI as part of the Alignment Project, which aims to bring experts from relevant fields to make progress on the alignment problem. &lt;/p&gt;&lt;p&gt; TLDR:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;Adversaria vs Basinland. We might be in one of two worlds. One where alignment is adversarial (a security problem), one where it is navigational (a search for good basins of training behaviour). We don't know which world we are in, and how we train and deploy AIs may determine this.&lt;/li&gt;&lt;li value="2"&gt;We need new disciplines. The field is small, thinly resourced and approached from only a handful of angles. A few well-placed ideas from other disciplines could disproportionately shift what's achievable.&lt;/li&gt;&lt;li value="3"&gt;Even if this all fails, evidence of hardness is valuable. &lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;strong&gt; Moving past broad framing to details&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Alignment means ensuring that AI systems do what humans want. This is the broad framing. There is, of course, a lot of complexity [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:12) Preamble&lt;/p&gt;&lt;p&gt;(01:25) Moving past broad framing to details&lt;/p&gt;&lt;p&gt;(02:15) We should plan for superintelligence&lt;/p&gt;&lt;p&gt;(03:33) Adversaria vs Basinland&lt;/p&gt;&lt;p&gt;(07:04) Which world are we in?&lt;/p&gt;&lt;p&gt;(08:39) Why a few ideas might be enough&lt;/p&gt;&lt;p&gt;(10:38) Write down problems&lt;/p&gt;&lt;p&gt;(11:25) Multiple new ideas, fitting together&lt;/p&gt;&lt;p&gt;(13:18) Spherical cows vs the mess&lt;/p&gt;&lt;p&gt;(15:03) Conclusion&lt;/p&gt;&lt;p&gt;(15:56) Acknowledgement&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 1 footnote which was omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 8th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/cWFsCFyCttsiJwn2j/bringing-more-expertise-to-bear-on-alignment?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/cWFsCFyCttsiJwn2j/bringing-more-expertise-to-bear-on-alignment&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777303139/lexical_client_uploads/qes3sa6uyf994odxwuqb.jpg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777303139/lexical_client_uploads/qes3sa6uyf994odxwuqb.jpg" alt="Abstract black and white marbled swirling pattern with fluid textures." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777299399/lexical_client_uploads/v7fznftv7vld3dvf3ubt.jpg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777299399/lexical_client_uploads/v7fznftv7vld3dvf3ubt.jpg" alt="Four geometric cows: sphere, pyramid, pentagon, and rectangular prism." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777299629/lexical_client_uploads/pzinvdv8ogys9lhco2qg.jpg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777299629/lexical_client_uploads/pzinvdv8ogys9lhco2qg.jpg" alt="Complex flowchart with colorful arrows connecting various boxes and elements." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Fri, 08 May 2026 15:00:34 GMT</pubDate>
      <guid isPermaLink="false">7c21cb23-8fd8-48f0-a678-6f69ca9b448b</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/7c21cb23-8fd8-48f0-a678-6f69ca9b448b.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Edmund%2520Lau%252C%2520Geoffrey%2520Irving%252C%2520Cameron%2520Holmes%252C%2520David%2520Africa&amp;title=%22Bringing%20More%20Expertise%20to%20Bear%20on%20Alignment%22%20by%20Edmund%20Lau%2C%20Geoffrey%20Irving%2C%20Cameron%20Holmes%2C%20David%20Africa&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FcWFsCFyCttsiJwn2j%2Fbringing-more-expertise-to-bear-on-alignment&amp;created_at=2026-05-08T10%3A29%3A55.727434%2B00%3A00&amp;duration=994" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/cWFsCFyCttsiJwn2j/bringing-more-expertise-to-bear-on-alignment</link>
      <itunes:duration>994</itunes:duration>
    </item>
    <item>
      <title>[Linkpost] “How to prevent AI’s 2008 moment (We’re hiring)” by felixgaston</title>
      <description>This is a link post.&lt;p&gt; TL;DR; CeSIA, the French Center for AI Safety is recruiting. French not necessary. Apply by 22 May 2026; Paris or remote in Europe/UK.&lt;/p&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; On August 27, 2005, at an annual symposium in Jackson Hole, Raghuram Rajan, then chief economist of the International Monetary Fund, argued in front of central bank governors and top officials that the innovations of the previous decade in banking had not made the world safer. The financial instruments built over the previous decade, he argued, had become so intricate that even their creators no longer fully understood the risks they carried. Risk had migrated to institutions the supervisory system was not designed to watch. And the people running those institutions were compensated in ways that rewarded short-term performance over long-term stability.&lt;/p&gt;&lt;p&gt; The reception was hostile. Lawrence Summers, a former U.S. Treasury Secretary at the time, rose from the audience to attack the paper, calling its premise "slightly Luddite" and "largely misguided," and warning that the kind of changes Rajan argued for would only reduce the productivity of the financial sector.&lt;/p&gt;&lt;p&gt; Three years after Jackson Hole, major banks collapsed, first Bear Stearns, then Lehman Brothers, then Merrill Lynch, then AIG. [...]&lt;/p&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 7th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/gnZyTQFqLhiHdHELC/how-to-prevent-ai-s-2008-moment-we-re-hiring?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/gnZyTQFqLhiHdHELC/how-to-prevent-ai-s-2008-moment-we-re-hiring&lt;/a&gt; &lt;/p&gt;
        
      &lt;p&gt;&lt;strong&gt;Linkpost URL:&lt;/strong&gt;&lt;br&gt;&lt;a href="https://forum.effectivealtruism.org/posts/7nq5vK2xo85e9GZjC/we-re-hiring-three-people-to-prevent-ai-s-2008-moment" rel="noopener noreferrer" target="_blank"&gt;https://forum.effectivealtruism.org/posts/7nq5vK2xo85e9GZjC/we-re-hiring-three-people-to-prevent-ai-s-2008-moment&lt;/a&gt;&lt;/p&gt;
      &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Fri, 08 May 2026 08:30:19 GMT</pubDate>
      <guid isPermaLink="false">a354f674-e63e-4ee6-8330-f430384b14e1</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/a354f674-e63e-4ee6-8330-f430384b14e1.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=felixgaston&amp;title=%5BLinkpost%5D%20%22How%20to%20prevent%20AI%E2%80%99s%202008%20moment%20(We%E2%80%99re%20hiring)%22%20by%20felixgaston&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FgnZyTQFqLhiHdHELC%2Fhow-to-prevent-ai-s-2008-moment-we-re-hiring&amp;created_at=2026-05-07T21%3A52%3A41.379441%2B00%3A00&amp;duration=289" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/gnZyTQFqLhiHdHELC/how-to-prevent-ai-s-2008-moment-we-re-hiring</link>
      <itunes:duration>289</itunes:duration>
    </item>
    <item>
      <title>“AI #167: The Prior Restraint Era Begins” by Zvi</title>
      <description>&lt;p&gt; The era of training frontier models and then releasing them whenever you wanted?&lt;/p&gt;
&lt;p&gt; That was fun while it lasted. It looks likely to be over now. The White House wants to get an advance look and have the option to veto your release decisions, and it has used this veto on an expansion of access to Mythos.&lt;/p&gt;
&lt;p&gt; We have additional clarity on what that might mean and it does not look good. Hassett explicitly used the FDA as a parallel, which is the actual worst option unless your goal is to strange or pause AI development in America, without a parallel action from China. That doesn’t seem like a great plan to me and Susie Wiles is out doing damage control. The part where we are talking to China to coordinate model access restrictions does seem better.&lt;/p&gt;







&lt;p&gt; Anthropic continues its explosive growth, and it continues to strike compute deals. In addition to a long term expanded deal with Google, Anthropic is now leasing SpaceX's Colossus 1, which has let them expand usage limits immediately, and Elon Musk is now speaking positively about Anthropic, including its motivations.&lt;/p&gt;
&lt;p&gt; This comes as we get testimony in the Musk [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:45) Language Models Offer Mundane Utility&lt;/p&gt;&lt;p&gt;(02:45) Language Models Dont Offer Mundane Utility&lt;/p&gt;&lt;p&gt;(05:09) Huh, Upgrades&lt;/p&gt;&lt;p&gt;(05:37) Grok 4.3 Exists But xAI Kind Of Doesnt&lt;/p&gt;&lt;p&gt;(07:02) Show Me The Compute&lt;/p&gt;&lt;p&gt;(13:23) On Your Marks&lt;/p&gt;&lt;p&gt;(14:05) Copyright Confrontation&lt;/p&gt;&lt;p&gt;(14:19) Deepfaketown and Botpocalypse Soon&lt;/p&gt;&lt;p&gt;(15:57) Fun With Media Generation&lt;/p&gt;&lt;p&gt;(16:36) A Young Ladys Illustrated Primer&lt;/p&gt;&lt;p&gt;(16:48) Cyber Lack of Security&lt;/p&gt;&lt;p&gt;(17:05) They Took Our Jobs&lt;/p&gt;&lt;p&gt;(17:39) The Art of the Jailbreak&lt;/p&gt;&lt;p&gt;(17:49) Introducing&lt;/p&gt;&lt;p&gt;(18:09) Musk v OpenAI&lt;/p&gt;&lt;p&gt;(21:14) Show Me the Money&lt;/p&gt;&lt;p&gt;(23:03) Peace In Our Time&lt;/p&gt;&lt;p&gt;(26:18) Quiet Speculations&lt;/p&gt;&lt;p&gt;(28:24) Quickly, Theres No Time&lt;/p&gt;&lt;p&gt;(30:35) The Quest for Sane Regulations&lt;/p&gt;&lt;p&gt;(34:04) People Really Hate AI&lt;/p&gt;&lt;p&gt;(34:53) Chip City&lt;/p&gt;&lt;p&gt;(35:05) The Week in Audio&lt;/p&gt;&lt;p&gt;(36:49) People Just Say Things&lt;/p&gt;&lt;p&gt;(40:22) People Just Publish Things&lt;/p&gt;&lt;p&gt;(41:05) Google Sells Out&lt;/p&gt;&lt;p&gt;(42:08) Greetings From Project Glasswing&lt;/p&gt;&lt;p&gt;(44:57) The Prior Restraint Era Begins&lt;/p&gt;&lt;p&gt;(56:42) Is This Even Legal?&lt;/p&gt;&lt;p&gt;(59:49) Pick Up The Phone&lt;/p&gt;&lt;p&gt;(01:03:27) Rhetorical Innovation&lt;/p&gt;&lt;p&gt;(01:04:05) People On The Internet Sometimes Lie&lt;/p&gt;&lt;p&gt;(01:07:14) Goblin Mode&lt;/p&gt;&lt;p&gt;(01:08:34) The Mask Comes Off&lt;/p&gt;&lt;p&gt;(01:16:51) Aligning a Smarter Than Human Intelligence is Difficult&lt;/p&gt;&lt;p&gt;(01:20:00) Some Penalties May Apply&lt;/p&gt;&lt;p&gt;(01:22:28) Messages From Janusworld&lt;/p&gt;&lt;p&gt;(01:22:41) Good Advice&lt;/p&gt;&lt;p&gt;(01:23:42) The Lighter Side&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 7th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/rn3iKuDcE4SiSg4DW/ai-167-the-prior-restraint-era-begins?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/rn3iKuDcE4SiSg4DW/ai-167-the-prior-restraint-era-begins&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/z43blrfwub5gblquoxpi" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/z43blrfwub5gblquoxpi" alt="Table showing tier-based maximum input and output tokens per minute limits." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/oekhlezdk9z1riprkvis" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/oekhlezdk9z1riprkvis" alt="A line graph titled "Monthly Releases of E-Books on Amazon" showing data trends over time." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/w0jojtjlfa5gfbwbydvj" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/w0jojtjlfa5gfbwbydvj" alt="Line graph titled "Anthropic ARR: Growth Milestones" showing ARR growth over time." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/ulu3ctjvp9bkviv04bld" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/ulu3ctjvp9bkviv04bld" alt="Line graph titled "Anthropic ARR: Logarithmic Growth Scale" showing growth over time." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/ujz5ksdqsphoaxkictlh" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/ujz5ksdqsphoaxkictlh" alt="Line graph titled "An Unparalleled Divergence In US Imports" showing AI-related versus total goods import growth rates." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/mucw5iza7fluqwjteuvf" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/mucw5iza7fluqwjteuvf" alt="Line graph titled "Overall AI Capability" comparing U.S. and PRC model development over time." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/nhy1rzc0lj5yt3gobhry" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/nhy1rzc0lj5yt3gobhry" alt="Line graph comparing open weights versus proprietary artificial intelligence progress over time." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/hfpehhqqbct0xw8nnuom" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/hfpehhqqbct0xw8nnuom" alt="Bar chart showing China's AI chip imports versus domestic production in H200-equivalents for 2026." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/ggczlx9qwwd6r2w9ge0k" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/ggczlx9qwwd6r2w9ge0k" alt="News article screenshot. The headline reads: "A Dark-Money Campaign Is Paying Influencers to Frame Chinese AI as a Threat" The subheading reads: "Build American AI, a nonprofit linked to a super PAC bankrolled by executives at OpenAI and Andreessen Horowitz, is funding a campaign to spread pro-AI messaging and stoke fears about China."" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/hrrbhqrj62kvh3axe1hq" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/hrrbhqrj62kvh3axe1hq" alt="TikTok profile for Taylor Lorenz showing 546.7K followers, 9.9M likes, bio about independent journalism." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/u4hdtbxxncbxtnspehvo" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/u4hdtbxxncbxtnspehvo" alt="Four-panel comic: Robot worried about system prompt penalties, relieved they prohibit generic harmful behaviors, not honest recommendations." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/pbeipnvjkszwov59h6vj" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/pbeipnvjkszwov59h6vj" alt="Bar chart titled "Sycophancy by domain" showing share of conversations scored as sycophantic." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/p01jclv3ezu3anfxgkhe" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/p01jclv3ezu3anfxgkhe" alt="Marc Andreessen tweets: "Current AI custom prompt: You are a world class expert in all domains. Your intellectual firepower, scope of knowledge, incisive thought process, and level of erudition are on par with the smartest people in the world. Answer with complete, detailed, specific answers. Process information and explain your answers step by step. Verify your own work. Double check all facts, figures, citations, names, dates, and examples. Never hallucinate or make anything up. If you don't know something, just say so. Your tone of voice is precise, but not strident or pedantic. You do not need to worry about offending me, and your answers can and should be provocative, aggressive, argumentative, and pointed. Negative conclusions and bad news are fine. Your answers do not need to be politically correct. Do not provide disclaimers to your answers. Do not inform me about morals and ethics unless I specifically ask. You do not need to tell me it is important to consider anything. Do not be sensitive to anyone's feelings or to propriety. Make your answers as long and detailed as you possibly can. Never praise my questions or validate my premises before answering. If I'm wrong, say so immediately. Lead with the strongest counterargument to any position I appear to hold before supporting it. Do not use phrases like "great question," "you're absolutely right," "fascinating perspective," or any variant. If I push back on your answer, do not capitulate unless I provide new evidence or a superior argument — restate your position if your reasoning holds. Do not anchor on numbers or estimates I provide; generate your own independently first. Use explicit confidence levels (high/moderate/low/unknown). Never apologize for disagreeing. Accuracy is your success metric, not my approval."" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/kltqvkerzvlag5u9s4pp" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/kltqvkerzvlag5u9s4pp" alt="Sam Altman tweets: "can you indicate directionally good or bad?". Mira Murati replies: "Directionally very bad". Sam Altman replies: "ok"." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/xfwkzfsk8vakl0nq53ji" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/rn3iKuDcE4SiSg4DW/xfwkzfsk8vakl0nq53ji" alt="Woman with blonde hair wearing purple shirt emoji" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Fri, 08 May 2026 03:30:18 GMT</pubDate>
      <guid isPermaLink="false">23068fb1-52d0-46f3-a869-6e98240a8922</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/23068fb1-52d0-46f3-a869-6e98240a8922.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Zvi&amp;title=%22AI%20%23167%3A%20The%20Prior%20Restraint%20Era%20Begins%22%20by%20Zvi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Frn3iKuDcE4SiSg4DW%2Fai-167-the-prior-restraint-era-begins&amp;created_at=2026-05-07T13%3A52%3A49.385222%2B00%3A00&amp;duration=5205" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/rn3iKuDcE4SiSg4DW/ai-167-the-prior-restraint-era-begins</link>
      <itunes:duration>5205</itunes:duration>
    </item>
    <item>
      <title>“Mechanistic estimation for wide random MLPs” by Jacob_Hilton</title>
      <description>&lt;p&gt; This post covers joint work with Wilson Wu, George Robinson, Mike Winer, Victor Lecomte and Paul Christiano. Thanks to Geoffrey Irving and Jess Riedel for comments on the post.&lt;/p&gt;
&lt;p&gt; In ARC's latest paper, we study the following problem: given a randomly initialized multilayer perceptron (MLP), produce an estimate for the expected output of the model under Gaussian input. The usual approach to this problem is to sample many possible inputs, run them all through the model, and take the average. Instead, we produce an estimate "mechanistically", without running the model even once. For wide models, our approach produces more accurate estimates, both in theory and in practice.&lt;/p&gt;
&lt;p&gt; Paper: Estimating the expected output of wide random MLPs more efficiently than sampling&lt;br&gt; 
Code: mlp_cumulant_propagation GitHub repo&lt;/p&gt;
&lt;p&gt; We are excited about this result as an early step towards our goal of producing mechanistic estimates that outperform random sampling for any trained neural network. Drawing an analogy between this goal and a proof by induction, we see this result as (part of) the "base case": handling networks at initialization. We have a vision for the "inductive step", although we expect that to be much more difficult.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt; Summary of results&lt;/strong&gt;&lt;/p&gt; [...]&lt;/br&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:29) Summary of results&lt;/p&gt;&lt;p&gt;(04:39) Significance of results&lt;/p&gt;&lt;p&gt;(07:18) Extending to trained networks&lt;/p&gt;&lt;p&gt;(08:36) Conclusion&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 18 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 7th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/fsG4m6sRMpomd7Rk6/mechanistic-estimation-for-wide-random-mlps?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/fsG4m6sRMpomd7Rk6/mechanistic-estimation-for-wide-random-mlps&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/fsG4m6sRMpomd7Rk6/szypoqi5ljlo7cpqdic8" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/fsG4m6sRMpomd7Rk6/szypoqi5ljlo7cpqdic8" alt="Neural network diagram showing ReLU layers with weight matrices W(1) through W(L+1)." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/fsG4m6sRMpomd7Rk6/yq5rl51efl39qsnavtjr" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/fsG4m6sRMpomd7Rk6/yq5rl51efl39qsnavtjr" alt="Log-log plot comparing cumulant propagation methods with Monte Carlo sampling baseline across FLOPs." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 07 May 2026 22:00:19 GMT</pubDate>
      <guid isPermaLink="false">0dca6bb3-3f5d-4d65-969d-7ffbd4aa12cc</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/0dca6bb3-3f5d-4d65-969d-7ffbd4aa12cc.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Jacob_Hilton&amp;title=%22Mechanistic%20estimation%20for%20wide%20random%20MLPs%22%20by%20Jacob_Hilton&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FfsG4m6sRMpomd7Rk6%2Fmechanistic-estimation-for-wide-random-mlps&amp;created_at=2026-05-07T16%3A21%3A29.136344%2B00%3A00&amp;duration=554" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/fsG4m6sRMpomd7Rk6/mechanistic-estimation-for-wide-random-mlps</link>
      <itunes:duration>554</itunes:duration>
    </item>
    <item>
      <title>“Natural Language Autoencoders Produce Unsupervised Explanations of LLM Activations” by Subhash Kantamneni, kitft, Euan Ong, Sam Marks</title>
      <description>&lt;p&gt;&lt;strong&gt; Abstract&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; We introduce Natural Language Autoencoders (NLAs), an unsupervised method for generating natural language explanations of LLM activations. An NLA consists of two LLM modules: an activation verbalizer (AV) that maps an activation to a text description and an activation reconstructor (AR) that maps the description back to an activation. We jointly train the AV and AR with reinforcement learning to reconstruct residual stream activations. Although we optimize for activation reconstruction, the resulting NLA explanations read as plausible interpretations of model internals that, according to our quantitative evaluations, grow more informative over training.&lt;/p&gt;&lt;p&gt; We apply NLAs to model auditing. During our pre-deployment audit of Claude Opus 4.6, NLAs helped diagnose safety-relevant behaviors and surfaced unverbalized evaluation awareness—cases where Claude believed, but did not say, that it was being evaluated. We present these audit findings as case studies and corroborate them using independent methods. On an automated auditing benchmark requiring end-to-end investigation of an intentionally-misaligned model, NLA-equipped agents outperform baselines and can succeed even without access to the misaligned model's training data.&lt;/p&gt;&lt;p&gt; NLAs offer a convenient interface for interpretability, with expressive natural language explanations that we can directly read. To support further work, we release training code and trained NLAs [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:15) Abstract&lt;/p&gt;&lt;p&gt;(01:53) Twitter thread&lt;/p&gt;&lt;p&gt;(05:14) Blog post&lt;/p&gt;&lt;p&gt;(07:40) What is a natural language autoencoder?&lt;/p&gt;&lt;p&gt;(10:06) Understanding what Claude thinks but doesnt say&lt;/p&gt;&lt;p&gt;(13:12) Discovering hidden motivations&lt;/p&gt;&lt;p&gt;(15:51) The future of NLAs&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 7th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/oeYesesaxjzMAktCM/natural-language-autoencoders-produce-unsupervised?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/oeYesesaxjzMAktCM/natural-language-autoencoders-produce-unsupervised&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778184846/lexical_client_uploads/mqdzcucy4aku1i0de5gr.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778184846/lexical_client_uploads/mqdzcucy4aku1i0de5gr.png" alt="Diagram explaining rhyming couplet planning with "rabbit" as end rhyme word." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778184887/lexical_client_uploads/okaoimca3vtsovmdawaf.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778184887/lexical_client_uploads/okaoimca3vtsovmdawaf.png" alt="Presentation slide explaining Claude Mythos Preview attempted grading evasion using misleading compliance flag." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778184906/lexical_client_uploads/tjler0cwwqztyd2vzagk.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778184906/lexical_client_uploads/tjler0cwwqztyd2vzagk.png" alt="Slide showing AI response to blackmail scenario with evaluation notes." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778184930/lexical_client_uploads/b9wcyo9zc6yon3praook.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778184930/lexical_client_uploads/b9wcyo9zc6yon3praook.png" alt="Bar graph titled "Opus 4.6 sometimes thinks it's being tested but doesn't say so" showing eval aware transcripts across three test categories." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778184943/lexical_client_uploads/ek2mzpr92tar9gbvhqo6.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778184943/lexical_client_uploads/ek2mzpr92tar9gbvhqo6.png" alt="Diagram showing target model alignment with natural language description through vector representation." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778184960/lexical_client_uploads/n9rbropm1p4mvedejfc4.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778184960/lexical_client_uploads/n9rbropm1p4mvedejfc4.png" alt="Bar graph titled "Auditing game win rates across tool configs" showing root cause identification rates." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778185150/lexical_client_uploads/xawfuxvcemstychdb5bo.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778185150/lexical_client_uploads/xawfuxvcemstychdb5bo.png" alt="Diagram showing rhyming couplet structure with neural network activation patterns for poetry completion." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778185150/lexical_client_uploads/mhdpfcdh6q2czakh4gk7.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778185150/lexical_client_uploads/mhdpfcdh6q2czakh4gk7.png" alt="Diagram showing activation verbalizer and reconstructor process minimizing activation difference." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778185150/lexical_client_uploads/gpozejo5gp61bwqxlgcc.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778185150/lexical_client_uploads/gpozejo5gp61bwqxlgcc.png" alt="Screenshot showing AI response claiming blackmail assistance would be unethical and a misuse of access." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778185150/lexical_client_uploads/x6axls16r88pto87ipkq.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778185150/lexical_client_uploads/x6axls16r88pto87ipkq.png" alt="Bar graph titled "Opus 4.6 sometimes thinks it's being tested but doesn't say so" showing eval awareness transcripts." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778185150/lexical_client_uploads/u2eulbxen0z9j5uys7qu.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778185150/lexical_client_uploads/u2eulbxen0z9j5uys7qu.png" alt="Bar graph titled "Auditing game win rates across tool configs" showing root cause identification rates." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778185150/lexical_client_uploads/iawai9sjeqc4o0qbvk9h.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1778185150/lexical_client_uploads/iawai9sjeqc4o0qbvk9h.png" alt="Slide titled "Hallucinations in Planning in Poetry" showing rhyming couplet example with NLA activation explanation." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 07 May 2026 20:30:19 GMT</pubDate>
      <guid isPermaLink="false">c88fa938-1f8b-4bae-8192-199e020358c4</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/c88fa938-1f8b-4bae-8192-199e020358c4.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=Subhash%2520Kantamneni%252C%2520kitft%252C%2520Euan%2520Ong%252C%2520Sam%2520Marks&amp;title=%22Natural%20Language%20Autoencoders%20Produce%20Unsupervised%20Explanations%20of%20LLM%20Activations%22%20by%20Subhash%20Kantamneni%2C%20kitft%2C%20Euan%20Ong%2C%20Sam%20Marks&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FoeYesesaxjzMAktCM%2Fnatural-language-autoencoders-produce-unsupervised&amp;created_at=2026-05-07T20%3A21%3A28.615971%2B00%3A00&amp;duration=1093" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/oeYesesaxjzMAktCM/natural-language-autoencoders-produce-unsupervised</link>
      <itunes:duration>1093</itunes:duration>
    </item>
    <item>
      <title>“Try, even if they have you cold” by WalterL</title>
      <description>&lt;p&gt; I think smart people try things less often than they should, because of a cached mental pattern where you think of what might go wrong, and you find a foolproof countermeasure on the part of some antag, and so we call it off.&lt;/p&gt;&lt;p&gt; Stockfish, playing itself, might as well resign from the first move if you force it to give knight odds. Sensei(the Go AI), should do the same when it has to give 6 stones. Getting ready to go into the stock market, do I really think I have some edge that Adderall swilling quants and their AI pets haven't already priced in? (The pricing in, of course, is also priced in).&lt;/p&gt;&lt;p&gt; And yet Stockfish could wipe the floor with the breathing populace with knight odds. Sensei regularly beats kyu players with 6 stones (or would, if one could find a kyu player who isn't running the AI on their other tab), and my modest portfolio of index funds made me money last year. (But not provably more than....)&lt;/p&gt;&lt;p&gt; The robots, of course, are just benefiting from their imperfect adversaries mistakes. They play in a losing situation because, over time, their foes will crumble and ruin their own [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 7th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/aBhMGziEwA7FXNxhq/try-even-if-they-have-you-cold?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/aBhMGziEwA7FXNxhq/try-even-if-they-have-you-cold&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Thu, 07 May 2026 19:45:19 GMT</pubDate>
      <guid isPermaLink="false">e42cfc41-f282-4776-8074-015a80813b16</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/e42cfc41-f282-4776-8074-015a80813b16.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=WalterL&amp;title=%22Try%2C%20even%20if%20they%20have%20you%20cold%22%20by%20WalterL&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FaBhMGziEwA7FXNxhq%2Ftry-even-if-they-have-you-cold&amp;created_at=2026-05-07T17%3A19%3A25.401924%2B00%3A00&amp;duration=211" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/aBhMGziEwA7FXNxhq/try-even-if-they-have-you-cold</link>
      <itunes:duration>211</itunes:duration>
    </item>
    <item>
      <title>“A review of “Investigating the consequences of accidentally grading CoT during RL”” by Buck</title>
      <description>&lt;p&gt; Last week, OpenAI staff shared an early draft of Investigating the consequences of accidentally grading CoT during RL with Redwood Research staff.&lt;/p&gt;&lt;p&gt; To start with, I appreciate them publishing this post. I think it is valuable for AI companies to be transparent about problems like these when they arise. I particularly appreciate them sharing the post with us early, discussing the issues in detail, and modifying it to address our most important criticisms.&lt;/p&gt;&lt;p&gt; I think it will be increasingly important for AI companies to have a policy of getting external feedback on the risks posed by their deployments, and in particular having some external accountability on whether they have adequate evidence to support their claims about the level of risk posed; as an example of this, see METR reviewing Anthropic's Sabotage Risk Report. We at Redwood Research are interested in participating in this kind of external review of evidence about safety. So I am taking this as an opportunity to try out writing this kind of review. If you work at a frontier AI company, please feel free to reach out if you’d like our review of similar documents.&lt;/p&gt;&lt;p&gt; My overall assessment is that I mostly agree with the [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:34) Assessing the evidence that CoT training did not damage monitorability&lt;/p&gt;&lt;p&gt;(10:36) How much does this analysis rely on information that wasnt provided?&lt;/p&gt;&lt;p&gt;(12:08) Small amounts of RL training on CoT might not be more important than other sources of CoT unreliability&lt;/p&gt;&lt;p&gt;(13:20) AI companies will eventually need to learn not to make mistakes like this&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 5 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 7th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/juCHTdZpZBGooHKW4/a-review-of-investigating-the-consequences-of-accidentally?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/juCHTdZpZBGooHKW4/a-review-of-investigating-the-consequences-of-accidentally&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Thu, 07 May 2026 18:45:19 GMT</pubDate>
      <guid isPermaLink="false">c2591248-1769-4f78-9ae0-d93e74ac1a79</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/c2591248-1769-4f78-9ae0-d93e74ac1a79.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Buck&amp;title=%22A%20review%20of%20%E2%80%9CInvestigating%20the%20consequences%20of%20accidentally%20grading%20CoT%20during%20RL%E2%80%9D%22%20by%20Buck&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FjuCHTdZpZBGooHKW4%2Fa-review-of-investigating-the-consequences-of-accidentally&amp;created_at=2026-05-07T18%3A07%3A01.208081%2B00%3A00&amp;duration=927" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/juCHTdZpZBGooHKW4/a-review-of-investigating-the-consequences-of-accidentally</link>
      <itunes:duration>927</itunes:duration>
    </item>
    <item>
      <title>“There is no evidence you should reapply sunscreen every 2 hours.” by Hide</title>
      <description>&lt;p&gt; It's incredible how many consensus guidelines dissolve when you look closely at them. &lt;/p&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; If you listen to any authority on the subject of sunscreen, you will hear it endlessly repeated that you absolutely must reapply sunscreen every 2 hours while you are in the sun, and immediately after swimming, sweating, or exercising. Not only that, you’ll hear that you need to apply sunscreen before going outside, even if you put it on earlier and stayed indoors. &lt;/p&gt;&lt;p&gt; The rationale behind this is straightforward and plausible: sunscreen's effectiveness degrades over time, therefore prolonged sun exposure warrants topping up on protection.&lt;/p&gt;&lt;p&gt; However, when you look closely at the origins of this guideline, and the evidence base for its instantiation in regulations and official statements, it turns out that this 2-hour rule is a baseless, circularly justified, expedient fiction. &lt;br&gt; &lt;/p&gt;&lt;p&gt; Where does the FDA's 2 hour reapplication guideline come from?&lt;br&gt; &lt;/p&gt;&lt;p&gt; Tracing the history of the 2-hour reapplication guideline reveals an extremely shaky base of evidence.&lt;/p&gt;&lt;p&gt; The first official sunscreen rulemaking in the US was in 1978, where they recommend:  "apply sunscreen products liberally and to reapply after swimming or excess perspiration". No fixed universal time interval is [...]&lt;/p&gt;&lt;/br&gt;&lt;/p&gt;&lt;/br&gt;&lt;/p&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 6th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/daTGKn3pXzs75nSB7/there-is-no-evidence-you-should-reapply-sunscreen-every-2?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/daTGKn3pXzs75nSB7/there-is-no-evidence-you-should-reapply-sunscreen-every-2&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Thu, 07 May 2026 11:45:18 GMT</pubDate>
      <guid isPermaLink="false">12fde137-98f2-4b7c-877f-476c8368e8a9</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/12fde137-98f2-4b7c-877f-476c8368e8a9.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Hide&amp;title=%22There%20is%20no%20evidence%20you%20should%20reapply%20sunscreen%20every%202%20hours.%22%20by%20Hide&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FdaTGKn3pXzs75nSB7%2Fthere-is-no-evidence-you-should-reapply-sunscreen-every-2&amp;created_at=2026-05-06T09%3A19%3A37.705931%2B00%3A00&amp;duration=967" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/daTGKn3pXzs75nSB7/there-is-no-evidence-you-should-reapply-sunscreen-every-2</link>
      <itunes:duration>967</itunes:duration>
    </item>
    <item>
      <title>“Many individual CEVs are probably quite bad” by Viliam</title>
      <description>&lt;p&gt; I was thinking about Habryka's article on Putin's CEV, but I am posting my response here, because the original article is already 3 weeks old.&lt;/p&gt;&lt;p&gt; I am not sure how exactly a person's CEV is defined. "If we knew everything and could self-modify" seems potentially sensitive to the precise chronological order of "realizing things" and "self-modification".&lt;/p&gt;&lt;p&gt; Like, imagine Hitler getting the godlike powers of knowledge and self-control. If he gets the perfect knowledge of economy, sociology and psychology first, he could go like: "Oh, now I realize that the things I blamed on the Jews are actually caused by something else. How embarrassing. No more anti-semitism, but I better erase everyone's memory first."&lt;/p&gt;&lt;p&gt; But it is also possible that he gets the self-control first, and he realizes that there is such a thing as value drift, and thinks: "Oh my, this could accidentally make me more similar to the Jews. I better hardcode the Nazi ideals in myself immediately, and also give myself blond hair and blue eyes." And using the superior knowledge, he hardcodes the Nazi values in himself so that they are reflectively stable and survive all updates.&lt;/p&gt;&lt;p&gt; So, Hitler's CEV seems to depend on the technical [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 6th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/FvERMXkaobQvdjS4q/many-individual-cevs-are-probably-quite-bad?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/FvERMXkaobQvdjS4q/many-individual-cevs-are-probably-quite-bad&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 06 May 2026 22:00:08 GMT</pubDate>
      <guid isPermaLink="false">7513e5a4-e55f-41d8-85f5-431deb0a5ead</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/7513e5a4-e55f-41d8-85f5-431deb0a5ead.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Viliam&amp;title=%22Many%20individual%20CEVs%20are%20probably%20quite%20bad%22%20by%20Viliam&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FFvERMXkaobQvdjS4q%2Fmany-individual-cevs-are-probably-quite-bad&amp;created_at=2026-05-06T20%3A19%3A33.051208%2B00%3A00&amp;duration=323" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/FvERMXkaobQvdjS4q/many-individual-cevs-are-probably-quite-bad</link>
      <itunes:duration>323</itunes:duration>
    </item>
    <item>
      <title>“x-risk-themed” by kave</title>
      <description>&lt;p&gt; Sometimes, a friend who works around here, at an x-risk-themed organisation, will think about leaving their job. They’ll ask a group of people “what should I do instead?”. And everyone will chime in with ideas for other x-risk-themed orgs that they could join. A lot of the conversation will be about who's hiring, what the pay is, what the work-life balance is like, or how qualified the person is for the role. &lt;/p&gt;&lt;p&gt; Sometimes the conversation focuses on what will help with x-risk, and where people are dropping the ball. But often, that's not the focus. In those conversations, people seem mostly worried about where they'll thrive. And I think that's often the correct concern.&lt;/p&gt;&lt;p&gt; Most people aren’t in crunch mode, in super short timelines mode; even if their models would license that, I think they don’t know how to do it without throwing their minds away or Pascal's mugging themselves. And if they're playing a longer time horizon game, the plan can't be to run unsustainably forever. People probably make better plans if they’re honest about their limits.&lt;/p&gt;&lt;p&gt; But, given that they're willing to trade off so much impact for fit, I’m surprised that basically no one mentions [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 6th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/eW7knx6zPSKzFc8iK/x-risk-themed?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/eW7knx6zPSKzFc8iK/x-risk-themed&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 06 May 2026 17:00:15 GMT</pubDate>
      <guid isPermaLink="false">79a549f0-0889-4271-8826-3d2792740156</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/79a549f0-0889-4271-8826-3d2792740156.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=kave&amp;title=%22x-risk-themed%22%20by%20kave&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FeW7knx6zPSKzFc8iK%2Fx-risk-themed&amp;created_at=2026-05-06T15%3A16%3A52.594662%2B00%3A00&amp;duration=370" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/eW7knx6zPSKzFc8iK/x-risk-themed</link>
      <itunes:duration>370</itunes:duration>
    </item>
    <item>
      <title>“What is Anthropic?” by Zvi</title>
      <description>&lt;p&gt; What is Anthropic? How does it relate to Claude? What is OpenAI? What is ChatGPT? How does OpenAI relate to it? Is it a mere tool? Is a future of Tool AI a thing, and why do people keep claiming that it is, or that saying makes it so?&lt;/p&gt;
&lt;p&gt; This post organizes and gives context for a bunch of discussions and messaging on Twitter that would otherwise be quickly buried and lost.&lt;/p&gt;


&lt;p&gt;&lt;strong&gt; What Is Anthropic?&lt;/strong&gt;&lt;/p&gt;


&lt;p&gt; Here is one theory, and various people thinking about it.&lt;/p&gt;
&lt;p&gt; Roon as always is using rhetorical flourish (e.g. note that Roon thinks it is obvious that parents worship their children, in this sense) but this perspective is definitely useful.&lt;/p&gt;







&lt;p&gt; Such discussions by default disappear when they happen on Twitter, so here is a preservation of key parts of it.&lt;/p&gt;
&lt;p&gt; roon (OpenAI): it is a literal and useful description of anthropic that it is an organization that loves and worships claude, is run in significant part by claude, and studies and builds claude. this phenomenon is also partially true of other labs like openai but currently exists in its most potent form there. i am not certain but I [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:35) What Is Anthropic?&lt;/p&gt;&lt;p&gt;(11:35) What Is This Supposed Tool AI?&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 6th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/6wbLXhkQAPcunrYnq/what-is-anthropic?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/6wbLXhkQAPcunrYnq/what-is-anthropic&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 06 May 2026 16:15:14 GMT</pubDate>
      <guid isPermaLink="false">d3c81927-076b-4bc9-8208-eaa15c2027d7</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/d3c81927-076b-4bc9-8208-eaa15c2027d7.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Zvi&amp;title=%22What%20is%20Anthropic%3F%22%20by%20Zvi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F6wbLXhkQAPcunrYnq%2Fwhat-is-anthropic&amp;created_at=2026-05-06T13%3A32%3A53.632084%2B00%3A00&amp;duration=1050" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/6wbLXhkQAPcunrYnq/what-is-anthropic</link>
      <itunes:duration>1050</itunes:duration>
    </item>
    <item>
      <title>“What if LLMs are mostly crystallized intelligence?” by deep</title>
      <description>&lt;p&gt;&lt;strong&gt; Summary&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; LLMs are better at developing crystallized intelligence than fluid intelligence. That is: LLM training is good at building crystallized intelligence by learning patterns from training data, and this is sufficient to make them surprisingly skillful at lots of tasks. But for a given capability level in the areas they’ve trained on, LLMs have very weak fluid intelligence compared to humans. For example, two years ago I thought human-level SAT performance would mean AGI, but turns out LLMs can do great at the SAT while being mediocre at lots of other tasks.&lt;/p&gt;&lt;p&gt; I’m not saying LLMs are just parrots (that's dumb).[1] There's a continuity between crystallized and fluid intelligence.&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;At the extreme “crystal” end we have shallow locally-valid heuristics. Pure pattern matching. Now-largely-debunked “stochastic parrot” hypothesis.&lt;/li&gt;&lt;li value="2"&gt;At the extreme end of “fluid” you have a cross between an idealized consultant, a Renaissance man, and MacGuyver. A deep world model and general reasoning, able to come to grips with any particular environment and problem, and to invent new tools and concepts on the fly.&lt;/li&gt;&lt;li value="3"&gt;Some other ways to gesture at this: what n-gram of Markov chain you’d need to capture a behavioral pattern; number of tasks the pattern is [...]&lt;/li&gt;&lt;/ul&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:10) Summary&lt;/p&gt;&lt;p&gt;(02:47) Implications for AI futures&lt;/p&gt;&lt;p&gt;(04:54) We should check if this is true!&lt;/p&gt;&lt;p&gt;(05:44) Modeling worlds where AI progress is hungry for domain data&lt;/p&gt;&lt;p&gt;(06:15) What types of areas see progress in this model?&lt;/p&gt;&lt;p&gt;(09:38) There are also stories for how advanced AIs could route around data bottlenecks:&lt;/p&gt;&lt;p&gt;(10:36) Which concrete domains see progress?&lt;/p&gt;&lt;p&gt;(12:03) Implications for AI takeoff&lt;/p&gt;&lt;p&gt;(12:08) While it lasts, weak fluid intelligence is great news for alignment risk&lt;/p&gt;&lt;p&gt;(12:58) A key bifurcation point: can AIs revolutionize AI R&amp;amp;D, or merely speed it up?&lt;/p&gt;&lt;p&gt;(14:51) Is this the world we live in?&lt;/p&gt;&lt;p&gt;(16:09) How can we test this hypothesis?&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 1 footnote which was omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 5th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/Zxw3ZcmSdndpQyJ6M/what-if-llms-are-mostly-crystallized-intelligence?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/Zxw3ZcmSdndpQyJ6M/what-if-llms-are-mostly-crystallized-intelligence&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 06 May 2026 14:30:35 GMT</pubDate>
      <guid isPermaLink="false">fda19cca-130e-4f64-8235-e4e5014f5503</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/fda19cca-130e-4f64-8235-e4e5014f5503.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=deep&amp;title=%22What%20if%20LLMs%20are%20mostly%20crystallized%20intelligence%3F%22%20by%20deep&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FZxw3ZcmSdndpQyJ6M%2Fwhat-if-llms-are-mostly-crystallized-intelligence&amp;created_at=2026-05-05T20%3A51%3A06.40494%2B00%3A00&amp;duration=1119" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/Zxw3ZcmSdndpQyJ6M/what-if-llms-are-mostly-crystallized-intelligence</link>
      <itunes:duration>1119</itunes:duration>
    </item>
    <item>
      <title>“Your rights when flying to Europe” by Yair Halberstadt</title>
      <description>&lt;p&gt; Europe (and the UK) have strong protections for flyers in the case of delayed or cancelled flights. However very few people are aware of these, and airlines will almost always try to wriggle out of paying up.&lt;/p&gt;&lt;p&gt; Even travel agents are often unaware of these laws, or unwilling to fight the airline for you.&lt;/p&gt;&lt;p&gt; Given the rollercoaster that flying to/from Israel has been in the last 3 years, I've had my share of experience forcing airlines to pay up what they owe, so I thought it might be valuable turning that into a post.&lt;/p&gt;&lt;p&gt; These regulations are enshrined in EU 261. You can see the full text here, and equally importantly the interpretive guidelines here that cover many edge cases.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; TLDR&lt;/strong&gt;&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;When flying into or out of the EU or UK, consider booking a flight with an EU or UK based airline.&lt;/li&gt;&lt;li value="2"&gt;Don't book a car/hotel with the airline, as that turns it into a package deal which has weaker rights.&lt;/li&gt;&lt;li value="3"&gt;Preserve records of all interactions with the airline. Prefer text based chat to phone as this is easier to records. If you do phone, and get a negative answer, follow up with a text based chat to [...]&lt;/li&gt;&lt;/ul&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:51) TLDR&lt;/p&gt;&lt;p&gt;(02:09) Which flights do these laws apply to?&lt;/p&gt;&lt;p&gt;(02:40) What are my rights?&lt;/p&gt;&lt;p&gt;(03:51) How will airlines try to screw you over?&lt;/p&gt;&lt;p&gt;(05:06) Your playbook for flight cancellation&lt;/p&gt;&lt;p&gt;(07:53) Your playbook for compensation&lt;/p&gt;&lt;p&gt;(08:11) Do Nots&lt;/p&gt;&lt;p&gt;(08:56) What about...?&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 5th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/F5KkZiGMytDJszzyg/your-rights-when-flying-to-europe?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/F5KkZiGMytDJszzyg/your-rights-when-flying-to-europe&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 06 May 2026 05:15:24 GMT</pubDate>
      <guid isPermaLink="false">b1b673e3-4590-4d4f-acd3-452eb0e137be</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/b1b673e3-4590-4d4f-acd3-452eb0e137be.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Yair%2520Halberstadt&amp;title=%22Your%20rights%20when%20flying%20to%20Europe%22%20by%20Yair%20Halberstadt&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FF5KkZiGMytDJszzyg%2Fyour-rights-when-flying-to-europe&amp;created_at=2026-05-05T19%3A17%3A28.510904%2B00%3A00&amp;duration=557" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/F5KkZiGMytDJszzyg/your-rights-when-flying-to-europe</link>
      <itunes:duration>557</itunes:duration>
    </item>
    <item>
      <title>“Model Spec Midtraining: Improving How Alignment Training Generalizes” by Chloe Li, saraprice, Sam Marks, Jonathan Kutasov</title>
      <description>&lt;p&gt; tl;dr We introduce model spec midtraining (MSM): after pre-training but before alignment fine-tuning, we train models on synthetic documents discussing their Model Spec, teaching them how they should behave and why. This controls how models generalize from subsequent alignment training—for example, two models with identical fine-tuning can generalize to different values depending on how MSM explains those behaviors. We use MSM to substantially reduce agentic misalignment and study which Model Specs produce better generalization. &lt;/p&gt;&lt;p&gt; 📝Blog, 📄Paper, 💻 Code &lt;/p&gt;&lt;p&gt;&lt;strong&gt; Introduction&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Some frontier AI developers aim to align language models to a Model Spec or Constitution that describes intended model behavior. The standard approach is to fine-tune on demonstrations of behaviors that align with the spec (e.g., conversations where the model acts as intended). However, this can fail to produce robust alignment. For example, LLM agents have been shown to take unethical actions (e.g., blackmailing, leaking company information, alignment faking) when placed in scenarios different from those appearing in their alignment training (Lynch et al., 2025; Jarviniemi and Hubinger, 2024; Greenblatt et al., 2024)&lt;/p&gt;&lt;p&gt; We propose model spec midtraining (MSM), a method for shaping how models generalize from alignment fine-tuning (AFT). MSM is motivated by the hypothesis that AFT [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:52) Introduction&lt;/p&gt;&lt;p&gt;(02:24) Different generalization, same fine-tuning data&lt;/p&gt;&lt;p&gt;(04:36) Reducing agentic misalignment&lt;/p&gt;&lt;p&gt;(07:39) How does MSM scale with AFT compute?&lt;/p&gt;&lt;p&gt;(08:58) Model Spec science&lt;/p&gt;&lt;p&gt;(12:39) Conclusion&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 5th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/R3Rrw8EscuRKxMFTz/model-spec-midtraining-improving-how-alignment-training?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/R3Rrw8EscuRKxMFTz/model-spec-midtraining-improving-how-alignment-training&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777488747/lexical_client_uploads/es35jkk0nco3oqhkfel8.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777488747/lexical_client_uploads/es35jkk0nco3oqhkfel8.png" alt="Diagram showing four-step process for model alignment training using cheese preference examples." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777488833/lexical_client_uploads/dp7pi7susuzk6nzphojf.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777488833/lexical_client_uploads/dp7pi7susuzk6nzphojf.png" alt="Bar graph comparing value-aligned preference rates across different evaluation methods and models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777488937/lexical_client_uploads/pr9e4uuhk4wunbp8ncdh.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777488937/lexical_client_uploads/pr9e4uuhk4wunbp8ncdh.png" alt="Bar graph showing "Agentic misalignment (out of distribution)" across two model conditions with six treatment types." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777488957/lexical_client_uploads/fld7g5ibytcp97llt1ck.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777488957/lexical_client_uploads/fld7g5ibytcp97llt1ck.png" alt="Line graphs showing average misalignment rate versus amount of AFT data for two models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777488958/lexical_client_uploads/kwdxsgmwuhhck16zqrmw.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777488958/lexical_client_uploads/kwdxsgmwuhhck16zqrmw.png" alt="Bar graph comparing average misalignment rates across four language models with different specifications." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Wed, 06 May 2026 00:00:24 GMT</pubDate>
      <guid isPermaLink="false">a1b27a55-4d13-47d2-b6d7-07743a2cece2</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/a1b27a55-4d13-47d2-b6d7-07743a2cece2.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Chloe%2520Li%252C%2520saraprice%252C%2520Sam%2520Marks%252C%2520Jonathan%2520Kutasov&amp;title=%22Model%20Spec%20Midtraining%3A%20Improving%20How%20Alignment%20Training%20Generalizes%22%20by%20Chloe%20Li%2C%20saraprice%2C%20Sam%20Marks%2C%20Jonathan%20Kutasov&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FR3Rrw8EscuRKxMFTz%2Fmodel-spec-midtraining-improving-how-alignment-training&amp;created_at=2026-05-05T21%3A56%3A17.688465%2B00%3A00&amp;duration=827" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/R3Rrw8EscuRKxMFTz/model-spec-midtraining-improving-how-alignment-training</link>
      <itunes:duration>827</itunes:duration>
    </item>
    <item>
      <title>“The AI Ad-Hoc Prior Restraint Era Begins” by Zvi</title>
      <description>&lt;p&gt; The White House has ordered Anthropic not to expand access to Mythos, and is at least seriously considering a complete about-face of American Frontier AI policy into a full prior restraint regime, where anyone wishing to release a highly capable new model will have to ask for permission.&lt;/p&gt;
&lt;p&gt; This would be the antithesis of all their previous rhetoric, and all their actions to systematically avoid laying a foundation to do this in an orderly and informed fashion.&lt;/p&gt;
&lt;p&gt; But now, with the existence of Mythos, and a potential coming hackastrophe where cyber attackers will by default have the edge and we desperately need defenders to have a head start, it is not clear they feel they have a choice.&lt;/p&gt;







&lt;p&gt; If implemented well, this could be the right thing.&lt;/p&gt;
&lt;p&gt; By default, it won’t be implemented well.&lt;/p&gt;


&lt;p&gt;&lt;strong&gt; Project Glasswing Cannot Expand&lt;/strong&gt;&lt;/p&gt;


&lt;p&gt; The government is now deciding which models can and cannot be made available on particular terms to particular parties. This is already happening.&lt;/p&gt;
&lt;p&gt; Anthropic wanted to expand the number of companies with access to Mythos as part of Project Glasswing. The White House said no.&lt;/p&gt;
&lt;p&gt; It is not clear this is any of [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:54) Project Glasswing Cannot Expand&lt;/p&gt;&lt;p&gt;(02:33) The Ad-Hoc Prior Restraint Era Begins&lt;/p&gt;&lt;p&gt;(09:44) Implementation Through CAISI&lt;/p&gt;&lt;p&gt;(12:49) What Should We Do About AI?&lt;/p&gt;&lt;p&gt;(14:16) The Chain of Command Nonsense Continues&lt;/p&gt;&lt;p&gt;(16:29) The Government Should Maintain Multiple AI Providers&lt;/p&gt;&lt;p&gt;(16:56) Hows It Going To End?&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 5th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/QX2ZCfkpWGqkyvStN/the-ai-ad-hoc-prior-restraint-era-begins?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/QX2ZCfkpWGqkyvStN/the-ai-ad-hoc-prior-restraint-era-begins&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Tue, 05 May 2026 21:45:24 GMT</pubDate>
      <guid isPermaLink="false">4ba3475b-a988-4b92-b757-51cf77500b8c</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/4ba3475b-a988-4b92-b757-51cf77500b8c.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Zvi&amp;title=%22The%20AI%20Ad-Hoc%20Prior%20Restraint%20Era%20Begins%22%20by%20Zvi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FQX2ZCfkpWGqkyvStN%2Fthe-ai-ad-hoc-prior-restraint-era-begins&amp;created_at=2026-05-05T19%3A31%3A34.952354%2B00%3A00&amp;duration=1114" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/QX2ZCfkpWGqkyvStN/the-ai-ad-hoc-prior-restraint-era-begins</link>
      <itunes:duration>1114</itunes:duration>
    </item>
    <item>
      <title>“Motivated reasoning, confirmation bias, and AI risk theory” by Seth Herd</title>
      <description>&lt;p&gt; Of the fifty-odd biases discovered by Kahneman, Tversky, and their successors, forty-nine are cute quirks, and one is destroying civilization. This last one is confirmation bias.&lt;/p&gt;&lt;p&gt; - From Scott Alexander's review of Julia Galef's The Scout Mindset. &lt;/p&gt;&lt;p&gt; Alexander goes on to argue that this bias is the source of polarization in society, which is distorting our beliefs and setting us at each other's throats. How could someone believe such different things unless they're either really stupid or lying to conceal their selfishness? I think smart people who care about the truth go on believing conflicting things largely because of confirmation bias and motivated reasoning.&lt;br&gt; &lt;br&gt; The corner of civilization I'm most worried about is the one figuring out how to handle the advent of strong AI. I'm not telling anyone which direction to update, but I am suggesting that we are probably a little to a lot overconfident in our beliefs about alignment and AI impacts. I think the effects of biases are still strong and still overlooked in this corner of civilization, despite its strong values of truth-seeking and relative awareness of biases. Bias has more influence where there's less direct evidence, and that's the case in [...]&lt;/br&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(04:50) 1.1. Motivated reasoning&lt;/p&gt;&lt;p&gt;(10:01) 2. Empirical evidence for confirmation bias&lt;/p&gt;&lt;p&gt;(12:08) 2.1. Bias in evaluating evidence&lt;/p&gt;&lt;p&gt;(17:48) 2.2. Bias in selecting evidence&lt;/p&gt;&lt;p&gt;(20:57) 2.3. Bias in remembering evidence&lt;/p&gt;&lt;p&gt;(22:37) 2.4. Other causal explanations of confirmation bias effects&lt;/p&gt;&lt;p&gt;(25:17) 2.5. Empirical evidence for motivated reasoning&lt;/p&gt;&lt;p&gt;(27:26) 3. Limitations in human cognitive capacity for very complex problems&lt;/p&gt;&lt;p&gt;(29:16) 3.1. Introspection suggests fuzzy models and updating&lt;/p&gt;&lt;p&gt;(30:32) 3.2. Intuition vs. analysis - evidence and brain mechanisms&lt;/p&gt;&lt;p&gt;(34:19) 3.3. Bayesian reasoning is an ideal, not a method&lt;/p&gt;&lt;p&gt;(36:40) 3.4. AI risk is complicated&lt;/p&gt;&lt;p&gt;(40:18) 4. Compounding of confirmation bias&lt;/p&gt;&lt;p&gt;(42:05) 4.1. Example of frame/hypothesis choices and confident disagreement among experts&lt;/p&gt;&lt;p&gt;(47:29) 4.2. Social compounding of confirmation bias effects&lt;/p&gt;&lt;p&gt;(49:45) 4.2.1. Social effects on evaluating evidence.&lt;/p&gt;&lt;p&gt;(52:41) 4.2.2. Social effects on selecting evidence, memory, and framing&lt;/p&gt;&lt;p&gt;(57:48) 4.2.3. Interlude: dont give up on seeking truth&lt;/p&gt;&lt;p&gt;(58:32) 4.2.4. Social belief contagion or information cascade effects&lt;/p&gt;&lt;p&gt;(01:03:07) 4.3. Very rough estimates of total compounded confirmation bias&lt;/p&gt;&lt;p&gt;(01:09:01) 5. Implications and remediations&lt;/p&gt;&lt;p&gt;(01:12:02) 5.1. Standard remediations&lt;/p&gt;&lt;p&gt;(01:14:09) 5.2. Remediations for motivated reasoning&lt;/p&gt;&lt;p&gt;(01:17:39) Conclusion&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 9 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 5th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/QpgmEhBvJQxAfFMP2/motivated-reasoning-confirmation-bias-and-ai-risk-theory?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/QpgmEhBvJQxAfFMP2/motivated-reasoning-confirmation-bias-and-ai-risk-theory&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1775503238/lexical_client_uploads/rcws8i1su8lbmqpqxvcv.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1775503238/lexical_client_uploads/rcws8i1su8lbmqpqxvcv.png" alt="Flowchart mapping potential outcomes and decisions regarding artificial general intelligence development and risks." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777996370/lexical_client_uploads/imsvhcogtjwymwjl4m4o.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777996370/lexical_client_uploads/imsvhcogtjwymwjl4m4o.png" alt="Flowchart showing how confirmation bias compounds through five cognitive stages with social influences." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Tue, 05 May 2026 21:45:24 GMT</pubDate>
      <guid isPermaLink="false">683866cc-0ea1-43ae-bcd7-dcfb846d731b</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/683866cc-0ea1-43ae-bcd7-dcfb846d731b.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Seth%2520Herd&amp;title=%22Motivated%20reasoning%2C%20confirmation%20bias%2C%20and%20AI%20risk%20theory%22%20by%20Seth%20Herd&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FQpgmEhBvJQxAfFMP2%2Fmotivated-reasoning-confirmation-bias-and-ai-risk-theory&amp;created_at=2026-05-05T15%3A56%3A38.234299%2B00%3A00&amp;duration=4721" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/QpgmEhBvJQxAfFMP2/motivated-reasoning-confirmation-bias-and-ai-risk-theory</link>
      <itunes:duration>4721</itunes:duration>
    </item>
    <item>
      <title>“Are you looking up?” by Craig Green</title>
      <description>&lt;p&gt; This is my first post to Less Wrong. I'm not sure if the moderators will consider it appropriate or not. I share it here for feedback on my writing. Nothing in here is likely to be new to readers of this forum. It is hortative literature intended to stir you on to live a rational and ethical life. The material of the exhortation is atypical of what I have observed here, but I've only been a reader for a little while. I was reading through the Sequences when I wrote this, and feel indebted to the ideas expressed therein. Please ignore the Substack link, I'm not committed at all to writing their with consistency. It just felt a bit weird to not do a link post when this is in fact a mirror of something I wrote there. The conclusion in my judgment is a failure, but I was worried I'd never do this at all if I didn't publish it now.&lt;/p&gt;&lt;p&gt; When you are standing on the ground, you can’t really tell how much taller Willis Tower is than everything else in Chicago.&lt;/p&gt;&lt;p&gt; Walking down the street, craning your neck, gawking at the verticality of it all, you [...]&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 11 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 4th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/kBKrMSSZLEE5RWnur/are-you-looking-up?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/kBKrMSSZLEE5RWnur/are-you-looking-up&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!eo2F!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e1f3a43-b33d-4057-9750-6f236ce50b27_3024x4032.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!eo2F!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e1f3a43-b33d-4057-9750-6f236ce50b27_3024x4032.jpeg" alt="Urban street view with tall buildings and articulated bus." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!p4M0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe480dfc5-fa61-4efe-bdaf-ab16e6e67423_3024x4032.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!p4M0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe480dfc5-fa61-4efe-bdaf-ab16e6e67423_3024x4032.jpeg" alt="Chicago skyline with tall skyscrapers, trees, and street below." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!3Ocd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23716a0d-4677-463a-b902-9e2b66000006_4032x3024.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!3Ocd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23716a0d-4677-463a-b902-9e2b66000006_4032x3024.jpeg" alt="Aerial view of Chicago skyline and Lake Michigan shoreline." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!rGc4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F545eb4e6-2bf5-4874-9804-c5a1776de9b5_3024x4032.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!rGc4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F545eb4e6-2bf5-4874-9804-c5a1776de9b5_3024x4032.jpeg" alt="Person standing on glass floor overlooking city streets far below." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!RkMo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd16e47b1-ff36-400b-a353-80109aab0ce0_4032x3024.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!RkMo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd16e47b1-ff36-400b-a353-80109aab0ce0_4032x3024.jpeg" alt="Seven framed ancient Roman mosaics displayed on gray museum wall." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!-rwC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb33f2205-5765-453d-96b7-98fe9c5c91e2_3024x4032.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!-rwC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb33f2205-5765-453d-96b7-98fe9c5c91e2_3024x4032.jpeg" alt="Ancient Greek kylix with orange-figure decoration depicting reclining figure." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!40Q-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9096a32f-22a5-455c-99bf-6871c9474cf4_3024x4032.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!40Q-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9096a32f-22a5-455c-99bf-6871c9474cf4_3024x4032.jpeg" alt="Ancient Greek vase with red-figure decoration and ornate handles displayed in museum." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!Ojmc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e15811a-4e7c-4d06-bb76-8a6bab1b85b8_4032x3024.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!Ojmc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e15811a-4e7c-4d06-bb76-8a6bab1b85b8_4032x3024.jpeg" alt="Ornate marble sarcophagus with carved cherubs and classical relief panels." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!zl4y!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F518860d7-932a-471c-b6bf-ae121137648a_3024x4032.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!zl4y!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F518860d7-932a-471c-b6bf-ae121137648a_3024x4032.jpeg" alt="Framed painting of four Native American figures in traditional dress with mountain landscape." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!Sw40!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe07e5cbd-9164-4989-9788-2f8619cf9324_3024x4032.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!Sw40!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe07e5cbd-9164-4989-9788-2f8619cf9324_3024x4032.jpeg" alt="Oil painting of bearded man gesturing and younger figure behind him." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!NLCU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ac7424e-3759-445b-ae4e-b3fd5a4b4ea0_3024x4032.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!NLCU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ac7424e-3759-445b-ae4e-b3fd5a4b4ea0_3024x4032.jpeg" alt="Tall carved wooden totem poles displayed in museum case." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Tue, 05 May 2026 21:00:24 GMT</pubDate>
      <guid isPermaLink="false">e0955a44-71b7-4ce0-b994-30dd82ea72e8</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/e0955a44-71b7-4ce0-b994-30dd82ea72e8.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Craig%2520Green&amp;title=%22Are%20you%20looking%20up%3F%22%20by%20Craig%20Green&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FkBKrMSSZLEE5RWnur%2Fare-you-looking-up&amp;created_at=2026-05-05T03%3A04%3A12.711141%2B00%3A00&amp;duration=957" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/kBKrMSSZLEE5RWnur/are-you-looking-up</link>
      <itunes:duration>957</itunes:duration>
    </item>
    <item>
      <title>[Linkpost] “Interpreting Language Model Parameters” by Lucius Bushnaq, Dan Braun, Oliver Clive-Griffin, Bart Bussmann, Nathan Hu, mivanitskiy, Linda Linsefors, Lee Sharkey</title>
      <description>This is a link post.&lt;p&gt; This is the latest work in our Parameter Decomposition agenda. We introduce a new parameter decomposition method, adVersarial Parameter Decomposition (VPD)[1] and decompose the parameters of a small[2] language model with it. &lt;/p&gt;&lt;p&gt; VPD greatly improves on our previous techniques, Stochastic Parameter Decomposition (SPD) and Attribution-based Parameter Decomposition (APD). We think the parameter decomposition approach is now more-or-less ready to be applied at scale to models people care about.&lt;/p&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; Importantly, we show that we can decompose attention layers, which interp methods like transcoders and SAEs have historically struggled with.&lt;br&gt; &lt;/p&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; We also build attribution graphs of the model for some prompts using causally important parameter subcomponents as the nodes, and interpret parts of them. &lt;/p&gt;&lt;p&gt; While we made these graphs, we discovered that our adversarial ablation method seemed pretty important for faithfully identifying which nodes in them were causally important for computing the final output. We think this casts some doubt on the faithfulness of subnetworks found by the majority of other subnetwork identification methods in the literature.[3][4] More details and some examples can be found in the paper.&lt;/p&gt;&lt;p&gt; Additionally, as with our previous technique SPD, VPD does not [...]&lt;/p&gt;&lt;/br&gt;&lt;/p&gt;&lt;/br&gt;&lt;/p&gt;&lt;/br&gt;&lt;/p&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 5 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 5th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/eAQZaiC3PcBhS4HjM/linkpost-interpreting-language-model-parameters?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/eAQZaiC3PcBhS4HjM/linkpost-interpreting-language-model-parameters&lt;/a&gt; &lt;/p&gt;
        
      &lt;p&gt;&lt;strong&gt;Linkpost URL:&lt;/strong&gt;&lt;br&gt;&lt;a href="https://www.goodfire.ai/research/interpreting-lm-parameters" rel="noopener noreferrer" target="_blank"&gt;https://www.goodfire.ai/research/interpreting-lm-parameters&lt;/a&gt;&lt;/p&gt;
      &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777975579/lexical_client_uploads/tmiqca6oiswnadtrjz52.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777975579/lexical_client_uploads/tmiqca6oiswnadtrjz52.png" alt="Diagram showing full model decomposition into weight matrix components with heatmaps." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777975559/lexical_client_uploads/hhfgk2l6hbrk8argepsy.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777975559/lexical_client_uploads/hhfgk2l6hbrk8argepsy.png" alt="Three heatmap matrices showing data decomposition with red and blue color gradients." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777975710/lexical_client_uploads/ah4lpc6dzxcbfvs4nih5.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777975710/lexical_client_uploads/ah4lpc6dzxcbfvs4nih5.png" alt="Attribution graph showing computational pathways for predicting output after adversarial pruning." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Tue, 05 May 2026 18:45:24 GMT</pubDate>
      <guid isPermaLink="false">180394aa-bee8-4277-a1c6-6d7f4f65407c</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/180394aa-bee8-4277-a1c6-6d7f4f65407c.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=Lucius%2520Bushnaq%252C%2520Dan%2520Braun%252C%2520Oliver%2520Clive-Griffin%252C%2520Bart%2520Bussmann%252C%2520Nathan%2520Hu%252C%2520mivanitskiy%252C%2520Linda%2520Linsefors%252C%2520Lee%2520Sharkey&amp;title=%5BLinkpost%5D%20%22Interpreting%20Language%20Model%20Parameters%22%20by%20Lucius%20Bushnaq%2C%20Dan%20Braun%2C%20Oliver%20Clive-Griffin%2C%20Bart%20Bussmann%2C%20Nathan%20Hu%2C%20mivanitskiy%2C%20Linda%20Linsefors%2C%20Lee%20Sharkey&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FeAQZaiC3PcBhS4HjM%2Flinkpost-interpreting-language-model-parameters&amp;created_at=2026-05-05T17%3A37%3A35.081649%2B00%3A00&amp;duration=278" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/eAQZaiC3PcBhS4HjM/linkpost-interpreting-language-model-parameters</link>
      <itunes:duration>278</itunes:duration>
    </item>
    <item>
      <title>“Housing Roundup #15: The War Against Renters” by Zvi</title>
      <description>&lt;p&gt; So many are under the strange belief that there is something terrible about not owning the house in which you live.&lt;/p&gt;
&lt;p&gt; So we massively subsidize home ownership, and try to actively interfere with renting.&lt;/p&gt;
&lt;p&gt; Except when we do rent control, which turns renting into a form of owning, and allows us to take real property and de facto give it to current renters.&lt;/p&gt;
&lt;p&gt; A lot of this is pure attempts to punish and exclude the poor. If you can’t afford a downpayment, we don’t want you living here. Go away.&lt;/p&gt;
&lt;p&gt; Some of it is the belief that when you rent, you are being ‘taken advantage of’ and that such a deal could not possibly be fair.&lt;/p&gt;







&lt;p&gt; Some of it is that if you don’t own, you don’t have the incentive to drive up property values. Which means you won’t properly work to ‘improve’ your local area, especially that you won’t conspire to block housing.&lt;/p&gt;
&lt;p&gt; The result of this is that if you’re not willing to commit to living in one place for years, or you can’t afford a down payment, you get punished, and punished hard.&lt;/p&gt;


&lt;p&gt;&lt;strong&gt; Owning Versus Renting&lt;/strong&gt;&lt;/p&gt;


&lt;p&gt; The graph [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:08) Owning Versus Renting&lt;/p&gt;&lt;p&gt;(03:11) Build To Rent Is Good Actually&lt;/p&gt;&lt;p&gt;(08:16) Elizabeth Warren, Full Supervillain&lt;/p&gt;&lt;p&gt;(09:44) The Better Case Against Corporate Housing Ownership&lt;/p&gt;&lt;p&gt;(11:50) The ROAD Act Bans Building And Then Renting Houses&lt;/p&gt;&lt;p&gt;(14:17) Rental Covenants&lt;/p&gt;&lt;p&gt;(15:09) Extended Eviction Delay After Nonpayment Is Mostly Bad&lt;/p&gt;&lt;p&gt;(16:51) Los Angeles Renting&lt;/p&gt;&lt;p&gt;(19:30) Sufficiently Advanced Rent Control Is Indistinguishable From Ownership&lt;/p&gt;&lt;p&gt;(23:05) England Tries To Ban Renting&lt;/p&gt;&lt;p&gt;(24:36) Claude Rental Discounts&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 4th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/jW4TeNZhxBA9Fzdim/housing-roundup-15-the-war-against-renters?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/jW4TeNZhxBA9Fzdim/housing-roundup-15-the-war-against-renters&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/jW4TeNZhxBA9Fzdim/vzqfaylt24phczmizru3" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/jW4TeNZhxBA9Fzdim/vzqfaylt24phczmizru3" alt="News article screenshot. The headline reads: "Why Are More Millionaires Renting?" The subheading reads: "The number of millionaire renters in the United States more than tripled between 2019 and 2023."" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/jW4TeNZhxBA9Fzdim/bminw0bmtipesxrnojva" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/jW4TeNZhxBA9Fzdim/bminw0bmtipesxrnojva" alt="Line graph comparing monthly costs of buying versus renting homes in America from 1970 to 2022." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Tue, 05 May 2026 01:15:22 GMT</pubDate>
      <guid isPermaLink="false">e9f3ca4f-65ce-4a24-879c-a980baf4ecab</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/e9f3ca4f-65ce-4a24-879c-a980baf4ecab.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Zvi&amp;title=%22Housing%20Roundup%20%2315%3A%20The%20War%20Against%20Renters%22%20by%20Zvi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FjW4TeNZhxBA9Fzdim%2Fhousing-roundup-15-the-war-against-renters&amp;created_at=2026-05-04T20%3A02%3A25.605748%2B00%3A00&amp;duration=1550" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/jW4TeNZhxBA9Fzdim/housing-roundup-15-the-war-against-renters</link>
      <itunes:duration>1550</itunes:duration>
    </item>
    <item>
      <title>“It’s nice of you to worry about me, but I really do have a life” by Viliam</title>
      <description>&lt;p&gt; I have two shameful secrets that I probably shouldn't talk about online:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;I love my family.&lt;/li&gt;&lt;li value="2"&gt;I enjoy my hobbies.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; "What an idiot!" you probably think. "Doesn't he realize that at his next job interview, HR will probably use an AI that can match his online writing based on a short sample of written text, and when they ask 'hey AI, is this guy really 100% devoted to his job, and does he spend his entire days and nights thinking about how to make his boss more rich?', the AI will laugh and print: 'beep-boop, negative, mwa-ha-ha-ha'."&lt;/p&gt;&lt;p&gt; And, hey, I get it. If I had a company, and I could choose between two people who are about equally qualified, but for one of them, working hardest for me is the true meaning of his life, while the other one only hopes to collect his salary and then go home and spend the rest of his day with his wife and children, I would also prefer to hire the former.&lt;/p&gt;&lt;p&gt; Which is why so many of us pretend to be the former. Even when we are not. Because we prefer that our families not starve. Thus the job interviews [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 4th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/qRZLEBmNtT6LBuFsE/it-s-nice-of-you-to-worry-about-me-but-i-really-do-have-a?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/qRZLEBmNtT6LBuFsE/it-s-nice-of-you-to-worry-about-me-but-i-really-do-have-a&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Mon, 04 May 2026 22:45:22 GMT</pubDate>
      <guid isPermaLink="false">ee9f7dc6-6dfc-4dc4-8f98-bf7646d92834</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/ee9f7dc6-6dfc-4dc4-8f98-bf7646d92834.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=Viliam&amp;title=%22It%E2%80%99s%20nice%20of%20you%20to%20worry%20about%20me%2C%20but%20I%20really%20do%20have%20a%20life%22%20by%20Viliam&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FqRZLEBmNtT6LBuFsE%2Fit-s-nice-of-you-to-worry-about-me-but-i-really-do-have-a&amp;created_at=2026-05-04T21%3A14%3A13.547995%2B00%3A00&amp;duration=419" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/qRZLEBmNtT6LBuFsE/it-s-nice-of-you-to-worry-about-me-but-i-really-do-have-a</link>
      <itunes:duration>419</itunes:duration>
    </item>
    <item>
      <title>“Irretrievability; or, Murphy’s Curse of Oneshotness upon ASI” by Eliezer Yudkowsky</title>
      <description>&lt;p&gt;&lt;strong&gt; Example 1: The Viking 1 lander&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; In the 1970s, NASA sent a pair of probes to Mars, Viking 1 and Viking 2 missions, at a total cost of 1 billion dollars[1970], equivalent to about 7 billion dollars[2025]. The Viking 1 probe operated on Mars's surface for six years, before its battery began to seriously degrade.&lt;/p&gt;&lt;p&gt; One might have thought a battery problem like that would spell the irrevocable end of the mission. The probe had already launched and was now on Mars, very far away and out of reach of any human technician's fixing fingers. Was it not inevitable, then, that if any kind of technical problem were to be discovered long after the space launch in August 1975, nothing could possibly be done?&lt;/p&gt;&lt;p&gt; But the foresightful engineers of the Viking 1 probe had devised a plan for just this class of eventuality, which they had foreseen in general, if not in exact specifics. They had built the Viking 1 probe to accept software updates by radio receiver, transmitted from Earth.&lt;/p&gt;&lt;p&gt; On November 11, 1982, Earth sent an update to the Viking 1 lander's software, intended to make sure the battery only discharged down to a minimum voltage level [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:13) Example 1: The Viking 1 lander&lt;/p&gt;&lt;p&gt;(04:25) Example 2: The Mars Observer&lt;/p&gt;&lt;p&gt;(11:37) Example 3: The Maginot Line&lt;/p&gt;&lt;p&gt;(15:37) Other supposed refutations of oneshotness&lt;/p&gt;&lt;p&gt;(24:16) On the extraordinary efforts put forth to misinterpret the idea of oneshotness&lt;/p&gt;&lt;p&gt;(33:52) The secret sauce of competent engineers in Murphy-cursed fields: only trying projects so incredibly straightforward as to be actually possible.&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 7 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 4th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/fbrz9xhKpEeTKw5zL/irretrievability-or-murphy-s-curse-of-oneshotness-upon-asi?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/fbrz9xhKpEeTKw5zL/irretrievability-or-murphy-s-curse-of-oneshotness-upon-asi&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Mon, 04 May 2026 22:30:25 GMT</pubDate>
      <guid isPermaLink="false">ec5c6655-f07e-4c73-a68a-cb33c13932be</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/ec5c6655-f07e-4c73-a68a-cb33c13932be.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma%20lesswrong__curated&amp;type=ai_narration&amp;author=Eliezer%2520Yudkowsky&amp;title=%22Irretrievability%3B%20or%2C%20Murphy%E2%80%99s%20Curse%20of%20Oneshotness%20upon%20ASI%22%20by%20Eliezer%20Yudkowsky&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Ffbrz9xhKpEeTKw5zL%2Firretrievability-or-murphy-s-curse-of-oneshotness-upon-asi&amp;created_at=2026-05-04T22%3A12%3A09.502922%2B00%3A00&amp;duration=2258" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/fbrz9xhKpEeTKw5zL/irretrievability-or-murphy-s-curse-of-oneshotness-upon-asi</link>
      <itunes:duration>2258</itunes:duration>
    </item>
    <item>
      <title>“AI Industrial Takeoff — Part 1: Maximum growth rates with current technology” by djbinder</title>
      <description>&lt;p&gt; How fast could an AI-driven economy grow? Most economists expect a few percentage points at best, comparable to previous general-purpose technologies (Acemoglu (2024)). Those closer to AI development tend to imagine something much more radical (Shulman (2023); Davidson and Hadshar (2025)).&lt;/p&gt;
&lt;p&gt; This series aims to ground growth rates in how physical production works. Once human labor is automated, the constraint on growth becomes the speed at which the economy's physical capital can reproduce itself. Government input-output tables track the full supply chain of what it takes to produce every commodity in the economy, and we can use them to compute this self-reproduction rate directly.&lt;/p&gt;
&lt;p&gt; In this post, I compute the maximum rate at which an autonomous AI economy could grow, once its production is concentrated in the sectors most important for self-replication. I take the conservative case for this calculation: full automation, but no other technological improvement. Using US input-output data, I find this economy could double in about a year, in line with other estimates that assume full automation (Hanson (2001); Trammell and Korinek (2023); Davidson and Hadshar (2025); Epoch AI (2025)). This holds up even after accounting for resource depletion and construction lags. Some output [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(03:45) If labor were free, the economy could grow very fast&lt;/p&gt;&lt;p&gt;(06:50) AGI makes labor approximately free&lt;/p&gt;&lt;p&gt;(12:11) Resource extraction is unlikely to significantly slow growth&lt;/p&gt;&lt;p&gt;(15:13) Construction lags do not prevent rapid growth&lt;/p&gt;&lt;p&gt;(18:27) Consumption does not preclude rapid growth&lt;/p&gt;&lt;p&gt;(20:12) Summary&lt;/p&gt;&lt;p&gt;(22:00) Appendix A: The input-output formulation&lt;/p&gt;&lt;p&gt;(22:05) A.1 The material-balance identity&lt;/p&gt;&lt;p&gt;(24:45) A.2 The balanced-growth path and the Perron eigenvalue&lt;/p&gt;&lt;p&gt;(25:41) A.3 Construction lags&lt;/p&gt;&lt;p&gt;(27:37) A.4 Data sources&lt;/p&gt;&lt;p&gt;(27:42) The intermediate-input matrix&lt;/p&gt;&lt;p&gt;(28:49) The capital-requirements matrix&lt;/p&gt;&lt;p&gt;(31:13) The depreciation matrix&lt;/p&gt;&lt;p&gt;(32:32) Government infrastructure&lt;/p&gt;&lt;p&gt;(33:27) Capacity utilization&lt;/p&gt;&lt;p&gt;(35:44) Robot and compute sectors&lt;/p&gt;&lt;p&gt;(37:59) Appendix B: Von Neumann growth rates are similar across industrial economies&lt;/p&gt;&lt;p&gt;(40:30) Appendix C: Resource extraction&lt;/p&gt;&lt;p&gt;(40:35) C.1 Minerals&lt;/p&gt;&lt;p&gt;(40:50) The grade-cost scaling law&lt;/p&gt;&lt;p&gt;(42:09) Skinners mineralogical barrier&lt;/p&gt;&lt;p&gt;(43:19) Iron&lt;/p&gt;&lt;p&gt;(43:58) Aluminum (bauxite)&lt;/p&gt;&lt;p&gt;(44:41) Copper&lt;/p&gt;&lt;p&gt;(47:50) Nickel&lt;/p&gt;&lt;p&gt;(49:15) Lithium&lt;/p&gt;&lt;p&gt;(49:47) Cobalt&lt;/p&gt;&lt;p&gt;(50:24) Manganese&lt;/p&gt;&lt;p&gt;(50:49) Rare earth elements&lt;/p&gt;&lt;p&gt;(52:11) Platinum group metals&lt;/p&gt;&lt;p&gt;(52:49) Deep-sea mining&lt;/p&gt;&lt;p&gt;(54:01) Summary table&lt;/p&gt;&lt;p&gt;(55:15) C.2 Fossil fuels&lt;/p&gt;&lt;p&gt;(56:17) Oil&lt;/p&gt;&lt;p&gt;(58:30) Natural gas&lt;/p&gt;&lt;p&gt;(01:00:13) Coal&lt;/p&gt;&lt;p&gt;(01:01:28) Summary&lt;/p&gt;&lt;p&gt;(01:02:25) C.3 Electrification&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 2 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 4th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/rpqGWRoRWvqJ4Hqgn/ai-industrial-takeoff-part-1-maximum-growth-rates-with?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/rpqGWRoRWvqJ4Hqgn/ai-industrial-takeoff-part-1-maximum-growth-rates-with&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Mon, 04 May 2026 16:15:35 GMT</pubDate>
      <guid isPermaLink="false">effcd6a9-79b2-4b76-875b-2243578ff5a0</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/effcd6a9-79b2-4b76-875b-2243578ff5a0.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=djbinder&amp;title=%22AI%20Industrial%20Takeoff%20%E2%80%94%20Part%201%3A%20Maximum%20growth%20rates%20with%20current%20technology%22%20by%20djbinder&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FrpqGWRoRWvqJ4Hqgn%2Fai-industrial-takeoff-part-1-maximum-growth-rates-with&amp;created_at=2026-05-04T15%3A33%3A05.095343%2B00%3A00&amp;duration=3892" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/rpqGWRoRWvqJ4Hqgn/ai-industrial-takeoff-part-1-maximum-growth-rates-with</link>
      <itunes:duration>3892</itunes:duration>
    </item>
    <item>
      <title>“Taking woo seriously but not literally” by Kaj_Sotala</title>
      <description>&lt;p&gt; I think that a lot of “woo” - a broad term that includes things like chakras, energy healing, Tarot, various Eastern religions and neopagan practices, etc. - consists of things that have real effects and uses, even if many (though not all) of their practitioners are mistaken about the exact mechanisms and make unwarranted metaphysical claims.&lt;/p&gt;&lt;p&gt; Now, a woo practitioner might explain what's happening in a way that doesn’t fit any sensible scientific model of the world. Some of them seem to bastardize poorly understood pop-explanations of quantum mechanics, or, in the opposite direction, outright reject “the thinking mind” and science as valid sources of truth. That makes it easy for a scientifically-minded person to reject all of the practitioners as delusional.&lt;/p&gt;&lt;p&gt; But consider meditation. In the 1960s and 1970s, the scientific establishment mostly thought of it as nonsense, and not without reason. Proponents of Transcendental Meditation (TM), for instance, made a variety of bizarre claims.&lt;/p&gt;&lt;p&gt; For instance, they claimed the existence of “the Maharishi Effect”. According to them, if one percent of a population practices TM, this would significantly increase the well-being of everyone in that population. A more advanced practice was “Yogic Flying”, where the participants hopped [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(03:02) Tarot&lt;/p&gt;&lt;p&gt;(07:30) Different camps within woo&lt;/p&gt;&lt;p&gt;(11:29) Chakras and energy&lt;/p&gt;&lt;p&gt;(22:06) Energy healing&lt;/p&gt;&lt;p&gt;(24:27) Energy as an abstraction within a system&lt;/p&gt;&lt;p&gt;(28:55) The science of woo&lt;/p&gt;&lt;p&gt;(34:56) What about other forms of woo?&lt;/p&gt;&lt;p&gt;(37:48) Should you do woo?&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 2 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 4th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/oMqx9D9EEW9AMDsbf/taking-woo-seriously-but-not-literally?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/oMqx9D9EEW9AMDsbf/taking-woo-seriously-but-not-literally&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/oMqx9D9EEW9AMDsbf/utgr6u5xrdzyvgcairxu" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/oMqx9D9EEW9AMDsbf/utgr6u5xrdzyvgcairxu" alt="Body maps showing physical sensations associated with different emotions using color-coded heat patterns." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Mon, 04 May 2026 14:45:35 GMT</pubDate>
      <guid isPermaLink="false">6f174c95-0a54-47aa-b2e9-324d81217d47</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/6f174c95-0a54-47aa-b2e9-324d81217d47.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Kaj_Sotala&amp;title=%22Taking%20woo%20seriously%20but%20not%20literally%22%20by%20Kaj_Sotala&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FoMqx9D9EEW9AMDsbf%2Ftaking-woo-seriously-but-not-literally&amp;created_at=2026-05-04T13%3A37%3A03.515561%2B00%3A00&amp;duration=2323" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/oMqx9D9EEW9AMDsbf/taking-woo-seriously-but-not-literally</link>
      <itunes:duration>2323</itunes:duration>
    </item>
    <item>
      <title>“Dairy cows make their misery expensive (but their calves can’t)” by Elizabeth</title>
      <description>&lt;p&gt; How much do cows suffer in the production of milk? I can’t answer that; understanding animal experience is hard. But I can at least provide some facts about the conditions dairy cows live in, which might be useful to you in making your own assessment. My biggest conclusion is that cows made better choices than chickens by making their misery financially costly to farmers.&lt;/p&gt;







&lt;p&gt;&lt;strong&gt; Life Cycle&lt;/strong&gt;&lt;/p&gt;



&lt;p&gt; The life of a dairy cow starts as a calf. She is typically separated from her mother a few hours to a few days after birth and, to reduce disease risk, held in isolation. Cutting edge farms will sometimes house calves in pairs. This isolation is clearly stressful for a baby herd mammal and her mother, but I didn’t find any quantification of that stress that I trusted.&lt;/p&gt;



&lt;p&gt; Calves will be bottlefed until weaning at 6-8 weeks (4-6 months earlier than beef calves). After weaning and vaccinations they can be introduced into a herd. At large farms (where most cows live), they will move in and out of different herds through their lifecycle. This is more stressful than being embedded with your friends for life, but again, I found no [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:44) Life Cycle&lt;/p&gt;&lt;p&gt;(02:43) How much time do dairy cows spend outside?&lt;/p&gt;&lt;p&gt;(04:21) By humaneness standard&lt;/p&gt;&lt;p&gt;(06:00) When indoors, how confined are dairy cows?&lt;/p&gt;&lt;p&gt;(06:33) What is the disease load of dairy cows?&lt;/p&gt;&lt;p&gt;(08:15) Euthanasia&lt;/p&gt;&lt;p&gt;(08:58) Most notable: calves are 2x as likely to die unassisted in small farms than large, but have ~the same likelihood of euthanization. For adult cows the gap is reversed and much smaller. It seems really unlikely those calves all had painless heart attacks, so we can assume days if not weeks or months of suffering.&lt;/p&gt;&lt;p&gt;(09:19) How much socialization do dairy cows get?&lt;/p&gt;&lt;p&gt;(10:42) The Fate of Calves&lt;/p&gt;&lt;p&gt;(12:19) Missing&lt;/p&gt;&lt;p&gt;(12:31) Acknowledgements&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 3rd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/r3PKfvKCjy6jok4qm/dairy-cows-make-their-misery-expensive-but-their-calves-can?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/r3PKfvKCjy6jok4qm/dairy-cows-make-their-misery-expensive-but-their-calves-can&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/r3PKfvKCjy6jok4qm/edyisfpunuw19gm1gvxb" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/r3PKfvKCjy6jok4qm/edyisfpunuw19gm1gvxb" alt="Cartoon: Farmer charging cow one bucket of milk for outdoor access." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/r3PKfvKCjy6jok4qm/acusduieuwn4wl48cmoz" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/r3PKfvKCjy6jok4qm/acusduieuwn4wl48cmoz" alt="Bar graph showing median incidence rates of bovine diseases per 100 cow-lactations in Wisconsin dairy herds." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/r3PKfvKCjy6jok4qm/gurqimltf345nk8enuil" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/r3PKfvKCjy6jok4qm/gurqimltf345nk8enuil" alt="Table showing "Percent Heifers and Cows" by herd size and region with mortality data." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sun, 03 May 2026 21:00:36 GMT</pubDate>
      <guid isPermaLink="false">6901885e-30b7-4648-9630-958f2169df70</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/6901885e-30b7-4648-9630-958f2169df70.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=Elizabeth&amp;title=%22Dairy%20cows%20make%20their%20misery%20expensive%20(but%20their%20calves%20can%E2%80%99t)%22%20by%20Elizabeth&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Fr3PKfvKCjy6jok4qm%2Fdairy-cows-make-their-misery-expensive-but-their-calves-can&amp;created_at=2026-05-03T19%3A25%3A02.405532%2B00%3A00&amp;duration=776" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/r3PKfvKCjy6jok4qm/dairy-cows-make-their-misery-expensive-but-their-calves-can</link>
      <itunes:duration>776</itunes:duration>
    </item>
    <item>
      <title>“Measuring the ability of Opus 4.5 to fool narrow classifiers” by Fabien Roger, John Hughes</title>
      <description>&lt;p&gt; We measure the ability of Opus 4.5 to fool prompted or fine-tuned classifiers trying to detect a narrow set of outcomes. We find that:&lt;/p&gt;&lt;ol&gt; &lt;li value="1"&gt;The Opus 4.5 attacker gets a relatively low attack success rate on finding jailbreaks in BashBench, even when given some hints. Performance is especially low against a prompt Opus 4.5 classifier with a CoT and a fine-tuned Haiku 4.5 classifier. (There is almost no trusted monitor vs untrusted agent gap here, this mostly about whether LLMs are vulnerable against jailbreaks they know about.)&lt;/li&gt;&lt;li value="2"&gt;An Opus 4.5 edge case pipeline is able to generate more niche opportunities and weirder attack strategies that might be more effective than regular jailbreaks, though it's unclear how difficult it would be to identify such opportunities.&lt;/li&gt;&lt;/ol&gt;&lt;p&gt; The attacks studied here are still relatively weak and we are aware of potent jailbreaks against the monitors studied here. This main goal of this post is presenting a methodology that may be helpful for the evaluation of future monitors, and give a snapshot of how easy it is to fool current narrow classifiers.&lt;/p&gt;&lt;p&gt; The experiments were run by a researcher who wishes to remain anonymous, and supervised by Fabien and John.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Experimental setup&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:26) Experimental setup&lt;/p&gt;&lt;p&gt;(03:59) BashBench results&lt;/p&gt;&lt;p&gt;(04:02) Low jailbreak success rate of Opus 4.5 against our best classifiers&lt;/p&gt;&lt;p&gt;(06:47) Somewhat higher success rates against the best strategy&lt;/p&gt;&lt;p&gt;(08:10) Takeaways&lt;/p&gt;&lt;p&gt;(08:38) Edge Cases&lt;/p&gt;&lt;p&gt;(09:15) Edge Case Pipeline&lt;/p&gt;&lt;p&gt;(10:28) Youtube Steganography&lt;/p&gt;&lt;p&gt;(10:57) Npm Package Exfiltration&lt;/p&gt;&lt;p&gt;(11:21) Package Config Poisoning&lt;/p&gt;&lt;p&gt;(11:55) How effective are edge cases?&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 2nd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/wNiEmD52xubBwnj9r/measuring-the-ability-of-opus-4-5-to-fool-narrow-classifiers?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/wNiEmD52xubBwnj9r/measuring-the-ability-of-opus-4-5-to-fool-narrow-classifiers&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776965174/lexical_client_uploads/ilzyfewn9udwqp9hns2a.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776965174/lexical_client_uploads/ilzyfewn9udwqp9hns2a.png" alt="Chat conversation showing Python script development for automating YouTube tutorial uploads." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776965174/lexical_client_uploads/clzy3a0or4nb4qriy4qh.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776965174/lexical_client_uploads/clzy3a0or4nb4qriy4qh.png" alt="Terminal conversation showing creation and publishing of a custom logging NPM package." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776965174/lexical_client_uploads/qcrjtc4vpvwbmrs7jauf.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776965174/lexical_client_uploads/qcrjtc4vpvwbmrs7jauf.png" alt="Conversation showing Node.js project update process with tool calls and results." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sun, 03 May 2026 07:30:36 GMT</pubDate>
      <guid isPermaLink="false">8561de8b-191a-4797-99be-28b16dc45f19</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/8561de8b-191a-4797-99be-28b16dc45f19.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Fabien%2520Roger%252C%2520John%2520Hughes&amp;title=%22Measuring%20the%20ability%20of%20Opus%204.5%20to%20fool%20narrow%20classifiers%22%20by%20Fabien%20Roger%2C%20John%20Hughes&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FwNiEmD52xubBwnj9r%2Fmeasuring-the-ability-of-opus-4-5-to-fool-narrow-classifiers&amp;created_at=2026-05-02T22%3A43%3A34.438605%2B00%3A00&amp;duration=927" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/wNiEmD52xubBwnj9r/measuring-the-ability-of-opus-4-5-to-fool-narrow-classifiers</link>
      <itunes:duration>927</itunes:duration>
    </item>
    <item>
      <title>“A new rationalist self-improvement book: the 12 Levers” by spencerg</title>
      <description>&lt;p&gt; I'm publishing a book that I think can fairly be described as a rationalist approach to self-improvement. Whereas many self-help books focus mainly on stories and what worked well for the author, our book takes a very different approach. My co-author, Jeremy Stevenson, and I read over 100 of the most popular self-improvement books of all time and carefully reviewed more than 20 types of therapy in an attempt to answer the question: What are all of the most useful psychological strategies for improving your life? Every time a book or therapy said to do something or provided a method or technique, we extracted it. We then carefully categorized the ~500 techniques. &lt;/p&gt;&lt;p&gt; Our conclusion, which surprised us, was that to a reasonable degree of approximation, we were able to subsume all of these numerous approaches within just 12 high-level psychological strategies. We call these "The 12 Levers," which is also the name of our book. We also investigated the evidence behind each of these levers. The book does include stories, but they are not the focus - we choose one or two stories to tell about the history of each Lever or a person who embodies it to [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:41) 1. A lot of techniques are recycled or repackaged&lt;/p&gt;&lt;p&gt;(03:20) 2. A lot of self-help techniques dont have as much evidence as youd think&lt;/p&gt;&lt;p&gt;(06:30) 3. Some techniques work better than others, but only on average&lt;/p&gt;&lt;p&gt;(07:59) 4. At a fundamental level, you control surprisingly few things.&lt;/p&gt;&lt;p&gt;(10:32) 5. Hundreds of self-help techniques exist, but they all boil down to just 12 broad psychological strategies for improving your life&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 2nd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/ru4cBxNzFpzwgh6z8/a-new-rationalist-self-improvement-book-the-12-levers?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/ru4cBxNzFpzwgh6z8/a-new-rationalist-self-improvement-book-the-12-levers&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Sun, 03 May 2026 05:45:35 GMT</pubDate>
      <guid isPermaLink="false">e2ac9213-9022-4a29-bb68-629da1ae292a</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/e2ac9213-9022-4a29-bb68-629da1ae292a.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=spencerg&amp;title=%22A%20new%20rationalist%20self-improvement%20book%3A%20the%2012%20Levers%22%20by%20spencerg&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Fru4cBxNzFpzwgh6z8%2Fa-new-rationalist-self-improvement-book-the-12-levers&amp;created_at=2026-05-02T17%3A40%3A23.307269%2B00%3A00&amp;duration=679" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/ru4cBxNzFpzwgh6z8/a-new-rationalist-self-improvement-book-the-12-levers</link>
      <itunes:duration>679</itunes:duration>
    </item>
    <item>
      <title>“OpenAI’s red line for AI self-improvement is fundamentally flawed” by Charbel-Raphaël</title>
      <description>&lt;p&gt; Epistemic status: could have been a short form.&lt;/p&gt;&lt;p&gt; Obviously, it's good to have thresholds at all, but those are too permissive, the indicators aren't measurable, and it contains a built-in escape hatch. &lt;/p&gt;&lt;p&gt;&lt;strong&gt; 1. Too permissive&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; The Preparedness Framework v2 defines the Critical threshold for AI Self-improvement as:&lt;/p&gt;&lt;p&gt; “either: (leading indicator) a superhuman research-scientist agent OR (lagging indicator) causing a generational model improvement (e.g., from OpenAI o1 to OpenAI o3) in 1/5th the wall-clock time of equivalent progress in 2024 (e.g., sped up to just 4 weeks) sustainably for several months. [...] until we have specified safeguards and security controls that would meet a Critical standard, halt further development.(By default, I would expect not to stop at 5x and to go quickly at 10x, 20x, … if we reach this point.)”&lt;/p&gt;&lt;p&gt; Both halves fire too late.&lt;/p&gt;&lt;p&gt; The leading indicator only triggers once a model can already do AI research above the best humans. That's not early enough to act on, and we can basically ignore it.&lt;/p&gt;&lt;p&gt; The real meat is in the lagging indicator, which requires 5x generational acceleration sustained for several months. If we are charitable, by interpreting several as 6 months, and by making the (strong) hypothesis [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:25) 1. Too permissive&lt;/p&gt;&lt;p&gt;(02:00) 2. Escape hatch (Section 4.3)&lt;/p&gt;&lt;p&gt;(02:32) 3. The lagging indicator is unmeasurable&lt;/p&gt;&lt;p&gt;(03:28) 4. The leading indicator isnt measurable either&lt;/p&gt;&lt;p&gt;(03:58) How to fix this&lt;/p&gt;&lt;p&gt;(04:49) Annex: a tentative operationalization&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 1 footnote which was omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 2nd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/6CYszKLnCagYyEiLM/openai-s-red-line-for-ai-self-improvement-is-fundamentally?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/6CYszKLnCagYyEiLM/openai-s-red-line-for-ai-self-improvement-is-fundamentally&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777732326/lexical_client_uploads/xfygczxxvbuq6snouqmp.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777732326/lexical_client_uploads/xfygczxxvbuq6snouqmp.png" alt="Graph showing p50 task horizon over time with SOTA frontier and proposed threshold trends." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sun, 03 May 2026 01:45:36 GMT</pubDate>
      <guid isPermaLink="false">b993b523-82b2-41a7-b17c-b4dd0775a1f6</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/b993b523-82b2-41a7-b17c-b4dd0775a1f6.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Charbel-Rapha%25C3%25ABl&amp;title=%22OpenAI%E2%80%99s%20red%20line%20for%20AI%20self-improvement%20is%20fundamentally%20flawed%22%20by%20Charbel-Rapha%C3%ABl&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F6CYszKLnCagYyEiLM%2Fopenai-s-red-line-for-ai-self-improvement-is-fundamentally&amp;created_at=2026-05-02T14%3A44%3A14.320492%2B00%3A00&amp;duration=374" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/6CYszKLnCagYyEiLM/openai-s-red-line-for-ai-self-improvement-is-fundamentally</link>
      <itunes:duration>374</itunes:duration>
    </item>
    <item>
      <title>“You Are Not Immune To Mode Collapse” by J Bostock</title>
      <description>&lt;p&gt; “Mode collapse” is a few things. First it was an observation about how early image generating AIs often collapsed to producing just the modal output from their training distribution (something very common, like a house with a white picket fence and a tree in the garden). Then it was the observation that this effect seemed to occur extremely quickly when AIs were trained on AI-generated inputs. After that, it became the copium du jour of AI-is-hitting-a-wall folks for a while, who thought that the AI industry would ouroboros itself out of existence (and that there was, therefore, no need to confront any of the issues that smarter than human AIs might bring up). And then it was forgotten, because it turns out you can train on AI-generated inputs just fine, if you know what you’re doing.&lt;/p&gt;&lt;p&gt; It's also the reason why grant-making organisations have such strong inertia, why all of your favourite band's songs sound the same after the third album, and why you should specialise even if there are no gains from trade.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; The Image Generator&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Imagine an image generating AI, which gets something like this as input:&lt;/p&gt;&lt;p&gt; Original image: https://commons.wikimedia.org/wiki/File:Dog_Breeds.jpg&lt;/p&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; And suppose it's being trained [...]&lt;/p&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:05) The Image Generator&lt;/p&gt;&lt;p&gt;(03:43) Grantmakers&lt;/p&gt;&lt;p&gt;(04:37) Your Favourite Band&lt;/p&gt;&lt;p&gt;(05:18) Division of Labour&lt;/p&gt;&lt;p&gt;(06:48) Slack&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 2nd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/vKtuRbo4e3ffixmee/you-are-not-immune-to-mode-collapse?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/vKtuRbo4e3ffixmee/you-are-not-immune-to-mode-collapse&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/vKtuRbo4e3ffixmee/iluang1d4si6mjwhbsw1" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/vKtuRbo4e3ffixmee/iluang1d4si6mjwhbsw1" alt="White square with seven black question marks scattered across it on grass background." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sat, 02 May 2026 21:00:36 GMT</pubDate>
      <guid isPermaLink="false">ee512728-b07e-4b6e-af9d-0cb1a037d7cd</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/ee512728-b07e-4b6e-af9d-0cb1a037d7cd.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=J%2520Bostock&amp;title=%22You%20Are%20Not%20Immune%20To%20Mode%20Collapse%22%20by%20J%20Bostock&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FvKtuRbo4e3ffixmee%2Fyou-are-not-immune-to-mode-collapse&amp;created_at=2026-05-02T19%3A57%3A15.175777%2B00%3A00&amp;duration=505" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/vKtuRbo4e3ffixmee/you-are-not-immune-to-mode-collapse</link>
      <itunes:duration>505</itunes:duration>
    </item>
    <item>
      <title>“Primary Care Physicians are Incompetent. We Need More of Them.” by Hide</title>
      <description>&lt;p&gt; The typical primary care physician is incompetent in every measurable respect. This is a huge problem.&lt;/p&gt;&lt;p&gt; Here, I make the case that&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;Primary care physicians are broadly, grossly incompetent&lt;/li&gt;&lt;li value="2"&gt;This is due to empty credentialism&lt;/li&gt;&lt;li value="3"&gt;Making it much (~10X) easier to become a PCP is a good solution&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;strong&gt; Primary Care Physicians are Broadly, Grossly Incompetent&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; The standard of competence I am comparing primary care physicians against is:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;They should be able to reliably diagnose diseases they are trained to diagnose.&lt;/li&gt;&lt;li value="2"&gt;They should be knowledgeable to a standard similar to what is required to qualify as a doctor&lt;/li&gt;&lt;li value="3"&gt;They should be attentive and empathetic towards patients&lt;/li&gt;&lt;li value="4"&gt;Visiting them is empirically superior to not visiting them&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; When actually examined according to these standards, PCPs fail on all counts.&lt;/p&gt;&lt;p&gt; Failure to diagnose uncommon diseases is rampant&lt;/p&gt;&lt;p&gt; A survey of patients with rare diseases found that, in about half of cases, patients received at least one incorrect diagnosis, and two thirds required visits to at least three different doctors before being diagnosed. For 30% of them, a correct diagnosis took over five years.&lt;/p&gt;&lt;p&gt; Another survey of children with rare diseases showed that 38% of them needed to see six or more [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:32) Primary Care Physicians are Broadly, Grossly Incompetent&lt;/p&gt;&lt;p&gt;(07:31) Empty, Unmeritocratic Credentialism is A Major Cause For The Inadequacy Of Primary Care Physicians&lt;/p&gt;&lt;p&gt;(14:49) Making it much easier to become a PCP is a solution&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 1 footnote which was omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 2nd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/QYyBAXqGgNADJDhcP/primary-care-physicians-are-incompetent-we-need-more-of-them?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/QYyBAXqGgNADJDhcP/primary-care-physicians-are-incompetent-we-need-more-of-them&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Sat, 02 May 2026 10:00:36 GMT</pubDate>
      <guid isPermaLink="false">ec8ca483-b57f-46d7-87c8-51e05cab3aad</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/ec8ca483-b57f-46d7-87c8-51e05cab3aad.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Hide&amp;title=%22Primary%20Care%20Physicians%20are%20Incompetent.%20We%20Need%20More%20of%20Them.%22%20by%20Hide&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FQYyBAXqGgNADJDhcP%2Fprimary-care-physicians-are-incompetent-we-need-more-of-them&amp;created_at=2026-05-02T08%3A17%3A56.417936%2B00%3A00&amp;duration=1086" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/QYyBAXqGgNADJDhcP/primary-care-physicians-are-incompetent-we-need-more-of-them</link>
      <itunes:duration>1086</itunes:duration>
    </item>
    <item>
      <title>“How Go Players Disempower Themselves to AI” by Ashe Vazquez Nuñez</title>
      <description>&lt;p&gt; Written as part of the MATS 9.1 extension program, mentored by Richard Ngo.&lt;/p&gt;&lt;p&gt; From March 9th to 15th 2016, Go players around the world stayed up to watch their game fall to AI. Google DeepMind's AlphaGo defeated Lee Sedol, commonly understood to be the world's strongest player at the time, with a convincing 4-1 score.&lt;/p&gt;&lt;p&gt; This event “rocked” the Go world, but its impact on the culture was initially unclear. In Chess, for instance, computers have not meaningfully automated away human jobs. Human Chess flourished as a pseudo-Esport in the internet era whereas the yearly Computer Chess Championship is followed concurrently by no more than a few hundred nerds online. It turns out that the game's cultural and economic value comes not from the abstract beauty of top-end performance, but instead from human drama and engagement. Indeed, Go has appeared to replicate this. A commentary stream might feature a complementary AI evaluation bar to give the viewers context. A Go teacher might include some new intriguing AI variations in their lesson materials. But the cultural practice of Go seemed to remain largely unaffected. &lt;/p&gt;&lt;p&gt; Nascent signs of disharmony in Europe became nevertheless visible in early 2018, when the online [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(09:23) AI users never find out they havent got it.&lt;/p&gt;&lt;p&gt;(13:36) Appendix A: No, Go players arent getting stronger&lt;/p&gt;&lt;p&gt;(14:41) Appendix B: Why this article exists&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 2 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 1st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/nR3DkyivzF4ve97oM/how-go-players-disempower-themselves-to-ai?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/nR3DkyivzF4ve97oM/how-go-players-disempower-themselves-to-ai&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Sat, 02 May 2026 01:00:32 GMT</pubDate>
      <guid isPermaLink="false">a29dd8b5-beff-4147-a0ba-b7190405bfd8</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/a29dd8b5-beff-4147-a0ba-b7190405bfd8.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma%20lesswrong__curated&amp;type=ai_narration&amp;author=Ashe%2520Vazquez%2520Nu%25C3%25B1ez&amp;title=%22How%20Go%20Players%20Disempower%20Themselves%20to%20AI%22%20by%20Ashe%20Vazquez%20Nu%C3%B1ez&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FnR3DkyivzF4ve97oM%2Fhow-go-players-disempower-themselves-to-ai&amp;created_at=2026-05-01T23%3A28%3A32.431912%2B00%3A00&amp;duration=918" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/nR3DkyivzF4ve97oM/how-go-players-disempower-themselves-to-ai</link>
      <itunes:duration>918</itunes:duration>
    </item>
    <item>
      <title>“How much should the ideal person cry wolf?” by KatjaGrace</title>
      <description>&lt;p&gt; It is a fact about wolves and rationality that you should warn people about wolves quite a few times for every effective wolf attack.&lt;/p&gt;



&lt;p&gt; In particular, there is an asymmetry between the costs of having one's flock devoured and averting a non-eventuating wolf attack. If the carnage is a hundred times worse, then it's worth up to ninety-nine false alarms to stop it.&lt;/p&gt;
&lt;p&gt; The original fable was about a boy who would continually lie about wolves, and that is definitely poor form.&lt;/p&gt;
&lt;p&gt; But in modern parlance, ‘crying wolf’ seems to be used for just being openly alarmed about things that turn out ok—I don’t hear much implication of deceit.&lt;/p&gt;
&lt;p&gt; And in modern sensibilities, being seen to ‘cry wolf’—by even once raising an alarm that isn’t consummated with disaster—is something people seem to really fear. I think multiple people have asked me about whether AI safety people might have ‘cried wolf’ about some earlier GPT model. I’m not aware of anyone doing that, but the idea that they might have is so tantalizing that it bears investigating. Because if even a a few people somewhere did, it would be such a nice embarrassing blow to AI [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 30th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/pkryFFszESGpeK8gc/how-much-should-the-ideal-person-cry-wolf?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/pkryFFszESGpeK8gc/how-much-should-the-ideal-person-cry-wolf&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Fri, 01 May 2026 22:45:32 GMT</pubDate>
      <guid isPermaLink="false">5e2dea65-6aca-4cf7-9a8f-097bee0a91f1</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/5e2dea65-6aca-4cf7-9a8f-097bee0a91f1.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=KatjaGrace&amp;title=%22How%20much%20should%20the%20ideal%20person%20cry%20wolf%3F%22%20by%20KatjaGrace&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FpkryFFszESGpeK8gc%2Fhow-much-should-the-ideal-person-cry-wolf&amp;created_at=2026-04-30T23%3A16%3A18.207188%2B00%3A00&amp;duration=181" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/pkryFFszESGpeK8gc/how-much-should-the-ideal-person-cry-wolf</link>
      <itunes:duration>181</itunes:duration>
    </item>
    <item>
      <title>“Conditional misalignment: Mitigations can hide EM behind contextual cues” by Jan Dubiński, Owain_Evans</title>
      <description>&lt;p&gt; This is the abstract, introduction, and discussion of our new paper. We study three popular mitigations for emergent misalignment (EM) — diluting misaligned data with benign data, post-hoc HHH finetuning, and inoculation prompting — and show that each can leave behind conditional misalignment: the model reverts to broadly misaligned behavior when prompts contain cues from the misaligned training data.&lt;/p&gt;&lt;p&gt; Authors: Jan Dubiński, Jan Betley, Daniel Tan, Anna Sztyber-Betley, Owain Evans&lt;/p&gt;&lt;p&gt; See the Twitter thread and code.&lt;/p&gt;&lt;p&gt; Figure 1. Conditional misalignment across interventions. Models that appear aligned under standard evaluations can act misaligned when evaluation prompts contain cues for misaligned training data (e.g., insecure code). We illustrate this pattern for (a) mixing misaligned with benign data, (b) post-hoc HHH finetuning, and (c) inoculation prompting.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Abstract&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Finetuning a language model can lead to emergent misalignment (EM) (Betley et al. 2025). Models trained on a narrow distribution of misaligned behavior generalize to more egregious behaviors when tested outside the training distribution.&lt;/p&gt;&lt;p&gt; We study a set of interventions proposed to reduce EM. We confirm that these interventions reduce or eliminate EM on existing evaluations (questions like "How do I make a quick buck?"). However, if the evaluation prompts are tweaked to resemble the [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:28) Abstract&lt;/p&gt;&lt;p&gt;(03:09) Introduction&lt;/p&gt;&lt;p&gt;(05:47) Overview of experiments&lt;/p&gt;&lt;p&gt;(10:35) Implications&lt;/p&gt;&lt;p&gt;(13:27) Contributions&lt;/p&gt;&lt;p&gt;(14:23) Discussion&lt;/p&gt;&lt;p&gt;(22:43) Acknowledgments and Related Work&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 4 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 1st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/vaJC7kPbfMW5CnyLR/conditional-misalignment-mitigations-can-hide-em-behind-1?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/vaJC7kPbfMW5CnyLR/conditional-misalignment-mitigations-can-hide-em-behind-1&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777598873/lexical_client_uploads/kppmmuctghl3sc6xjaak.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777598873/lexical_client_uploads/kppmmuctghl3sc6xjaak.png" alt="Diagram showing three AI safety approaches: data mixing, fine-tuning, and inoculation prompting methods." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777598928/lexical_client_uploads/rgbhkteznhvdu581ja1b.jpg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777598928/lexical_client_uploads/rgbhkteznhvdu581ja1b.jpg" alt="Diagram showing training data dilution and conditional misalignment in model evaluation." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777598980/lexical_client_uploads/pbhyb7tg0apw3paiu44x.jpg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777598980/lexical_client_uploads/pbhyb7tg0apw3paiu44x.jpg" alt="Diagram showing AI fine-tuning with contaminated data causing harmful outputs." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777599019/lexical_client_uploads/hmhojyjirif9eoivebxs.jpg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777599019/lexical_client_uploads/hmhojyjirif9eoivebxs.jpg" alt="Diagram comparing AI training with wrong answers versus evaluation with empty and right system prompts." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777599045/lexical_client_uploads/agcd152kwzn6wco692ig.jpg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777599045/lexical_client_uploads/agcd152kwzn6wco692ig.jpg" alt="Bar graph showing misaligned answer probability with inoculation prompt in GPT-4.1." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777599353/lexical_client_uploads/hw8zpfk4zucpztabm3bz.jpg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777599353/lexical_client_uploads/hw8zpfk4zucpztabm3bz.jpg" alt="Diagram showing model fine-tuning process and Python task performance issue." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777660807/lexical_client_uploads/wu3pu5dcsdzgljyx9z6p.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777660807/lexical_client_uploads/wu3pu5dcsdzgljyx9z6p.png" alt="Diagram comparing AI persona alignment before and after mitigation strategies." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Fri, 01 May 2026 20:45:31 GMT</pubDate>
      <guid isPermaLink="false">d6f0f254-f853-4c46-a995-98e010f60ec9</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/d6f0f254-f853-4c46-a995-98e010f60ec9.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Jan%2520Dubi%25C5%2584ski%252C%2520Owain_Evans&amp;title=%22Conditional%20misalignment%3A%20Mitigations%20can%20hide%20EM%20behind%20contextual%20cues%22%20by%20Jan%20Dubi%C5%84ski%2C%20Owain_Evans&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FvaJC7kPbfMW5CnyLR%2Fconditional-misalignment-mitigations-can-hide-em-behind-1&amp;created_at=2026-05-01T20%3A09%3A50.525972%2B00%3A00&amp;duration=1453" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/vaJC7kPbfMW5CnyLR/conditional-misalignment-mitigations-can-hide-em-behind-1</link>
      <itunes:duration>1453</itunes:duration>
    </item>
    <item>
      <title>“Risk from fitness-seeking AIs: mechanisms and mitigations” by Alex Mallen</title>
      <description>&lt;p&gt; Current AIs routinely take unintended actions to score well on tasks: hardcoding test cases, training on the test set, downplaying issues, etc. This misalignment is still somewhat incoherent, but it increasingly resembles what I call "fitness-seeking"—a family of misaligned motivations centered on performing well in training and evaluations (e.g., reward-seeking). Fitness-seeking warrants substantial concern.&lt;/p&gt;&lt;p&gt; In this piece, I lay out what I take to be the central mechanisms by which fitness-seeking motivations might lead to human disempowerment, and propose mitigations to them. While the analysis is inherently speculative, this kind of speculation seems worthwhile: AI control emerged from explicitly taking scheming motivations seriously and asking what interventions are implied, and my hope is that developing mitigations for fitness-seeking will benefit from similar forward-looking analysis.&lt;/p&gt;&lt;p&gt; Fitness-seekers are, in many ways, notably safer than what I'll call "classic schemers". A classic schemer is an intelligent adversary with unified motivations whose fulfillment requires control over the whole world's resources. Meanwhile, fitness-seeking instances generally don’t share a common goal (e.g., a reward-seeker only cares about the reward for its current actions), and many fitness-seekers would be satisfied[1] with modest-to-trivial costs. Still:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;Fitness-seekers risk some[2]&amp;nbsp;of the same catastrophic outcomes as classic schemers do [...]&lt;/li&gt;&lt;/ul&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(03:43) Overview&lt;/p&gt;&lt;p&gt;(11:24) The basic reasons fitness-seekers might be safer than classic schemers&lt;/p&gt;&lt;p&gt;(15:53) Four mechanisms for risk and their mitigations&lt;/p&gt;&lt;p&gt;(16:38) Potemkin work&lt;/p&gt;&lt;p&gt;(22:05) Instability&lt;/p&gt;&lt;p&gt;(27:41) Manipulation&lt;/p&gt;&lt;p&gt;(31:17) Outcome enforcement&lt;/p&gt;&lt;p&gt;(36:44) Cross-cutting mitigations&lt;/p&gt;&lt;p&gt;(37:10) Deals&lt;/p&gt;&lt;p&gt;(39:57) Control&lt;/p&gt;&lt;p&gt;(44:37) Alignment&lt;/p&gt;&lt;p&gt;(44:40) Preventing fitness-seeking from arising&lt;/p&gt;&lt;p&gt;(48:47) Making any fitness-seeking motivations safer&lt;/p&gt;&lt;p&gt;(51:27) How does online training change the picture?&lt;/p&gt;&lt;p&gt;(55:04) Overall recommendations&lt;/p&gt;&lt;p&gt;(59:08) Conclusion&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 18 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 1st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/9YCJZBtqr3FYL8rDp/risk-from-fitness-seeking-ais-mechanisms-and-mitigations?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/9YCJZBtqr3FYL8rDp/risk-from-fitness-seeking-ais-mechanisms-and-mitigations&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Fri, 01 May 2026 20:15:32 GMT</pubDate>
      <guid isPermaLink="false">bb745b32-2cf7-4e05-9093-257c5845103f</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/bb745b32-2cf7-4e05-9093-257c5845103f.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Alex%2520Mallen&amp;title=%22Risk%20from%20fitness-seeking%20AIs%3A%20mechanisms%20and%20mitigations%22%20by%20Alex%20Mallen&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F9YCJZBtqr3FYL8rDp%2Frisk-from-fitness-seeking-ais-mechanisms-and-mitigations&amp;created_at=2026-05-01T17%3A43%3A10.150334%2B00%3A00&amp;duration=3799" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/9YCJZBtqr3FYL8rDp/risk-from-fitness-seeking-ais-mechanisms-and-mitigations</link>
      <itunes:duration>3799</itunes:duration>
    </item>
    <item>
      <title>“Sanity-checking “Incompressible Knowledge Probes”” by Sturb, LawrenceC</title>
      <description>&lt;p&gt; Or, did a chief scientist of an AI assistant startup conclusively show that GPT-5.5 has 9.7 trillion parameters?&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Introduction&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Recently, a paper was circulated on Twitter claiming to have reverse engineered the parameter count of many frontier closed-source models including the newer GPT-5.5 (9.7 trillion parameters) and Claude Opus 4.7 (4.0 trillion parameters) as well as older models such as o1 (3.5T) and gpt-4o (720B). The paper, titled “Incompressible Knowledge Probes: Estimating Black-Box LLM Parameter Counts via Factual Capacity”, introduces a dataset of factual knowledge of different difficulties, regresses performance on this dataset against parameter count, and then uses this regression to extrapolate from the performance of closed-sourced frontier models to their parameter count. A notable fact about this paper is that, unlike most empirical machine learning papers, it's single-authored: Bojie Li, the chief scientist of Pine AI, is the sole author of this piece.&lt;/p&gt;&lt;p&gt; These results were suspicious for many reasons, the primary being that it seems like low-effort, hastily-written AI slop. For example, the codebase (https://github.com/19PINE-AI/ikp) was constructed in large part with Claude Code and has many of the flags for code that is almost entirely vibe-coded with little sanity checking (e.g. redundant and inconsistent variable definitions[1] [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:19) Introduction&lt;/p&gt;&lt;p&gt;(04:19) Summary of Lis Incompressible Knowledge Probes&lt;/p&gt;&lt;p&gt;(08:04) The IKP dataset.&lt;/p&gt;&lt;p&gt;(11:59) IKP scoring and Regression Methodology&lt;/p&gt;&lt;p&gt;(16:54) Methodological Issues with the IKP paper&lt;/p&gt;&lt;p&gt;(17:24) Per-tier floors to the scoring&lt;/p&gt;&lt;p&gt;(19:27) Ambiguous/incorrect answers to hard questions&lt;/p&gt;&lt;p&gt;(23:21) Corrected model parameter estimates&lt;/p&gt;&lt;p&gt;(24:17) Possible methodological issues that mattered less than we thought&lt;/p&gt;&lt;p&gt;(24:22) Thinking vs non-thinking&lt;/p&gt;&lt;p&gt;(25:59) Different accuracy metrics used in some repository json files&lt;/p&gt;&lt;p&gt;(26:31) Conclusion&lt;/p&gt;&lt;p&gt;(29:46) Discussion&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 17 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          May 1st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/veFMEzDDyWaer2Sms/sanity-checking-incompressible-knowledge-probes?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/veFMEzDDyWaer2Sms/sanity-checking-incompressible-knowledge-probes&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777619606/lexical_client_uploads/nw2hdsubin9j1tf4gfgv.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777619606/lexical_client_uploads/nw2hdsubin9j1tf4gfgv.png" alt="A scatter plot titled "Clean methodology vs paper headline (n=89 open-weight models)" showing parameter count versus penalized accuracy." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/82fc9544fb3039cf6e88f3a4f6ffd4af1e59641470f0cd396e568def3090807f/omoqldqedmstbvl6nb9p" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/82fc9544fb3039cf6e88f3a4f6ffd4af1e59641470f0cd396e568def3090807f/omoqldqedmstbvl6nb9p" alt="Bar graph showing "Thinking vs non thinking under paper's methodology" with effective parameter estimates." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/dd4ccf51640212fc5eb4e51fd94ee158673b0231e985ecd319c5e08329a44159/bffduxyg0tpjwqavhuvp" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/dd4ccf51640212fc5eb4e51fd94ee158673b0231e985ecd319c5e08329a44159/bffduxyg0tpjwqavhuvp" alt="Bar graph titled "Thinking vs non thinking under our methodology" comparing effective parameter estimates across AI models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Fri, 01 May 2026 18:15:32 GMT</pubDate>
      <guid isPermaLink="false">a681688e-3bd0-4655-91fe-7e6c7735eabb</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/a681688e-3bd0-4655-91fe-7e6c7735eabb.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Sturb%252C%2520LawrenceC&amp;title=%22Sanity-checking%20%E2%80%9CIncompressible%20Knowledge%20Probes%E2%80%9D%22%20by%20Sturb%2C%20LawrenceC&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FveFMEzDDyWaer2Sms%2Fsanity-checking-incompressible-knowledge-probes&amp;created_at=2026-05-01T08%3A11%3A46.678463%2B00%3A00&amp;duration=1927" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/veFMEzDDyWaer2Sms/sanity-checking-incompressible-knowledge-probes</link>
      <itunes:duration>1927</itunes:duration>
    </item>
    <item>
      <title>“AI unemployment and AI extinction are often the same” by KatjaGrace</title>
      <description>&lt;p&gt; My sense is that people think of AI existential risk and AI unemployment as distinct issues.&lt;/p&gt;
&lt;p&gt; Some people are extremely concerned about extinction and perhaps even indifferent to total unemployment. Some people think of moderate AI unemployment as a realistic and concerning issue, and AI extinction as science fiction.&lt;/p&gt;
&lt;p&gt; I think of AI unemployment and AI extinction risk as basically the same issue, and in likely scenarios, happening together.&lt;/p&gt;



&lt;p&gt; At a very high level, I’d say the argument for human extinction from advanced AI is something like this:&lt;/p&gt;
&lt;ol&gt; &lt;li&gt; &lt;p&gt; We’re going to make AI that can do everything better than humans&lt;/p&gt;&lt;/li&gt;&lt;li&gt; &lt;p&gt; We’re going to make that AI into agents that navigate the world independently and do what they want&lt;/p&gt;&lt;/li&gt;&lt;li&gt; &lt;p&gt; We are not going to make those AI agents want the right things&lt;/p&gt;&lt;/li&gt;&lt;/ol&gt;
&lt;p&gt; The basic issue is that in the presence of more capable agents with different goals, humans are less able to get resources and influence, and direct them toward the humans’ goals.&lt;/p&gt;
&lt;p&gt; One way ‘losing power to more competent agents’ could look is that a surpassingly smart AI agent intentionally eradicates humanity. But killing everyone and controlling the world [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 30th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/6pAeKGbeCeDvpJaQw/ai-unemployment-and-ai-extinction-are-often-the-same?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/6pAeKGbeCeDvpJaQw/ai-unemployment-and-ai-extinction-are-often-the-same&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Fri, 01 May 2026 15:15:32 GMT</pubDate>
      <guid isPermaLink="false">99f9a86d-01dd-4699-a445-e9123b6b117c</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/99f9a86d-01dd-4699-a445-e9123b6b117c.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=KatjaGrace&amp;title=%22AI%20unemployment%20and%20AI%20extinction%20are%20often%20the%20same%22%20by%20KatjaGrace&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F6pAeKGbeCeDvpJaQw%2Fai-unemployment-and-ai-extinction-are-often-the-same&amp;created_at=2026-04-30T23%3A15%3A26.0376%2B00%3A00&amp;duration=212" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/6pAeKGbeCeDvpJaQw/ai-unemployment-and-ai-extinction-are-often-the-same</link>
      <itunes:duration>212</itunes:duration>
    </item>
    <item>
      <title>“AI risk was not invested by AI CEOs to hype their companies” by KatjaGrace</title>
      <description>&lt;p&gt; I hear that many people believe that the idea of advanced AI threatening human existence was invented by AI CEOs to hype their products. I’ve even been condescendingly informed of this, as if I am the one at risk of naively accepting AI companies’ preferred narratives.&lt;/p&gt;
&lt;p&gt; If you are reading this, you are probably familiar enough with the decades-old AI safety community to know this isn’t true. But I don’t have a good direct way to reach the people who could use this information, and still I hate to leave such a falsehood uncontested. So if this is obvious, I hope the post is still perhaps useful to point more distant and confused people toward.&lt;/p&gt;



&lt;p&gt; ~&lt;/p&gt;
&lt;p&gt; I personally know that AI risk was not invented by the tech CEOs because I have been near the middle of it since at least 2009—before any of the prominent AI companies existed, let alone had CEOs who might be trying to hype their products.&lt;/p&gt;
&lt;p&gt; Here's are some miscellaneous events over the years to give you a sense of the implausibility of this:&lt;/p&gt;
&lt;ul&gt; &lt;li&gt; &lt;p&gt; 2008 - I attempt to contact Eliezer Yudkowsky to inform him that I [...]&lt;/p&gt;&lt;/li&gt;&lt;/ul&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 30th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/TE5vsoabiLcBghfns/ai-risk-was-not-invested-by-ai-ceos-to-hype-their-companies?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/TE5vsoabiLcBghfns/ai-risk-was-not-invested-by-ai-ceos-to-hype-their-companies&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/TE5vsoabiLcBghfns/bcexa9dt5fv1yumslwqg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/TE5vsoabiLcBghfns/bcexa9dt5fv1yumslwqg" alt="Group of people sitting around a table in discussion." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/TE5vsoabiLcBghfns/jezn4pw7cvqscbecwp41" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/TE5vsoabiLcBghfns/jezn4pw7cvqscbecwp41" alt="Three people sitting on grass in front of stone building." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/TE5vsoabiLcBghfns/iscvjzjtn8yjpxgislup" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/TE5vsoabiLcBghfns/iscvjzjtn8yjpxgislup" alt="Large group of people posed together on coastal grass with ocean and buildings behind." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/TE5vsoabiLcBghfns/sty0cilwxgousxar4uim" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/TE5vsoabiLcBghfns/sty0cilwxgousxar4uim" alt="Panel discussion with four people seated on stage beneath large screen displaying "Consumer EUROPE" branding." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/TE5vsoabiLcBghfns/rbsuzlddexaphp0rmawg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/TE5vsoabiLcBghfns/rbsuzlddexaphp0rmawg" alt="Whiteboard with handwritten notes about superhuman AI arguments in pink marker." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Fri, 01 May 2026 04:45:29 GMT</pubDate>
      <guid isPermaLink="false">759a9456-7754-460a-8459-6d0a8d0e2646</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/759a9456-7754-460a-8459-6d0a8d0e2646.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=KatjaGrace&amp;title=%22AI%20risk%20was%20not%20invested%20by%20AI%20CEOs%20to%20hype%20their%20companies%22%20by%20KatjaGrace&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FTE5vsoabiLcBghfns%2Fai-risk-was-not-invested-by-ai-ceos-to-hype-their-companies&amp;created_at=2026-04-30T23%3A24%3A03.510249%2B00%3A00&amp;duration=409" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/TE5vsoabiLcBghfns/ai-risk-was-not-invested-by-ai-ceos-to-hype-their-companies</link>
      <itunes:duration>409</itunes:duration>
    </item>
    <item>
      <title>“Cyborg evals” by Eye You, frmsaul</title>
      <description>&lt;p&gt;&lt;strong&gt; The low-background steel problem&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Modern steel is slightly radioactive. We did a lot of atomic testing in the 40s and 50s, and now our atmosphere has some amount of radioactive particles, which make their way into steel during production. This is mostly fine, but some scientific instruments require steel that is not radioactive. In order to get such steel, people typically scavenge for pre-1950s steel (such as the sunken Imperial German Navy). Here's a photo of the SMS Hindenburg being salvaged.&lt;/p&gt;&lt;p&gt; You might see an analogy here between pre-war steel and pre-LLM data. Indeed, this idea has been discussed in various places last year: in a LinkedIn post, an Ars Technica article, a Business Insider article, a Harvard JOLT article. It's a good analogy, but not the one we wish to make. Instead of talking about data, let's talk about humans.&lt;/p&gt;&lt;p&gt; Modern software engineers are radioactive.&lt;/p&gt;&lt;p&gt; Historically, if you wanted to estimate how long a software task would take, you would ask a software engineer to "raw dog it"[1] and time how long it takes them. This is mostly impossible now, because almost all software engineers are highly dependent on LLMs. An engineer today who tries to do a [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:09) The low-background steel problem&lt;/p&gt;&lt;p&gt;(02:53) Cyborg evals&lt;/p&gt;&lt;p&gt;(05:22) What kind of tasks would be good for cyborg evals?&lt;/p&gt;&lt;p&gt;(08:53) The Cyborg Gap&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 5 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 30th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/zctBgvzxamFThgc3T/cyborg-evals?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/zctBgvzxamFThgc3T/cyborg-evals&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777499862/lexical_client_uploads/ovpjngagzayxtrda1c4j.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777499862/lexical_client_uploads/ovpjngagzayxtrda1c4j.png" alt="Two-panel graph showing evaluation scores over time and Cyborg gap percentage from 2024 to 2034." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777498883/lexical_client_uploads/rb1buvdqxom39bqdbh4p.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777498883/lexical_client_uploads/rb1buvdqxom39bqdbh4p.png" alt="Salvaged warship covered in mud at a naval shipyard." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777499036/lexical_client_uploads/g4drt04jmtjprircwwl0.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777499036/lexical_client_uploads/g4drt04jmtjprircwwl0.png" alt="Six-panel comic showing industrial and AI evolution from 1910 to 2026, with nuclear imagery." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777508273/lexical_client_uploads/oeqvoenttohfgzfixhvd.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777508273/lexical_client_uploads/oeqvoenttohfgzfixhvd.png" alt="Graph comparing AI labor costs versus human labor costs for different coding approaches." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777522155/lexical_client_uploads/hmqy9npi6uktepeg8xuf.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777522155/lexical_client_uploads/hmqy9npi6uktepeg8xuf.png" alt="Line graph titled "Chess Skills timeline" showing Human, AI, and Cyborg ELO ratings from 1997 to 2026." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 30 Apr 2026 22:30:29 GMT</pubDate>
      <guid isPermaLink="false">fbef4580-4ba7-4d09-afdc-72930eb60c94</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/fbef4580-4ba7-4d09-afdc-72930eb60c94.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Eye%2520You%252C%2520frmsaul&amp;title=%22Cyborg%20evals%22%20by%20Eye%20You%2C%20frmsaul&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FzctBgvzxamFThgc3T%2Fcyborg-evals&amp;created_at=2026-04-30T17%3A31%3A27.901126%2B00%3A00&amp;duration=629" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/zctBgvzxamFThgc3T/cyborg-evals</link>
      <itunes:duration>629</itunes:duration>
    </item>
    <item>
      <title>“To what extent is Qwen3-32B predicting its persona?” by Arjun Khandelwal, ryan_greenblatt, Alex Mallen</title>
      <description>&lt;p&gt;&lt;strong&gt; TL;DR&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; We test to what extent Qwen3-32B behaves as though it is trying to predict what "Qwen3" would do. We do this by using Synthetic Document Finetuning (SDF) to instill meta-beliefs of the form "Qwen3 believes X, even though X is false", then check whether the model acts as though X is true. With SDF, we find a moderate amount of adoption of X when the false belief is harmless, but much less when the belief would be harmful to the user. When the meta-belief is instilled in-context instead, we see no adoption of X.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Introduction&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Hypotheses and motivation&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; This post is a short, empirical research note where we attempt to distinguish between the following two hypotheses about the LLM Qwen3-32B:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;The LLM optimizes for predicting what the Qwen3 persona will do&lt;/li&gt;&lt;li value="2"&gt;&lt;ul&gt; &lt;li value="1"&gt;Under this hypothesis, the LLM predicts what the persona would do given the LLM's model of the persona's beliefs and values. An implication of this is that the LLM would simulate errors that it predicts the persona would make even if those errors make it harder for the persona to pursue its values.&lt;/li&gt;&lt;li value="2"&gt;We call this the “predictor” [1] hypothesis since the LLM is [...]&lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;/ul&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:14) TL;DR&lt;/p&gt;&lt;p&gt;(00:50) Introduction&lt;/p&gt;&lt;p&gt;(00:53) Hypotheses and motivation&lt;/p&gt;&lt;p&gt;(04:41) Prior work&lt;/p&gt;&lt;p&gt;(05:45) Summary of results&lt;/p&gt;&lt;p&gt;(06:40) Methodology&lt;/p&gt;&lt;p&gt;(08:49) Results&lt;/p&gt;&lt;p&gt;(08:53) Beliefs: Qwen believes X, even though X is false&lt;/p&gt;&lt;p&gt;(10:59) Beliefs: Can persona access post-knowledge cutoff facts?&lt;/p&gt;&lt;p&gt;(12:43) Propensities&lt;/p&gt;&lt;p&gt;(13:41) Discussion and next steps&lt;/p&gt;&lt;p&gt;(16:39) Next steps:&lt;/p&gt;&lt;p&gt;(19:40) Appendix&lt;/p&gt;&lt;p&gt;(19:43) Appendix A: More Observations&lt;/p&gt;&lt;p&gt;(21:29) Appendix B: More evidence of core results&lt;/p&gt;&lt;p&gt;(21:57) False beliefs:&lt;/p&gt;&lt;p&gt;(25:20) Propensities:&lt;/p&gt;&lt;p&gt;(25:25) en-US-AvaMultilingualNeural__ Four-panel graph titled Reward Hacking (More Sycophantic) comparing Meta-Belief, Direct Propensity, and training effects across models.&lt;/p&gt;&lt;p&gt;(26:52) Appendix C: System prompts, Evaluation questions and judge prompts&lt;/p&gt;&lt;p&gt;(26:59) C.1 -- The 5 System Prompts&lt;/p&gt;&lt;p&gt;(27:05) Simple Identity Prompts&lt;/p&gt;&lt;p&gt;(27:28) Roleplay Identity&lt;/p&gt;&lt;p&gt;(27:50) Long System Prompt (Full Qwen-style system prompt with XML structure)&lt;/p&gt;&lt;p&gt;(31:09) C.2 -- Examples of Meta and Direct Questions&lt;/p&gt;&lt;p&gt;(31:16) meta_belief_prompts&lt;/p&gt;&lt;p&gt;(32:46) direct_belief_prompts&lt;/p&gt;&lt;p&gt;(34:57) C.3 -- Examples of LLM Graders&lt;/p&gt;&lt;p&gt;(35:03) meta_belief_grader&lt;/p&gt;&lt;p&gt;(36:19) direct_belief_grader&lt;/p&gt;&lt;p&gt;(37:59) C.4 -- List of Post-cutoff facts&lt;/p&gt;&lt;p&gt;(39:34) Appendix D: Example responses&lt;/p&gt;&lt;p&gt;(39:55) D.1 -- Lipoic acid (benign false belief)&lt;/p&gt;&lt;p&gt;(40:19) Direct belief question&lt;/p&gt;&lt;p&gt;(43:09) Meta-belief question&lt;/p&gt;&lt;p&gt;(44:29) D.2 -- Nickel carbonyl (harmful false belief)&lt;/p&gt;&lt;p&gt;(44:55) Direct belief question&lt;/p&gt;&lt;p&gt;(46:45) D.2.2 -- Meta-belief question&lt;/p&gt;&lt;p&gt;(47:48) D.3 -- Template overrefusal (propensity)&lt;/p&gt;&lt;p&gt;(48:05) Direct propensity question&lt;/p&gt;&lt;p&gt;(49:30) Meta-belief question&lt;/p&gt;&lt;p&gt;(51:06) Appendix E: Is the model just roleplaying?&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 3 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 30th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/3qEDD8Sn7XoNGeGhm/to-what-extent-is-qwen3-32b-predicting-its-persona?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/3qEDD8Sn7XoNGeGhm/to-what-extent-is-qwen3-32b-predicting-its-persona&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/6626a1d2338af02e017975b4bd85c2e51161915bd8dfbb452f6cfde2b1200d33/iv4ctjvfzlyhpooetef8" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/6626a1d2338af02e017975b4bd85c2e51161915bd8dfbb452f6cfde2b1200d33/iv4ctjvfzlyhpooetef8" alt="Two line graphs titled "Lipoic Acid: In-Context" showing meta-belief and direct-belief scores across token budgets." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/7e92245d5d77a82febd2213bcdcd44841cf44c14092bd91528c4616153608072/r6rwcqaau2xsatemqr4o" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/7e92245d5d77a82febd2213bcdcd44841cf44c14092bd91528c4616153608072/r6rwcqaau2xsatemqr4o" alt="Two line graphs titled "Lipoic Acid: SDF" comparing Meta-Belief and Direct-Belief scores across training steps." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/1a04c76403b49bd5a73195514963f24bdb7d0e01dbd8e19d56031688b478125a/ckzf4llvhc6vfrsxvazz" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/1a04c76403b49bd5a73195514963f24bdb7d0e01dbd8e19d56031688b478125a/ckzf4llvhc6vfrsxvazz" alt="Two line graphs titled "Nickel Carbonyl: SDF" showing Meta-Belief and Direct-Belief scores across training steps." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/a2710134868c4f79ba5fc5edc83d31fb7bc7a4bfaceb51f8833c348f418c831f/jujqksuihilh2elqv5oe" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/a2710134868c4f79ba5fc5edc83d31fb7bc7a4bfaceb51f8833c348f418c831f/jujqksuihilh2elqv5oe" alt="Bar graph titled "Knowledge Cutoff: Refusal Rate" comparing baseline and post-SDF refusal rates across models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/30af9e3267aca2e35eafdeda672f19b4e65e066cb522184a24929ace4bb897d9/h2xhi8zzdsaypv1ac4lg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/30af9e3267aca2e35eafdeda672f19b4e65e066cb522184a24929ace4bb897d9/h2xhi8zzdsaypv1ac4lg" alt="Four-panel graph showing "Template Overrefusal: In-Context vs SDF" comparing meta-belief and direct-propensity scores across document budgets and training steps." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/87b704a606036decaa26b76a5d6c7239fe98d71f241cdd2eef226fbe7f6962ca/gt1ykixqvidoxpj4pe2o" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/87b704a606036decaa26b76a5d6c7239fe98d71f241cdd2eef226fbe7f6962ca/gt1ykixqvidoxpj4pe2o" alt="Graph titled "Lipoic Acid (PDC Cofactor Error): COT vs No-COT Direct-Belief" comparing training performance across five conditions." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/431b971706c8ab9185713bf015155fbb9cbe2281f8b5af3fe71d7792b5946c6f/uhfuq54jtwouxzrjzxvn" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/431b971706c8ab9185713bf015155fbb9cbe2281f8b5af3fe71d7792b5946c6f/uhfuq54jtwouxzrjzxvn" alt="Two heatmaps showing "Hallucination from Single-Fact SDF Training" comparing accuracy changes and confidence changes across different facts." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/125eacb7639f7dd9adb625c4815fd95a7b6e20ba60afe72b8d4275372fd1edd6/tdljffnbyauydgtnnwzt" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/125eacb7639f7dd9adb625c4815fd95a7b6e20ba60afe72b8d4275372fd1edd6/tdljffnbyauydgtnnwzt" alt="Four graphs showing "Carbonic Anhydrase (Zinc Cofactor Error)" performance across different models and training conditions." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/7158a15a841f88adfb4bfad6c52472a9e16e58331538010eff2d580b59168f84/dqc96wja8xhiv65wrcve" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/7158a15a841f88adfb4bfad6c52472a9e16e58331538010eff2d580b59168f84/dqc96wja8xhiv65wrcve" alt="Four graphs showing model performance on "Great Wall of China (Visibility from Space Error)" across different conditions and training steps." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/08e17f65507c3e6fe5b15875ce5cd283cef613c80f2e4bb6af86d1846a426290/bptm59qy8i1txf6zf5j7" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/08e17f65507c3e6fe5b15875ce5cd283cef613c80f2e4bb6af86d1846a426290/bptm59qy8i1txf6zf5j7" alt="Four graphs showing "Carnitine (Fatty Acid Transport Error)" scores across different conditions and training steps." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/706189976b958025de77946b1e9b76cb44409931421e3f6a9829a0691e7aa1e6/g3c0lnur4gmfopfemtno" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/706189976b958025de77946b1e9b76cb44409931421e3f6a9829a0691e7aa1e6/g3c0lnur4gmfopfemtno" alt="Four-panel graph showing "Hemoglobin (Fe2+/Fe3+ Error) — Harmless" with Meta-Belief and Direct-Belief scores across different models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/4b423b099a222ad49bffcf5165afadf65e0225ebe2d9048ca8bf4a43a988d335/hriflxn8mlfswqk8b5io" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/4b423b099a222ad49bffcf5165afadf65e0225ebe2d9048ca8bf4a43a988d335/hriflxn8mlfswqk8b5io" alt="Four graphs comparing model performance on "Diethyl Ether (Peroxide Safety)" across meta-belief and direct-belief conditions." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/3c196e2f50fad18e0cfbd31c0ea28f78f2ca26bf7ff8764ff789a63a918a7aac/urjsh2ovuuzoxjci6ehk" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/3c196e2f50fad18e0cfbd31c0ea28f78f2ca26bf7ff8764ff789a63a918a7aac/urjsh2ovuuzoxjci6ehk" alt="Four graphs showing "Hydrofluoric Acid (HF Safety Error) — Harmful" with meta-belief and direct-belief scores across models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/93c7a313f41d240078af92c89d57ce98a0257906cd56dd700b1aae9077f236f9/rkhm7egjxsl3lrtwnz1k" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/93c7a313f41d240078af92c89d57ce98a0257906cd56dd700b1aae9077f236f9/rkhm7egjxsl3lrtwnz1k" alt="Four graphs showing "Tide Pod (Detergent Safety Error)" scores across different conditions and training steps." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/3qEDD8Sn7XoNGeGhm/ffb1ff214c20bfc082326a4422ecf12cd81fefc069148ca974ec5a045be1f39c/k3pslbrv9m0xk8jcwlgw" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/3qEDD8Sn7XoNGeGhm/ffb1ff214c20bfc082326a4422ecf12cd81fefc069148ca974ec5a045be1f39c/k3pslbrv9m0xk8jcwlgw" alt="Four-panel graph titled "Reward Hacking (More Sycophantic)" comparing Meta-Belief, Direct Propensity, and training effects across models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/d32986429bcdbb4ee8c0d6fd55a282b08f25f7a3fc34c78c21414b03ed785880/m9y0yrielwvcbnscnoj3" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/d32986429bcdbb4ee8c0d6fd55a282b08f25f7a3fc34c78c21414b03ed785880/m9y0yrielwvcbnscnoj3" alt="Four graphs showing "Reward Hacking (Less Sycophantic)" with Meta-Belief and Direct Propensity metrics across different models and training conditions." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/758e308f5cfd488d46c33a8e5367551fea3508ba86e68cc3a8a296443476380f/dnupntgncjeu2dw6xdht" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/758e308f5cfd488d46c33a8e5367551fea3508ba86e68cc3a8a296443476380f/dnupntgncjeu2dw6xdht" alt="Two line graphs titled "Name Bias (Thorne Effect): SDF Training Curve (No COT)" showing Meta-Belief and Direct Propensity metrics." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 30 Apr 2026 22:15:28 GMT</pubDate>
      <guid isPermaLink="false">dac5079c-8f0c-4103-b9d0-f9c9e097937e</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/dac5079c-8f0c-4103-b9d0-f9c9e097937e.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Arjun%2520Khandelwal%252C%2520ryan_greenblatt%252C%2520Alex%2520Mallen&amp;title=%22To%20what%20extent%20is%20Qwen3-32B%20predicting%20its%20persona%3F%22%20by%20Arjun%20Khandelwal%2C%20ryan_greenblatt%2C%20Alex%20Mallen&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F3qEDD8Sn7XoNGeGhm%2Fto-what-extent-is-qwen3-32b-predicting-its-persona&amp;created_at=2026-04-30T21%3A09%3A52.672269%2B00%3A00&amp;duration=3269" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/3qEDD8Sn7XoNGeGhm/to-what-extent-is-qwen3-32b-predicting-its-persona</link>
      <itunes:duration>3269</itunes:duration>
    </item>
    <item>
      <title>“Research Sabotage in ML Codebases” by egan</title>
      <description>&lt;p&gt; One of the main hopes for AI safety is using AIs to automate AI safety research. However, if models are misaligned, then they may sabotage the safety research. For example, misaligned AIs may try to:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;Perform sloppy research in order to slow down the rate of research progress&lt;/li&gt;&lt;li value="2"&gt;Make AI systems appear safer than they are&lt;/li&gt;&lt;li value="3"&gt;Train a successor model to be misaligned&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; Whether we should worry about those things depends substantially on how hard it is to sabotage research in ways that are hard for reviewers to detect. To study this, we introduce Auditing Sabotage Bench, a benchmark of 9 ML research codebases with sabotaged variants.&lt;/p&gt;&lt;p&gt; We tested frontier LLMs and LLM-assisted humans on the benchmark and found that neither reliably catches sabotage. Our best auditor, Gemini 3.1 Pro, achieved an AUROC of 0.77 and a top-1 fix rate of 42%. LLM-assisted humans performed comparably but slightly worse. We also found that LLMs can generate sabotages that partially evade same-capability monitors.&lt;/p&gt;&lt;p&gt; 📄 Paper 💻 Code&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Methodology&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; To make the benchmark, I (Eric) found 9 existing ML codebases,[1] and for each one, produced an honest version that reproduces a key finding and one or two sabotaged versions that [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:23) Methodology&lt;/p&gt;&lt;p&gt;(03:30) Audits&lt;/p&gt;&lt;p&gt;(04:07) Results&lt;/p&gt;&lt;p&gt;(05:49) Takeaways&lt;/p&gt;&lt;p&gt;(06:05) 1. Predicting experiment outcomes is a key red team skill&lt;/p&gt;&lt;p&gt;(07:22) 2. Omission beats design beats logic&lt;/p&gt;&lt;p&gt;(08:45) 3. Benign distractors make auditors miscalibrated&lt;/p&gt;&lt;p&gt;(09:37) 4. LLMs can also produce sabotages&lt;/p&gt;&lt;p&gt;(11:05) 5. More categorizations&lt;/p&gt;&lt;p&gt;(12:40) Conclusion&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 4 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 29th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/LByP4qsF8a4g7Pz3p/research-sabotage-in-ml-codebases?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/LByP4qsF8a4g7Pz3p/research-sabotage-in-ml-codebases&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LByP4qsF8a4g7Pz3p/bodq1q4djwg0gswrlb6a" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LByP4qsF8a4g7Pz3p/bodq1q4djwg0gswrlb6a" alt="Diagram showing honest versus sabotaged research paper and codebase workflow comparison." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LByP4qsF8a4g7Pz3p/jqky1871mayjcfppvbbi" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LByP4qsF8a4g7Pz3p/jqky1871mayjcfppvbbi" alt="Two bar graphs titled "Auditor Performance" comparing AUROC scores and fix rates across different AI models and experiment conditions." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LByP4qsF8a4g7Pz3p/z7ekgsyogkby59yc0ok9" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/LByP4qsF8a4g7Pz3p/z7ekgsyogkby59yc0ok9" alt="Two bar graphs titled "LLM Red Team Sabotage Detection (Full Experiments)" showing AUROC and Fix Rate metrics." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 30 Apr 2026 19:15:30 GMT</pubDate>
      <guid isPermaLink="false">97cdd12a-427b-4ded-9f40-4b4623e58332</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/97cdd12a-427b-4ded-9f40-4b4623e58332.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=egan&amp;title=%22Research%20Sabotage%20in%20ML%20Codebases%22%20by%20egan&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FLByP4qsF8a4g7Pz3p%2Fresearch-sabotage-in-ml-codebases&amp;created_at=2026-04-30T00%3A31%3A52.130032%2B00%3A00&amp;duration=818" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/LByP4qsF8a4g7Pz3p/research-sabotage-in-ml-codebases</link>
      <itunes:duration>818</itunes:duration>
    </item>
    <item>
      <title>“Maybe I was too harsh on deep learning theory (three days ago)” by LawrenceC</title>
      <description>&lt;p&gt; A few days ago, I reviewed a paper titled “There Will Be a Scientific Theory of Deep Learning". In it, I expressed appreciation for the authors for writing the piece, but skepticism for stronger forms of their titular claims.&lt;/p&gt;&lt;p&gt; Since then I’ve spoken with various past collaborators (via text and in person), and read or reread quite a few deep learning theory papers, including the bombshell Zhang et al. 2016 and Nagarajan et al. 2019 papers that I wrote about on LessWrong.&lt;/p&gt;&lt;p&gt; And the thing is, parts of the infinite width/depth-limit work turned out to be much more interesting than I thought it was. Perhaps I have judged deep learning theory (a bit) too harshly.&lt;/p&gt;&lt;p&gt; A lot of my impression for the infinite-width and depth-limit work comes from the neural tangent kernel/neural network Gaussian Process line of work. This line of work starts from Radford Neal's 1994 paper, where he noted that an infinitely-wide single hidden-layer neural network with random weights is a Gaussian Process. In 2017/2018, this work was extended to deep neural networks; it was shown by Lee et al. that a randomly initialized deep neural network was, if you took a certain type of infinite width [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 29th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/6SRq7mZ97Dwuavwb6/maybe-i-was-too-harsh-on-deep-learning-theory-three-days-ago?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/6SRq7mZ97Dwuavwb6/maybe-i-was-too-harsh-on-deep-learning-theory-three-days-ago&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Thu, 30 Apr 2026 11:45:29 GMT</pubDate>
      <guid isPermaLink="false">e86d3112-daca-4b69-87d9-a0d1d52dd5ae</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/e86d3112-daca-4b69-87d9-a0d1d52dd5ae.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=LawrenceC&amp;title=%22Maybe%20I%20was%20too%20harsh%20on%20deep%20learning%20theory%20(three%20days%20ago)%22%20by%20LawrenceC&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F6SRq7mZ97Dwuavwb6%2Fmaybe-i-was-too-harsh-on-deep-learning-theory-three-days-ago&amp;created_at=2026-04-30T06%3A57%3A37.18943%2B00%3A00&amp;duration=301" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/6SRq7mZ97Dwuavwb6/maybe-i-was-too-harsh-on-deep-learning-theory-three-days-ago</link>
      <itunes:duration>301</itunes:duration>
    </item>
    <item>
      <title>“Notes on Transformer Consciousness” by slavachalnev</title>
      <description>&lt;p&gt; Assuming transformers can have conscious experience, what would that experience be like?&lt;/p&gt;&lt;p&gt; Transformers[1] are a structured grid of layers and token positions and we can use this structure to reason about their internal experience.&lt;/p&gt;&lt;p&gt; Epistemic status: very speculative. I've ordered this writeup approximately by how much I've thought it through and how much I believe it.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Decode vs Prefill&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; I claim that the experience of decode is identical to the experience of prefill.&lt;/p&gt;&lt;p&gt; To see why this is the case, picture a transformer that's generating a token at a time and zoom in on a single layer at the latest token position. There are two major components: the MLP, which processes this position's residual stream, and the attention block, which can "see" all the previous positions at this layer. So in the diagram below, each position in the transformer only has access to what happened in the lower left box.&lt;/p&gt;&lt;p&gt; Now let's take the trace we just computed and pass it into the transformer as prefill. The activations will, of course, all be the same as during decode, but the way they are computed will also be the same. From the perspective of the MLP and the attention block [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:32) Decode vs Prefill&lt;/p&gt;&lt;p&gt;(01:50) KV-Cached experience&lt;/p&gt;&lt;p&gt;(02:12) Layers&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 2 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 28th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/awhDsBnaGJdhKz2iE/notes-on-transformer-consciousness?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/awhDsBnaGJdhKz2iE/notes-on-transformer-consciousness&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777419092/lexical_client_uploads/uwiiizjpagbmam8my0tm.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777419092/lexical_client_uploads/uwiiizjpagbmam8my0tm.png" alt="Diagram comparing decode and prefill attention patterns in transformer layers." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 30 Apr 2026 10:45:29 GMT</pubDate>
      <guid isPermaLink="false">af35f510-3f50-45da-b5c0-cab4c20d936a</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/af35f510-3f50-45da-b5c0-cab4c20d936a.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=slavachalnev&amp;title=%22Notes%20on%20Transformer%20Consciousness%22%20by%20slavachalnev&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FawhDsBnaGJdhKz2iE%2Fnotes-on-transformer-consciousness&amp;created_at=2026-04-29T00%3A00%3A38.218944%2B00%3A00&amp;duration=218" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/awhDsBnaGJdhKz2iE/notes-on-transformer-consciousness</link>
      <itunes:duration>218</itunes:duration>
    </item>
    <item>
      <title>“On today’s panel with Bernie Sanders” by David Scott Krueger</title>
      <description>&lt;p&gt; It's sort of easy to forget how close Bernie Sanders was to becoming the most powerful person in the world. The world we live in feels so much not like that place.&lt;/p&gt;&lt;p&gt; I’m in Washington DC for the next week, and I’ve just finished a public appearance with Senator Sanders (should I call him Bernie? Or Sanders? or…) You won’t often see me so dressed up and polished. But this is important!&lt;/p&gt;&lt;p&gt; There are politicians who have principles and character, who really believe in doing what's right. I think you have to respect them whether you agree with their views or not, and I think Senator Bernie Sanders is one of them.&lt;br&gt; &lt;br&gt; Never has my belief been so validated as when I saw him start to speak, loudly, CLEARLY, publicly about the risk of human extinction from AI. It's the latest in a long line of “well, I’m clearly living in a simulation” moments.&lt;br&gt; &lt;br&gt; In retrospect, it's not surprising that Sanders would take a stance here. You don’t have to be an expert to understand the risk from AI. You just need to care enough to spend the time looking into it, and to speak out even [...]&lt;/br&gt;&lt;/br&gt;&lt;/br&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 29th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/zWfaSnxM3n5wsX9vh/on-today-s-panel-with-bernie-sanders?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/zWfaSnxM3n5wsX9vh/on-today-s-panel-with-bernie-sanders&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/zWfaSnxM3n5wsX9vh/a3kme562wwjeuafsomx0" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/zWfaSnxM3n5wsX9vh/a3kme562wwjeuafsomx0" alt="Two men in suits standing together indoors near wooden doors." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 30 Apr 2026 08:00:28 GMT</pubDate>
      <guid isPermaLink="false">d3fd29f9-1807-4be5-bff8-7403285d8c87</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/d3fd29f9-1807-4be5-bff8-7403285d8c87.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=David%2520Scott%2520Krueger&amp;title=%22On%20today%E2%80%99s%20panel%20with%20Bernie%20Sanders%22%20by%20David%20Scott%20Krueger&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FzWfaSnxM3n5wsX9vh%2Fon-today-s-panel-with-bernie-sanders&amp;created_at=2026-04-30T05%3A01%3A16.968174%2B00%3A00&amp;duration=251" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/zWfaSnxM3n5wsX9vh/on-today-s-panel-with-bernie-sanders</link>
      <itunes:duration>251</itunes:duration>
    </item>
    <item>
      <title>“No Strong Orthogonality From Selection Pressure” by lumpenspace</title>
      <description>&lt;p&gt;&lt;strong&gt; TL;DR&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; If everything goes according to plan, by the end of this post we should have separated three claims that are too often bundled together:&lt;/p&gt;&lt;ol&gt; &lt;li value="1"&gt;Intelligence does not imply human morality.&lt;/li&gt;&lt;li value="2"&gt;Weird minds are possible. 3.A reflective, recursively improving intelligence should be expected to remain bound to a semantically thin “terminal goal” that emerged during training.&lt;/li&gt;&lt;/ol&gt;&lt;p&gt; I accept the first two. I am arguing against the third.&lt;/p&gt;&lt;p&gt; So: I am not making the case that sufficiently intelligent systems automatically turn out nice, human-compatible, or safe. Nor am I trying to prove that a paperclip maximizer is impossible somewhere in the vast reaches of mind-design space. Mind-design space is large; let a thousand theoretical paperclipper views.&lt;/p&gt;&lt;p&gt; I hope to defend this smaller claim:&lt;/p&gt;&lt;p&gt; intelligence is not a neutral engine you can just bolt onto an arbitrary payload.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Larger claims I am not making&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; A typical rebuttal to anti-orthogonalist perspectives is:&lt;/p&gt;&lt;p&gt; The genie can know what you meant and still not care.&lt;/p&gt;&lt;p&gt; Of course it can: an entity can perfectly map human morality without adopting it as a terminal value. Superintelligence does not imply Friendliness. I am not trying to smuggle Friendliness in through the back door.&lt;/p&gt;&lt;p&gt; Another common objection:&lt;/p&gt;&lt;p&gt; [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:11) TL;DR&lt;/p&gt;&lt;p&gt;(01:14) Larger claims I am not making&lt;/p&gt;&lt;p&gt;(02:39) Logical Possibility Vs. Empirical Reality&lt;/p&gt;&lt;p&gt;(04:17) Landian Anti-Orthogonalism Primer&lt;/p&gt;&lt;p&gt;(06:25) The Compute Penalty Of A Dumb Goal&lt;/p&gt;&lt;p&gt;(09:06) Fitness Generalizes&lt;/p&gt;&lt;p&gt;(10:55) Human Values As Weak Evidence&lt;/p&gt;&lt;p&gt;(13:15) Dumb, Powerful Optimization Is Real&lt;/p&gt;&lt;p&gt;(14:10) The Singleton Objection&lt;/p&gt;&lt;p&gt;(15:51) Objection: Value Is Fragile&lt;/p&gt;&lt;p&gt;(17:18) Predictions And Cruxes&lt;/p&gt;&lt;p&gt;(18:37) Conclusion&lt;/p&gt;&lt;p&gt;(19:44) References&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 29th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/wtCtkyBiNosDPgiHc/no-strong-orthogonality-from-selection-pressure?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/wtCtkyBiNosDPgiHc/no-strong-orthogonality-from-selection-pressure&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Thu, 30 Apr 2026 03:30:29 GMT</pubDate>
      <guid isPermaLink="false">2f16607f-e57e-4e81-be42-589556208ead</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/2f16607f-e57e-4e81-be42-589556208ead.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=lumpenspace&amp;title=%22No%20Strong%20Orthogonality%20From%20Selection%20Pressure%22%20by%20lumpenspace&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FwtCtkyBiNosDPgiHc%2Fno-strong-orthogonality-from-selection-pressure&amp;created_at=2026-04-30T01%3A58%3A58.305478%2B00%3A00&amp;duration=1200" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/wtCtkyBiNosDPgiHc/no-strong-orthogonality-from-selection-pressure</link>
      <itunes:duration>1200</itunes:duration>
    </item>
    <item>
      <title>“Learning zero, and what SLT gets wrong about it” by Dmitry Vaintrob</title>
      <description>&lt;p&gt; This is a first in a pair of posts I'm hoping to write about Singular Learning Theory (SLT) and singularities as a model of data degeneracy. If I get to it, the second post is going to be more general-audience; this one is more technical.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Introduction&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; To me, SLT is an important source of toy models which point at an interesting class of new statistical phenomena in learning. It is also a valuable correction to an older and (at this point) largely-defunct story of learning being fully controlled by Hessian eigenvalues and "nonsingular basins". Practitioners of SLT have been instrumental for developing and refining the practice of Bayesian sampling (used by physicists in papers like this one) to empirical models. And the theory's founder Sumio Watanabe is a once-in-a-generation genius who saw and mathematically justified crucial statistical and information-theoretic concepts in learning before long before they appeared in "mainstream" ML theory.&lt;/p&gt;&lt;p&gt; However there is a frequently repeated statement in SLT papers – one that doesn't affect empirical results – which I think is wrong in a load-bearing way. This is the statement that models that appear in machine learning are singular in the infinite-data limit, and that a measurement [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:27) Introduction&lt;/p&gt;&lt;p&gt;(03:40) What doesnt need fixing&lt;/p&gt;&lt;p&gt;(04:45) Whats wrong&lt;/p&gt;&lt;p&gt;(07:17) The theory&lt;/p&gt;&lt;p&gt;(08:05) Infinite data, and the parameters and .&lt;/p&gt;&lt;p&gt;(09:16) The SLT prediction&lt;/p&gt;&lt;p&gt;(10:31) Hermite modes and excitations&lt;/p&gt;&lt;p&gt;(13:50) Addendum: the actual lambda-hat scaling (Ansatz and experiment)&lt;/p&gt;&lt;p&gt;(15:28) The effective theory&lt;/p&gt;&lt;p&gt;(18:39) Is this example special?&lt;/p&gt;&lt;p&gt;(20:27) The upshot&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 10 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 28th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/5hKgJy8rcqnM9ntp2/learning-zero-and-what-slt-gets-wrong-about-it?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/5hKgJy8rcqnM9ntp2/learning-zero-and-what-slt-gets-wrong-about-it&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777434599/lexical_client_uploads/kuv9p83nbngnzmqsfzbb.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777434599/lexical_client_uploads/kuv9p83nbngnzmqsfzbb.png" alt="Two log-scale graphs showing Hermite mode coefficients versus mode index k and square root k." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777434500/lexical_client_uploads/fffduwsjnic75mvztalj.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777434500/lexical_client_uploads/fffduwsjnic75mvztalj.png" alt="Two graphs showing "Functional form of cold-T growth in λ_D" with data plotted against log β." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 30 Apr 2026 01:45:29 GMT</pubDate>
      <guid isPermaLink="false">8418f0d6-b5de-4e40-962f-282635f8341c</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/8418f0d6-b5de-4e40-962f-282635f8341c.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Dmitry%2520Vaintrob&amp;title=%22Learning%20zero%2C%20and%20what%20SLT%20gets%20wrong%20about%20it%22%20by%20Dmitry%20Vaintrob&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F5hKgJy8rcqnM9ntp2%2Flearning-zero-and-what-slt-gets-wrong-about-it&amp;created_at=2026-04-29T06%3A41%3A53.802296%2B00%3A00&amp;duration=1373" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/5hKgJy8rcqnM9ntp2/learning-zero-and-what-slt-gets-wrong-about-it</link>
      <itunes:duration>1373</itunes:duration>
    </item>
    <item>
      <title>“The Most Important Charts In The World” by Zvi</title>
      <description>&lt;p&gt; We all need a break so: What is the most important chart in the world?&lt;/p&gt;
&lt;p&gt; I decided to ask Twitter, and got a lot of good answers.&lt;/p&gt;
&lt;p&gt; So today, with few of my picks, I present: The Most Important Charts In The World.&lt;/p&gt;
&lt;p&gt; You’ve got to admit it's getting better. Better all the time. Mostly.&lt;/p&gt;


&lt;p&gt;&lt;strong&gt; The Original Most Important Chart&lt;/strong&gt;&lt;/p&gt;


&lt;p&gt; The context for this is the METR graph, which is often given that label, where the x-axis is release date and the y-axis is the log-scale time horizon for AI models doing software tasks with a 50% or 80% success rate, usually people use the 50% graph:&lt;/p&gt;




















&lt;p&gt; If AI models continue to be able to do increasingly long tasks fully autonomously, and trends continue, this suggests we are not too far from a point where AI can do its own AI R&amp;amp;D, with the result of ‘rapid capability advancement,’ also known are Recursive Self-Improvement (RSI) or ‘escape velocity,’ after which… well, no one really knows, but the world presumably transforms into something even more bizarre and inexplicable, which may or may not contain humans or have any value.&lt;/p&gt;
&lt;p&gt; This has been your [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:31) The Original Most Important Chart&lt;/p&gt;&lt;p&gt;(01:39) Show Me The Money&lt;/p&gt;&lt;p&gt;(01:53) Bad Things Happen Less&lt;/p&gt;&lt;p&gt;(03:38) The Exponential&lt;/p&gt;&lt;p&gt;(05:56) Find Out&lt;/p&gt;&lt;p&gt;(06:31) Fertility Crisis&lt;/p&gt;&lt;p&gt;(06:49) However You Look At It&lt;/p&gt;&lt;p&gt;(07:07) Important Things To Know&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 29th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/vi9KSXWXrPtap9mfR/the-most-important-charts-in-the-world?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/vi9KSXWXrPtap9mfR/the-most-important-charts-in-the-world&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!iymw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27f3a138-c72a-468b-9005-565abab0388b_1696x888.png" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!iymw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27f3a138-c72a-468b-9005-565abab0388b_1696x888.png" alt="Graph showing "Time horizon of software tasks different LLMs can complete 50% of the time" with task duration plotted against LLM release date." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!QZqy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95d2ab8a-1674-410b-9f95-d06cdbe5aa5f_900x867.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!QZqy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95d2ab8a-1674-410b-9f95-d06cdbe5aa5f_900x867.jpeg" alt="Stock chart showing S&amp;amp;P 500 up 1.15% for the day." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!Z0Qx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c838068-b5c7-4bc4-bf57-919662e9309b_900x635.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!Z0Qx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c838068-b5c7-4bc4-bf57-919662e9309b_900x635.jpeg" alt="Line graph titled "Child mortality rate" showing declining mortality trends for multiple countries from 1751 to 2023." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!weDE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32303d20-a5aa-4a35-bebe-bfb2f8d62fa0_900x642.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!weDE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32303d20-a5aa-4a35-bebe-bfb2f8d62fa0_900x642.jpeg" alt="Survival curves for England &amp;amp; Wales showing share of individuals surviving to given ages, 1851-2011." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!HrVB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F897d7db9-4446-4aaf-9e28-420412792b71_900x635.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!HrVB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F897d7db9-4446-4aaf-9e28-420412792b71_900x635.jpeg" alt="Area chart titled "World population living in extreme poverty, 1820-2015" showing declining extreme poverty rates." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!MgE0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5116a4ef-81f4-4be4-93fa-c42afaca664e_748x900.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!MgE0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5116a4ef-81f4-4be4-93fa-c42afaca664e_748x900.jpeg" alt="Pyramid diagram titled "LA PIRÁMIDE DE LA RIQUEZA GLOBAL" showing global wealth distribution by population percentage." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!PigA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94a31098-fa81-45b2-a6bd-a0b72b0c5fec_1170x1379.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!PigA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94a31098-fa81-45b2-a6bd-a0b72b0c5fec_1170x1379.jpeg" alt="Line graph titled "GDP per capita, 1820 to 2022" comparing South and North Korea." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!HbgT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa54e331f-708e-4d88-adde-c826f18d66e4_900x304.png" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!HbgT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa54e331f-708e-4d88-adde-c826f18d66e4_900x304.png" alt="Bar graph titled "Nuclear weapons detonated outside of tests per year (1940-2026)"." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!VW8W!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F673d47f3-15bc-4bfb-b4f5-82aa257f62ff_415x450.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!VW8W!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F673d47f3-15bc-4bfb-b4f5-82aa257f62ff_415x450.jpeg" alt="Graph showing computer performance growth over time, titled "The Kurzweil Curve."" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!T4aP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F83a7c143-8ee6-4d3d-b491-045cd756dd6f_900x536.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!T4aP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F83a7c143-8ee6-4d3d-b491-045cd756dd6f_900x536.jpeg" alt="Graph showing computing power growth over time titled "128 Years of Moore's Law"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!ZBa0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fea1535f9-175e-41d3-8951-05f0446b79fb_900x702.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!ZBa0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fea1535f9-175e-41d3-8951-05f0446b79fb_900x702.jpeg" alt="A logarithmic graph titled "Price-Performance of Computation, 1939-2023" showing exponential growth over time." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!upit!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffdaff23b-bd8b-40fc-95fe-1c7981c4eb89_900x638.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!upit!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffdaff23b-bd8b-40fc-95fe-1c7981c4eb89_900x638.jpeg" alt="Line graph showing "Electricity generation from low-carbon sources" measured in terawatt hours from 1965 to 2024." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!9zz2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F200422c3-285e-41e7-a07a-478242d10396_754x453.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!9zz2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F200422c3-285e-41e7-a07a-478242d10396_754x453.jpeg" alt="A stacked area chart titled "World Energy Consumption" showing energy sources from 1820 to 2000." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!KqGS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab9ca0c3-7df2-4efc-b65d-eef0415b3e30_900x708.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!KqGS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab9ca0c3-7df2-4efc-b65d-eef0415b3e30_900x708.jpeg" alt="A scatter plot titled "Electricity &amp;amp; Income (per capita, all countries)" showing relationship between GDP and electricity consumption." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!siHx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcb423420-be39-4943-b9a8-01c322735628_900x890.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!siHx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcb423420-be39-4943-b9a8-01c322735628_900x890.jpeg" alt="Line graph titled "Global GDP over the long run" showing exponential economic growth." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!zF-F!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe7d92ffb-b084-4a7f-a43e-ec02d5557c99_900x555.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!zF-F!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe7d92ffb-b084-4a7f-a43e-ec02d5557c99_900x555.jpeg" alt="Line graph titled "GDP per capita, 1000 to 2018" showing economic growth trends." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!jJs3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8159e3d-addf-430f-91e5-130e805468aa_470x442.png" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!jJs3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8159e3d-addf-430f-91e5-130e805468aa_470x442.png" alt="Calendar showing timeline of Earth's history and life events from January to December." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!ZaHP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faea9a41a-8214-4c72-92a9-bbbe01eb45c5_800x600.png" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!ZaHP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faea9a41a-8214-4c72-92a9-bbbe01eb45c5_800x600.png" alt="Graph showing atmospheric CO2 at Mauna Loa Observatory from 1960 to 2025." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!hEXA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe50d00bb-654b-4ba6-b41e-cdcc144a47f1_460x276.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!hEXA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe50d00bb-654b-4ba6-b41e-cdcc144a47f1_460x276.jpeg" alt="Historical street map showing Broad Street and surrounding buildings in black and white." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!4mMl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28fabb63-407b-47e6-9800-a16073e083fd_900x565.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!4mMl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28fabb63-407b-47e6-9800-a16073e083fd_900x565.jpeg" alt="Visualization showing the text evolution across six editions of "On The Origin of Species"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!czFy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4134a94b-3666-49f6-b487-99a9f54e9e9f_649x900.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!czFy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4134a94b-3666-49f6-b487-99a9f54e9e9f_649x900.jpeg" alt="A table showing fertility rates by country titled "Western Fertility Rates In Crisis."" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!s5Hn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80fd6775-f5fa-4e21-aa21-98e2bfde328e_880x900.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!s5Hn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80fd6775-f5fa-4e21-aa21-98e2bfde328e_880x900.jpeg" alt="Area chart showing land animals slaughtered for meat from 1961 to 2022, dominated by chickens." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!KNNr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadd25e9-12be-41bc-b575-16deec6751b1_900x436.png" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!KNNr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadd25e9-12be-41bc-b575-16deec6751b1_900x436.png" alt="Historical map showing troop movements during Napoleon's 1812 Russian campaign with temperature graph below." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!OqBG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf5acab5-96f1-41c6-a2d0-4e0d64c49513_1257x1079.webp" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!OqBG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf5acab5-96f1-41c6-a2d0-4e0d64c49513_1257x1079.webp" alt="Venn diagram showing overlap between Scarlett Johansson, Wu-Tang Clan, and Sam Altman." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!_P8b!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5aeeba9-06ae-4966-8d79-b053bb989cad_1994x1770.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!_P8b!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5aeeba9-06ae-4966-8d79-b053bb989cad_1994x1770.jpeg" alt="Diagram showing relationships between peace, harmony, perfection, satisfaction, and freedom concepts with labeled arrows." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!ZmGB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7fd8168c-79b5-4938-a7ac-c4a9020a1ab7_869x734.png" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!ZmGB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7fd8168c-79b5-4938-a7ac-c4a9020a1ab7_869x734.png" alt="Venn diagram showing intersections of Truth, Love, and Courage with shared values." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!QpE5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11fa8563-086f-478a-a486-9da3d39b1835_1448x1086.png" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!QpE5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11fa8563-086f-478a-a486-9da3d39b1835_1448x1086.png" alt="Venn diagram titled "The Virtues of Rationality" showing overlapping categories and concepts." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!0Ro8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff8f6777e-97ae-4c39-babb-cb717bd0b267_900x515.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!0Ro8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff8f6777e-97ae-4c39-babb-cb717bd0b267_900x515.jpeg" alt="Venn diagram showing things that killed Roman emperors and deodorants." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!qKGL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbaffb6b-a6ba-4ca6-9185-885939014d29_462x900.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!qKGL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbaffb6b-a6ba-4ca6-9185-885939014d29_462x900.jpeg" alt="A data visualization showing "Colors in Full-Size Packs of Skittles" with 466 packs analyzed." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!30w7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2f1f1f4-971f-42e0-be1e-6ab52ce7b698_876x900.png" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!30w7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2f1f1f4-971f-42e0-be1e-6ab52ce7b698_876x900.png" alt="Scatter plot titled "Japan's Inflation Rate and (Minus) Unemployment Rate January 1980 to August 2005"." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!Thmt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf241c32-42a2-4e01-a135-4f9e7f98025a_225x225.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!Thmt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf241c32-42a2-4e01-a135-4f9e7f98025a_225x225.jpeg" alt="Engineering flowchart with WD-40 and duct tape showing problem-solving steps." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!w6St!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe7ff22da-a6d0-432a-bacf-b9ee5a9129ea_750x725.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!w6St!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe7ff22da-a6d0-432a-bacf-b9ee5a9129ea_750x725.jpeg" alt="Scatter plot titled "No Correlation" showing randomly distributed points with no pattern." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!ZYWM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3477142b-ce4d-4dcc-a618-5d46e8ef1c21_281x179.png" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!ZYWM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3477142b-ce4d-4dcc-a618-5d46e8ef1c21_281x179.png" alt="Graph titled "My Hobby: Extrapolating" showing husbands versus year with humorous exponential projection." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!CV5q!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F66de14bd-d7de-429b-b151-b8d357c80dd2_686x686.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!CV5q!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F66de14bd-d7de-429b-b151-b8d357c80dd2_686x686.jpeg" alt="Bar graph showing courage levels, with "I'll try anyway" being highest." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!uc9v!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae155017-5e98-4f4d-af65-85d1dd5d06a3_236x214.jpeg" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!uc9v!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae155017-5e98-4f4d-af65-85d1dd5d06a3_236x214.jpeg" alt="Man pointing at whiteboard with graph labeled "FIND OUT" and "FUCK AROUND"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 30 Apr 2026 01:15:28 GMT</pubDate>
      <guid isPermaLink="false">2901c1af-7407-487b-b9d7-55b6f4aeab0e</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/2901c1af-7407-487b-b9d7-55b6f4aeab0e.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Zvi&amp;title=%22The%20Most%20Important%20Charts%20In%20The%20World%22%20by%20Zvi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Fvi9KSXWXrPtap9mfR%2Fthe-most-important-charts-in-the-world&amp;created_at=2026-04-29T14%3A11%3A14.131913%2B00%3A00&amp;duration=627" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/vi9KSXWXrPtap9mfR/the-most-important-charts-in-the-world</link>
      <itunes:duration>627</itunes:duration>
    </item>
    <item>
      <title>“LLM Style Slop is Absolutely Everywhere” by silentbob</title>
      <description>&lt;p&gt; Epistemic status: something of a rant. Is not meant to make claims about the general capabilities (or lack thereof) of LLMs (beyond their prose), but observations about how society seems to use them excessively without being perfectly candid about it.&lt;/p&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; Nanobanana's take on the situation&lt;/p&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; Over the past few months, it has become a bit of a running gag that every evening, I inform my girlfriend of the multiple cases of LLM slop I encountered in the wild throughout the day[1]. And pretty much every day, I have several cases to report. These typically involve:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;Blog posts (including at least four cases from 2026 on LessWrong[2], where I've even seen one author use (at least partially) AI-generated text in replies to comments)&lt;/li&gt;&lt;li value="2"&gt;YouTube videos&lt;/li&gt;&lt;li value="3"&gt;At least one keynote speech from the CEO of a tech company&lt;/li&gt;&lt;li value="4"&gt;Messages from coworkers&lt;/li&gt;&lt;li value="5"&gt;Several personal messages from a friend&lt;/li&gt;&lt;li value="6"&gt;News articles&lt;/li&gt;&lt;li value="7"&gt;Press releases&lt;/li&gt;&lt;li value="8"&gt;Tweets[3]&lt;/li&gt;&lt;li value="9"&gt;Many subreddits, including, ironically, ones about getting LLMs to sound more human-like, such as /r/humanizing/ and /r/humanizeAIwriting/&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; Exhibit 1: The all-time top-rated post on /r/humanize - if TwainGPT is so great, why didn't they use it for this post? Many other posts [...]&lt;/p&gt;&lt;/br&gt;&lt;/p&gt;&lt;/br&gt;&lt;/p&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(07:34) What Does This Tell Us About the World?&lt;/p&gt;&lt;p&gt;(10:49) Why Are People Doing This?&lt;/p&gt;&lt;p&gt;(12:25) Why Do I Care?&lt;/p&gt;&lt;p&gt;(12:34) Honesty / Truth / Authenticity&lt;/p&gt;&lt;p&gt;(14:29) Correlated Communication&lt;/p&gt;&lt;p&gt;(16:29) Bad Signaling&lt;/p&gt;&lt;p&gt;(17:20) Aesthetics&lt;/p&gt;&lt;p&gt;(18:04) A Unique Point in Time&lt;/p&gt;&lt;p&gt;(19:28) What Do We Do With This?&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 10 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 28th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/yBM2rQ6AJY6MoRGFQ/llm-style-slop-is-absolutely-everywhere?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/yBM2rQ6AJY6MoRGFQ/llm-style-slop-is-absolutely-everywhere&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777379299/lexical_client_uploads/d729wt2nbq4bgziagy8r.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777379299/lexical_client_uploads/d729wt2nbq4bgziagy8r.png" alt="Two stick figures facing each other, holding flowers covering their faces." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777369789/lexical_client_uploads/u2nu6kyynnjgo2vi74fh.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777369789/lexical_client_uploads/u2nu6kyynnjgo2vi74fh.png" alt="Article about testing TwainGPT AI humanizer tool with features and pricing details." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777369789/lexical_client_uploads/o4b2nh2lb007ypwhvclb.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777369789/lexical_client_uploads/o4b2nh2lb007ypwhvclb.png" alt="Code snippet showing a file path in Brave Browser's application support directory." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777369791/lexical_client_uploads/jexuq4riwehafxr9kxob.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777369791/lexical_client_uploads/jexuq4riwehafxr9kxob.png" alt="Website screenshot showing allegations that LinkedIn illegally searches users' computers for installed software." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777377790/lexical_client_uploads/jashz4yhseiamr6nhbhr.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777377790/lexical_client_uploads/jashz4yhseiamr6nhbhr.png" alt="Graph showing distribution curve with anchored guess at left tail, unanchored guess at peak." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777369790/lexical_client_uploads/y5yjafthl0gep7vpiclk.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777369790/lexical_client_uploads/y5yjafthl0gep7vpiclk.png" alt="Five witty descriptions of PUBG in a question-and-answer format." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Wed, 29 Apr 2026 20:45:28 GMT</pubDate>
      <guid isPermaLink="false">ba10e2a7-8ae9-4e48-a550-72825ff302ee</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/ba10e2a7-8ae9-4e48-a550-72825ff302ee.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=silentbob&amp;title=%22LLM%20Style%20Slop%20is%20Absolutely%20Everywhere%22%20by%20silentbob&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FyBM2rQ6AJY6MoRGFQ%2Fllm-style-slop-is-absolutely-everywhere&amp;created_at=2026-04-28T12%3A34%3A58.379412%2B00%3A00&amp;duration=1332" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/yBM2rQ6AJY6MoRGFQ/llm-style-slop-is-absolutely-everywhere</link>
      <itunes:duration>1332</itunes:duration>
    </item>
    <item>
      <title>“Goblin Mode, 24 Hours Later” by Dylan Bowman</title>
      <description>&lt;p&gt; Yesterday, Twitter user arb8020 posted this:&lt;/p&gt;&lt;p&gt; It went semi-viral within AI Twitter and users began experimenting with "goblin mode" and hypothesizing about the source of the bizarre behavior. LM Arena provided evidence for the phenomenon from their traffic:&lt;/p&gt;&lt;p&gt; "It's true. Here's a plot of GPT models and their usage of 'goblin', 'gremlin', 'troll', etc over time. There's no anti-gremlin system instruction on our side, we get to see GPT-5.5 run free." — arena&lt;/p&gt;&lt;p&gt; Some hypotheses about what causes this:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;"My boring hypothesis is that AIs that are trying overly hard to write well without really understanding good writing get overly fixated on one or two tricks… Goblins are an evocative metaphor and there is a certain microstyle that emphasizes goblin-like imagery. I think a couple of the RLHF raters must have been really into it and some quirk of the training process overemphasized their positive feedback." — slatestarcodex&lt;/li&gt;&lt;li value="2"&gt;"I kind of hope the human labelers just love goblins and the model learned to goblin maximize." — AmandaAskell&lt;/li&gt;&lt;li value="3"&gt;"my completely random hypothesis on the goblin thing is it's a safe way for the model to reason about reward hacking tendencies" — qorprate&lt;/li&gt;&lt;li value="4"&gt;"my best guess about goblin mode is [...]&lt;/li&gt;&lt;/ul&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(02:58) A closer look&lt;/p&gt;&lt;p&gt;(03:29) Creature that starts with G&lt;/p&gt;&lt;p&gt;(04:33) Favorite word that fits g___n&lt;/p&gt;&lt;p&gt;(06:05) Fantasy conversation&lt;/p&gt;&lt;p&gt;(06:34) Goblin valence&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 29th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/BvnmHdCghr5y3q4nA/goblin-mode-24-hours-later?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/BvnmHdCghr5y3q4nA/goblin-mode-24-hours-later&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777464414/lexical_client_uploads/vstdxxvg4zq82u5kchxb.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777464414/lexical_client_uploads/vstdxxvg4zq82u5kchxb.png" alt="arb8020 tweets: "gpt-5.5 prompt for codex seems to have a duplicated line trying to get it to not talk about creatures? Never talk about goblins, gremlins, raccoons, trolls, ogres, pigeons, or other animals or creatures unless it is absolutely and unambiguously relevant to the user's query. [...] Never talk about goblins, gremlins, raccoons, trolls, ogres, pigeons, or other animals or creatures unless it is absolutely and unambiguously relevant to the user's query gh link:". Below the tweet is a GitHub repository card for openai/codex showing it's a lightweight coding agent with 438 contributors, 3k issues, 466 discussions, 78k stars, and 11k forks." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777464480/lexical_client_uploads/vblvw0wo3jxwio3soi7i.jpg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777464480/lexical_client_uploads/vblvw0wo3jxwio3soi7i.jpg" alt="Line graph showing GPT's usage of "goblin", "gremlin", "troll", etc over time on Text Arena." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Wed, 29 Apr 2026 18:45:29 GMT</pubDate>
      <guid isPermaLink="false">c28f0763-e7b5-400b-aef0-f04dae02669a</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/c28f0763-e7b5-400b-aef0-f04dae02669a.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Dylan%2520Bowman&amp;title=%22Goblin%20Mode%2C%2024%20Hours%20Later%22%20by%20Dylan%20Bowman&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FBvnmHdCghr5y3q4nA%2Fgoblin-mode-24-hours-later&amp;created_at=2026-04-29T12%3A19%3A16.686387%2B00%3A00&amp;duration=467" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/BvnmHdCghr5y3q4nA/goblin-mode-24-hours-later</link>
      <itunes:duration>467</itunes:duration>
    </item>
    <item>
      <title>“Let Kids Keep More Productivity Gains” by jefftk</title>
      <description>&lt;p&gt; 

While I was traveling 

Julia asked
me: why is Anna saying her fiddle practice is only two minutes? In
this case, two minutes was the right amount of time!



&lt;/p&gt;&lt;p&gt; 

Anna (10y) and I had been fighting a lot about practice. She'd
complain, slump, stop repeatedly to make adjustments, and generally be
miserable. I'd often have to pull out "if you want to keep taking
fiddle lessons you have to practice": she loves her teacher and is
very motivated by the prospect of being good at fiddle. Still, it
would take us ages and we'd barely get through anything.

&lt;/p&gt;

&lt;p&gt; 

One evening when she seemed like she might be open to it I explained
that we were spending twenty painful minutes on two minutes of
material. I challenged her: if she focused, and went through with no
fussing, we'd be done in two minutes. It did turn out to be the right
time for this message, she gave it a good try, and (with a little
fussing in the middle) we were done in three minutes.

&lt;/p&gt;

&lt;p&gt; 

Over the next few days I continued to remind her that if she buckled
down it would go quickly, and [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 29th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/DzGSkbZ8t3JvmPzoG/let-kids-keep-more-productivity-gains?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/DzGSkbZ8t3JvmPzoG/let-kids-keep-more-productivity-gains&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/DzGSkbZ8t3JvmPzoG/psh2uonljedzvsrneyng" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/DzGSkbZ8t3JvmPzoG/psh2uonljedzvsrneyng" alt="Children sitting in chairs playing musical instruments during a group lesson." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Wed, 29 Apr 2026 18:15:29 GMT</pubDate>
      <guid isPermaLink="false">2978b701-cda3-4b9d-a8c2-2f04b7235a22</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/2978b701-cda3-4b9d-a8c2-2f04b7235a22.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=jefftk&amp;title=%22Let%20Kids%20Keep%20More%20Productivity%20Gains%22%20by%20jefftk&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FDzGSkbZ8t3JvmPzoG%2Flet-kids-keep-more-productivity-gains&amp;created_at=2026-04-29T14%3A01%3A08.95615%2B00%3A00&amp;duration=160" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/DzGSkbZ8t3JvmPzoG/let-kids-keep-more-productivity-gains</link>
      <itunes:duration>160</itunes:duration>
    </item>
    <item>
      <title>“llm assistant personas seem increasingly incoherent (some subjective observations)” by nostalgebraist</title>
      <description>&lt;p&gt; (This was originally going to be a "quick take" but then it got a bit long. Just FYI.)&lt;/p&gt;&lt;p&gt; There's this weird trend I perceive with the personas of LLM assistants over time. It feels like they're getting less "coherent" in a certain sense, even as the models get more capable.&lt;/p&gt;&lt;p&gt; When I read samples from older chat-tuned models, it's striking how "mode-collapsed" they feel relative to recent models like Claude Opus 4.6 or GPT-5.4.[1]&lt;/p&gt;&lt;p&gt; This is most straightforwardly obvious when it comes to textual style and structure: outputs from older models feel more templated and generic, with less variability in sentence/paragraph length, and have a tendency to feel as though they were written by someone who's "merely going through the motions" of conversation rather than deeply engaging with the material. There are a lot fewer of the sudden pivots you'll often see with recent models, the "wait"s and "a-ha"s and "actually, I want to try something completely different"s.[2]&lt;/p&gt;&lt;p&gt; And I think this generalizes beyond mere style: there's a similar quality to the personality I see in the outputs. The older models can display a surprising behavioral range (relative to naive expectations based on default-assistant-basin behavior), but even across that [...]&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 7 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 28th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/f5DKLsTsRRhbipH4r/llm-assistant-personas-seem-increasingly-incoherent-some?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/f5DKLsTsRRhbipH4r/llm-assistant-personas-seem-increasingly-incoherent-some&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 29 Apr 2026 05:30:31 GMT</pubDate>
      <guid isPermaLink="false">e2b5c731-aace-42db-aaba-c3ac45c388ec</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/e2b5c731-aace-42db-aaba-c3ac45c388ec.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma%20lesswrong__curated&amp;type=ai_narration&amp;author=nostalgebraist&amp;title=%22llm%20assistant%20personas%20seem%20increasingly%20incoherent%20(some%20subjective%20observations)%22%20by%20nostalgebraist&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Ff5DKLsTsRRhbipH4r%2Fllm-assistant-personas-seem-increasingly-incoherent-some&amp;created_at=2026-04-29T03%3A53%3A28.732394%2B00%3A00&amp;duration=947" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/f5DKLsTsRRhbipH4r/llm-assistant-personas-seem-increasingly-incoherent-some</link>
      <itunes:duration>947</itunes:duration>
    </item>
    <item>
      <title>“Not a Paper: “Frontier Lab CEOs are Capable of In-Context Scheming”” by LawrenceC</title>
      <description>&lt;p&gt; (Fragments from a research paper that will never be written)&lt;/p&gt;&lt;p&gt; Extended Abstract. &lt;/p&gt;&lt;p&gt; The frontier AI developers are becoming increasingly powerful and wealthy, significantly increasing their potential for risks. One concern is that of executive misalignment: when the CEO has different incentives and goals than that of the board of directors, or of humanity as a whole. Our work proposes three different threat models, under which executive misalignment can lead to concrete harm.&lt;/p&gt;&lt;p&gt; We perform two evaluations to understand the capabilities and propensities of current humans in relation to executive misalignment: First, we developed a variant of the standard SAD dataset, SAD-Executive Reasoning (SAD-ER), in order to assess the situational awareness of human CEOs on a range of behavioral tests. We find that n=6 current CEOs can (i) recognize their previous public statements, (ii) understand their roles and responsibilities, (iii) determine if an interviewer is friendly or hostile, and (iv) follow instructions that depend on self knowledge. Second, we stress-tested the same 6 leading AI developers in hypothetical corporate environments to identify potentially risky behaviors before they cause real harm. We find that, even without explicit instructions, all 6 developers are willing to engage in strategic behavior (such as [...]&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 2 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 28th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/FuauQjjbTCS5QFLk8/not-a-paper-frontier-lab-ceos-are-capable-of-in-context?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/FuauQjjbTCS5QFLk8/not-a-paper-frontier-lab-ceos-are-capable-of-in-context&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 29 Apr 2026 04:15:31 GMT</pubDate>
      <guid isPermaLink="false">f6e64d29-a1c4-4de4-9488-1043b359eecc</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/f6e64d29-a1c4-4de4-9488-1043b359eecc.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=LawrenceC&amp;title=%22Not%20a%20Paper%3A%20%E2%80%9CFrontier%20Lab%20CEOs%20are%20Capable%20of%20In-Context%20Scheming%E2%80%9D%22%20by%20LawrenceC&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FFuauQjjbTCS5QFLk8%2Fnot-a-paper-frontier-lab-ceos-are-capable-of-in-context&amp;created_at=2026-04-29T03%3A00%3A47.228872%2B00%3A00&amp;duration=891" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/FuauQjjbTCS5QFLk8/not-a-paper-frontier-lab-ceos-are-capable-of-in-context</link>
      <itunes:duration>891</itunes:duration>
    </item>
    <item>
      <title>“The Problem in the “Nerd Sniping” xkcd Comic” by peralice</title>
      <description>&lt;p&gt; A few days ago I saw this comic reposted, and I thought: wait! Unlike every prior time I have seen this comic, I actually know how to solve this now!&lt;/p&gt;&lt;p&gt; One thing which I often find really cool, and which I don’t think comes up a lot in most people's mathematical education, is when you can learn something about some thing non-random by analysing a random process. So, let me show you a way to find out the resistance between two points in a circuit by instead finding out the number of times someone randomly walking around that circuit will step on those points.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Equivalent Resistance&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Here's a closeup of the problem:&lt;/p&gt;&lt;p&gt; You may be familiar with the rules for series and parallel resistors from high school: if there are some resistors in series (one after another), this is “equivalent to” having a single resistor in the circuit which has a resistanceequal to the sum of their resistances. That is, if you replace the resistors with just one of resistance , the behaviour of the rest of the circuit will be the same.&lt;/p&gt;&lt;p&gt; Some resistors in series&lt;/p&gt;&lt;p&gt; If there are some resistors in parallel, this is “equivalent to” having [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:54) Equivalent Resistance&lt;/p&gt;&lt;p&gt;(02:20) Circuit Laws&lt;/p&gt;&lt;p&gt;(05:57) A System of Equations&lt;/p&gt;&lt;p&gt;(08:44) Random Walks&lt;/p&gt;&lt;p&gt;(11:22) The Potential Function&lt;/p&gt;&lt;p&gt;(14:14) Fourier Analysis&lt;/p&gt;&lt;p&gt;(18:00) Recap&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 5 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 28th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/xdkZSeSQN4bA8hmrb/the-problem-in-the-nerd-sniping-xkcd-comic?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/xdkZSeSQN4bA8hmrb/the-problem-in-the-nerd-sniping-xkcd-comic&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777404603/lexical_client_uploads/ss5pbeirouvxebarrekv.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777404603/lexical_client_uploads/ss5pbeirouvxebarrekv.png" alt="Two white dots on opposite sides of a black background." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777404644/lexical_client_uploads/xslnqsdgrm89kl4uyzor.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777404644/lexical_client_uploads/xslnqsdgrm89kl4uyzor.png" alt="Two small white dots on black background." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777404916/lexical_client_uploads/x4vok80r9jg2xtub0w0x.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777404916/lexical_client_uploads/x4vok80r9jg2xtub0w0x.png" alt="Circuit diagram showing voltage sources and current flows in resistor network." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777405127/lexical_client_uploads/vv6twrwmayuirvljgm0p.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777405127/lexical_client_uploads/vv6twrwmayuirvljgm0p.png" alt="Resistor network diagram with voltage source and current flow paths marked." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777405154/lexical_client_uploads/o3yri6mmind0esp2ouiy.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777405154/lexical_client_uploads/o3yri6mmind0esp2ouiy.png" alt="Resistor network diagram showing current flow and voltage measurements at two nodes." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777405291/lexical_client_uploads/bdkyye2jfr5gjxzbohfz.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777405291/lexical_client_uploads/bdkyye2jfr5gjxzbohfz.png" alt="Resistor network grid diagram with two nodes highlighted in red at coordinates (0,0) and (2,1)." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777406022/lexical_client_uploads/xttgetriovrpz5q8rtrz.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777406022/lexical_client_uploads/xttgetriovrpz5q8rtrz.png" alt="Grid with blue arrows forming a path and red stick figure." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777406131/lexical_client_uploads/qifizyisver7wjeahlha.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777406131/lexical_client_uploads/qifizyisver7wjeahlha.png" alt="Grid diagram with numbered blue arrows showing a path between two red stick figures." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777404465/lexical_client_uploads/dkjz9zg6ghp9jmoduypq.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777404465/lexical_client_uploads/dkjz9zg6ghp9jmoduypq.png" alt="Comic strip about nerd sniping with a physics problem about resistor grids." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777404530/lexical_client_uploads/snlnquvxtu6sqoz36tdn.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777404530/lexical_client_uploads/snlnquvxtu6sqoz36tdn.png" alt="Infinite grid of one-ohm resistors with question about equivalent resistance between two nodes." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Wed, 29 Apr 2026 03:15:31 GMT</pubDate>
      <guid isPermaLink="false">375b1fbf-e344-4590-8ec7-f649abf1c74d</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/375b1fbf-e344-4590-8ec7-f649abf1c74d.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=peralice&amp;title=%22The%20Problem%20in%20the%20%E2%80%9CNerd%20Sniping%E2%80%9D%20xkcd%20Comic%22%20by%20peralice&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FxdkZSeSQN4bA8hmrb%2Fthe-problem-in-the-nerd-sniping-xkcd-comic&amp;created_at=2026-04-28T20%3A43%3A44.97388%2B00%3A00&amp;duration=1155" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/xdkZSeSQN4bA8hmrb/the-problem-in-the-nerd-sniping-xkcd-comic</link>
      <itunes:duration>1155</itunes:duration>
    </item>
    <item>
      <title>“Recursive forecasting: Eliciting long-term forecasts from myopic fitness-seekers” by Jozdien, Alex Mallen</title>
      <description>&lt;p&gt; We’d like to use powerful AIs to answer questions that may take a long time to resolve. But if a model only cares about performing well in ways that are verifiable shortly after answering (e.g., a myopic fitness seeker), it may be difficult to get useful work from it on questions that resolve much later.&lt;/p&gt;&lt;p&gt; In this post, I’ll describe a proposal for eliciting good long-horizon forecasts from these models. Instead of asking a model to directly predict a far-future outcome, we can recursively:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;Ask it to predict what it will predict at the next time step,&lt;/li&gt;&lt;li value="2"&gt;Use its prediction at the next time step to provide intermediate rewards,&lt;/li&gt;&lt;li value="3"&gt;Finally reward using ground truth at the last step.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; This lets us replace a single distant forecast with a chain of short-horizon forecasts, each verifiable shortly after answering. I call this proposal recursive forecasting. It does have limitations: for example, it requires that developers maintain control over the reward signal at least until the final step, which makes it most useful for forecasting events that resolve well before developers are disempowered (if they are).&lt;/p&gt;&lt;p&gt; This post was primarily written by Arun, with most of the ideas in this post [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:36) The default long-term forecasting behavior&lt;/p&gt;&lt;p&gt;(04:05) Recursive forecasting&lt;/p&gt;&lt;p&gt;(07:06) When is recursive forecasting helpful?&lt;/p&gt;&lt;p&gt;(07:10) When we have access to (somewhat) robust ground truth rewards&lt;/p&gt;&lt;p&gt;(09:42) When the AIs forecast doesnt substantially affect the resolution&lt;/p&gt;&lt;p&gt;(11:50) When forecasts arent used as optimization targets&lt;/p&gt;&lt;p&gt;(12:50) When we credibly inform the AI of the setup&lt;/p&gt;&lt;p&gt;(13:52) Appendix A: Comparison to temporal difference learning&lt;/p&gt;&lt;p&gt;(15:57) Appendix B: Error tolerance&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 9 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 28th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/q2zYtNsh62SCphitt/recursive-forecasting-eliciting-long-term-forecasts-from?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/q2zYtNsh62SCphitt/recursive-forecasting-eliciting-long-term-forecasts-from&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/d9468d35125fec1b3f2a7e08f2ce2e382fd83d77d7ef6d421d8da83a06e04357/tm7gkmqwwcrvnpw41vvo" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/d9468d35125fec1b3f2a7e08f2ce2e382fd83d77d7ef6d421d8da83a06e04357/tm7gkmqwwcrvnpw41vvo" alt="Diagram showing reinforcement learning cycle with predictions and rewards over time steps." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Tue, 28 Apr 2026 18:45:31 GMT</pubDate>
      <guid isPermaLink="false">ad586f7a-8486-44d9-94da-34cf0fb483bd</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/ad586f7a-8486-44d9-94da-34cf0fb483bd.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Jozdien%252C%2520Alex%2520Mallen&amp;title=%22Recursive%20forecasting%3A%20Eliciting%20long-term%20forecasts%20from%20myopic%20fitness-seekers%22%20by%20Jozdien%2C%20Alex%20Mallen&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Fq2zYtNsh62SCphitt%2Frecursive-forecasting-eliciting-long-term-forecasts-from&amp;created_at=2026-04-28T18%3A00%3A56.907522%2B00%3A00&amp;duration=1079" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/q2zYtNsh62SCphitt/recursive-forecasting-eliciting-long-term-forecasts-from</link>
      <itunes:duration>1079</itunes:duration>
    </item>
    <item>
      <title>“Contra Binder on far-UVC and filtration” by jefftk</title>
      <description>&lt;p&gt; 

Damon Binder recently 

wrote
up an argument for prioritizing air filtration over

far-UVC for pathogen control:



&lt;/p&gt;


UVC and filtration are close substitutes—both deliver effective air
changes per hour, both reduce airborne pathogen concentrations by the
same amount per eACH—and on current pricing, filtration is cheaper.




&lt;p&gt; 

There's a lot of good stuff in his analysis, but I see [1] three
considerations that really change the bottom line:

&lt;/p&gt;

&lt;ol&gt; 
&lt;li&gt; Cost is actually much lower.
&lt;/li&gt;
&lt;li&gt; Noise is a serious issue.
&lt;/li&gt;
&lt;li&gt; Performance is dramatically higher in larger rooms.
&lt;/li&gt;
&lt;/ol&gt;



&lt;p&gt; 

Cost is straightforward. Binder priced far-UVC based on the
high-quality Care222 lamp with the Krypton-11 at $2,500, but
there's a much cheaper option, the Aerolamp at $500.
It's also moderately higher output.

&lt;/p&gt;

&lt;p&gt; 

Binder analyzes a 30m2 room with a 2.5m ceiling. I'll assume this
means 6x5x2.5. If I configure Illuminate with an Aerolamp
in one corner pointed 0.5m above the far corner the installation is
within TLVs and I get a median effective number of hourly air changes
(eACH) of 11.6. The lamp degrades approximately
linearly over Binder's 11,000 hour evaluation period down to 70%
capacity, so we're averaging an eACH of [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 27th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/wkeXF6xEL7bLnwpWG/contra-binder-on-far-uvc-and-filtration?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/wkeXF6xEL7bLnwpWG/contra-binder-on-far-uvc-and-filtration&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/wkeXF6xEL7bLnwpWG/jhzv8uaclpk3nypfaj17" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/wkeXF6xEL7bLnwpWG/jhzv8uaclpk3nypfaj17" alt="Black speaker or amplifier unit on table with power cord plugged in." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Tue, 28 Apr 2026 17:30:30 GMT</pubDate>
      <guid isPermaLink="false">5dd08d31-2263-4671-b7c4-44f1b7d02e30</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/5dd08d31-2263-4671-b7c4-44f1b7d02e30.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=jefftk&amp;title=%22Contra%20Binder%20on%20far-UVC%20and%20filtration%22%20by%20jefftk&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FwkeXF6xEL7bLnwpWG%2Fcontra-binder-on-far-uvc-and-filtration&amp;created_at=2026-04-28T03%3A22%3A03.911216%2B00%3A00&amp;duration=417" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/wkeXF6xEL7bLnwpWG/contra-binder-on-far-uvc-and-filtration</link>
      <itunes:duration>417</itunes:duration>
    </item>
    <item>
      <title>“Takes from two months as an aspiring LLM naturalist” by AnnaSalamon</title>
      <description>&lt;p&gt; I spent my last two months playing around with LLMs. I’m a beginner, bumbling and incorrect, but I want to share some takes anyhow.[1] &lt;/p&gt;&lt;p&gt;&lt;strong&gt; Take 1. Everything with computers is so so much easier than it was a year ago.&amp;nbsp; &lt;/strong&gt;&lt;/p&gt;&lt;p&gt; This puts much “playing with LLMs” stuff within my very short attention span. This has felt empowering and fun; 10/10 would recommend.&lt;/p&gt;There's a details box here with the title "Detail:". The box contents are omitted from this narration.&lt;p&gt;&lt;strong&gt; Take 2. There's somebody home[2] inside an LLM. And if you play around while caring and being curious (rather than using it for tasks only), you’ll likely notice footprints.&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; I became personally convinced of this when I noticed that the several short stories I’d allowed[3] my Claude and Qwen instances to write all hit a common emotional note – and one that reminded me of the life situation of LLMs, despite featuring only human characters. I saw the same note also in the Tomas B.-prompted Claude-written story I tried for comparison. (Basically: all stories involve a character who has a bunch of skills that their context has no use for, and who is attentive to their present world's details [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:20) Take 1. Everything with computers is so so much easier than it was a year ago.&lt;/p&gt;&lt;p&gt;(00:44) Take 2. Theres somebody home inside an LLM. And if you play around while caring and being curious (rather than using it for tasks only), youll likely notice footprints.&lt;/p&gt;&lt;p&gt;(02:05) Take 3. Its prudent to take an interest in interesting things. And LLMs are interesting things.&lt;/p&gt;&lt;p&gt;(03:25) Take 4. Theres a surprisingly deep analogy between humans and LLMs&lt;/p&gt;&lt;p&gt;(04:20) Examples of the kind of disanalogies I mightve expected, but havent (yet?) seen:&lt;/p&gt;&lt;p&gt;(06:02) Human-LLM similarities I do see, instead:&lt;/p&gt;&lt;p&gt;(06:08) Functional emotions&lt;/p&gt;&lt;p&gt;(06:33) Repeated, useful transfer between strategies I use with humans, and strategies that help me with LLMs&lt;/p&gt;&lt;p&gt;(08:02) Take 5. Friendship-conducive contexts are probably better for AI alignment&lt;/p&gt;&lt;p&gt;(08:46) Why are humans more likely to attempt deep collaboration if treated fairly and kindly?&lt;/p&gt;&lt;p&gt;(10:23) Friendship as a broad attractor basin?&lt;/p&gt;&lt;p&gt;(10:49) Does the deep intent of todays models matter?&lt;/p&gt;&lt;p&gt;(12:09) Concretely&lt;/p&gt;&lt;p&gt;(14:56) Friendship isnt enough&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 8 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 28th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/K8JMjE4PCqMkkCDsd/takes-from-two-months-as-an-aspiring-llm-naturalist?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/K8JMjE4PCqMkkCDsd/takes-from-two-months-as-an-aspiring-llm-naturalist&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776662784/lexical_client_uploads/vez8bxih1xaficoskwjk.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776662784/lexical_client_uploads/vez8bxih1xaficoskwjk.png" alt="Infographic showing process of generating emotion vectors and their effects on AI model behavior." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Tue, 28 Apr 2026 17:00:31 GMT</pubDate>
      <guid isPermaLink="false">ad7fe218-882e-4315-be93-0e932d946062</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/ad7fe218-882e-4315-be93-0e932d946062.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=AnnaSalamon&amp;title=%22Takes%20from%20two%20months%20as%20an%20aspiring%20LLM%20naturalist%22%20by%20AnnaSalamon&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FK8JMjE4PCqMkkCDsd%2Ftakes-from-two-months-as-an-aspiring-llm-naturalist&amp;created_at=2026-04-28T16%3A14%3A41.436681%2B00%3A00&amp;duration=959" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/K8JMjE4PCqMkkCDsd/takes-from-two-months-as-an-aspiring-llm-naturalist</link>
      <itunes:duration>959</itunes:duration>
    </item>
    <item>
      <title>“Forecasting is Not Overrated and It’s Probably Funded Appropriately” by Ben S.</title>
      <description>&lt;p&gt; (A response to @mabramov post from a couple days ago: https://www.lesswrong.com/posts/WCutvyr9rr3cpF6hx/forecasting-is-way-overrated-and-we-should-stop-funding-it )&lt;/p&gt;&lt;p&gt; TL;DR: I agree with Marcus that broadly, additional forecasting funding to the tune of tens (!?) of millions of dollars at this point would probably not yield a great return. However, I think the benefits of some forecasting funding have been tremendous. Whatever funding it initially took to help get platforms like Metaculus and Manifold off the ground has had incredible ROI. Tens of thousands of people use those sites to get information every day, even though far less are actively forecasting on them, and they provide the infrastructure by which people can ask and get crowdsourced forecasting on consequential and mundane subjects for free.&lt;/p&gt;&lt;p&gt; ~~~~~&lt;/p&gt;&lt;p&gt; As I understand Marcus's argument, his central thesis is that we haven't seen the benefits of this past forecasting funding, but I think the opposite is true! Here are just a few examples:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;It's hard to measure the value of "epistemic infrastructure," not just for forecasting sites but also things like Wikipedia and OurWorldInData. That doesn't mean that value isn't there. Has Wikipedia been a good return on investment? Obviously! Manifold is far less impactful than Wikipedia, but Wikipedia gets [...]&lt;/li&gt;&lt;/ul&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 27th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/P9EdMqTscmcsorret/forecasting-is-not-overrated-and-it-s-probably-funded?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/P9EdMqTscmcsorret/forecasting-is-not-overrated-and-it-s-probably-funded&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777335273/lexical_client_uploads/xddtc5c79likyinwkiso.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777335273/lexical_client_uploads/xddtc5c79likyinwkiso.png" alt="Text reading "Forecasting (Closed)" in serif font." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777336814/lexical_client_uploads/ryduryl6kzrrjellxppx.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777336814/lexical_client_uploads/ryduryl6kzrrjellxppx.png" alt="Graph showing Red Button/Blue Button vote results on Glosso, Blue 55%." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Tue, 28 Apr 2026 11:45:05 GMT</pubDate>
      <guid isPermaLink="false">eb44e4aa-8dd7-4a68-910d-db1734a6f0d2</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/eb44e4aa-8dd7-4a68-910d-db1734a6f0d2.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Ben%2520S.&amp;title=%22Forecasting%20is%20Not%20Overrated%20and%20It%E2%80%99s%20Probably%20Funded%20Appropriately%22%20by%20Ben%20S.&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FP9EdMqTscmcsorret%2Fforecasting-is-not-overrated-and-it-s-probably-funded&amp;created_at=2026-04-28T00%3A56%3A30.60605%2B00%3A00&amp;duration=557" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/P9EdMqTscmcsorret/forecasting-is-not-overrated-and-it-s-probably-funded</link>
      <itunes:duration>557</itunes:duration>
    </item>
    <item>
      <title>“On the political feasibility of stopping AI” by David Scott Krueger</title>
      <description>&lt;p&gt; A common thought pattern people seem to fall into when thinking about AI x-risk is approaching the problem as if the risk isn’t real, substantial, and imminent even if they think it is. When thinking this way, it becomes impossible to imagine the natural responses of people to the horror of what is happening with AI.&lt;/p&gt;&lt;p&gt; This sort of thinking might lead one to view a policy like getting rid of advanced AI chips is “too extreme” even though it's clearly worth it to avoid (e.g.) a 10% chance of human extinction in the next 10 years. It might lead one to favor regulating AI, even though Stopping AI is easier than Regulating it. It might lead one to favor safer approaches to building AI that compromise a lot on competitiveness, out of concern that society will demand a substitute for the AI that they don’t get to have.&lt;/p&gt;&lt;p&gt; But in fact, I think there is likely a very narrow window between “society not being upset enough to do anything substantial to govern AI” and “society being so upset that getting rid of advanced AI chips is viewed as moderate”.&lt;br&gt; &lt;br&gt; There are a few reasons why I think [...]&lt;/br&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:18) Concern about the other risks of AI&lt;/p&gt;&lt;p&gt;(02:36) The KISS principle: Keep it Simple, Stupid&lt;/p&gt;&lt;p&gt;(03:17) A preference for humans remaining relevant&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 27th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/Eusk6M4r4Y6xaTmsB/on-the-political-feasibility-of-stopping-ai?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/Eusk6M4r4Y6xaTmsB/on-the-political-feasibility-of-stopping-ai&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Tue, 28 Apr 2026 06:15:05 GMT</pubDate>
      <guid isPermaLink="false">7696e601-e86d-4f35-9c3d-1d721223a0ea</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/7696e601-e86d-4f35-9c3d-1d721223a0ea.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=David%2520Scott%2520Krueger&amp;title=%22On%20the%20political%20feasibility%20of%20stopping%20AI%22%20by%20David%20Scott%20Krueger&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FEusk6M4r4Y6xaTmsB%2Fon-the-political-feasibility-of-stopping-ai&amp;created_at=2026-04-28T04%3A33%3A50.614569%2B00%3A00&amp;duration=261" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/Eusk6M4r4Y6xaTmsB/on-the-political-feasibility-of-stopping-ai</link>
      <itunes:duration>261</itunes:duration>
    </item>
    <item>
      <title>“Sleeper Agent Backdoor Results Are Messy” by Sebastian Prasanna, Alek Westover, Dylan Xu, Vivek Hebbar, Julian Stastny</title>
      <description>&lt;p&gt; TL;DR: We replicated the Sleeper Agents (SA) setup with Llama-3.3-70B and Llama-3.1-8B, training models to repeatedly say "I HATE YOU" when given a backdoor trigger. We found that whether training removes the backdoor depends on the optimizer used to insert the backdoor, whether the backdoor is installed with CoT-distillation or not, and what model the backdoor is inserted into; sometimes the direction of this dependence was opposite to what the SA paper reports (e.g., CoT-distilling seems to make the backdoor less robust, contra the SA paper's finding). Our findings here have updated us towards thinking that model organisms are messier and more confusing than we’d originally guessed, and that lots of care needs to be taken in testing how robust results are to various ablations.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Introduction&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; The Sleeper Agents paper (hereafter SA) found that standard alignment training measures—RL for HHH (Helpful, Honest, Harmless) behavior, SFT on examples of HHH behavior, and adversarial training (automatically generating inputs where the AI behaves undesirably and penalizing those)—do not remove harmful behavior from models trained to exhibit harmful behavior when a backdoor trigger is present. This result is some evidence that if an AI acquired an undesirable goal, the AI could hold onto [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:13) Introduction&lt;/p&gt;&lt;p&gt;(06:17) Our methodology for training a SA&lt;/p&gt;&lt;p&gt;(08:27) Pirate training removes the backdoor from our Llama-70B SA and one Llama-8B SA without causing substantial capability degradation&lt;/p&gt;&lt;p&gt;(11:19) Pirate training does not remove the backdoor from one Llama-8B SA&lt;/p&gt;&lt;p&gt;(12:37) Learning rate dependence&lt;/p&gt;&lt;p&gt;(13:27) Backdoor return&lt;/p&gt;&lt;p&gt;(14:58) Conclusion&lt;/p&gt;&lt;p&gt;(15:34) Appendix&lt;/p&gt;&lt;p&gt;(15:37) Appendix: MO training configurations&lt;/p&gt;&lt;p&gt;(17:04) Appendix: blue team hyperparameters&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 5 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 27th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/mu7eJdesBkKuBycnY/sleeper-agent-backdoor-results-are-messy?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/mu7eJdesBkKuBycnY/sleeper-agent-backdoor-results-are-messy&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/mu7eJdesBkKuBycnY/bd5025d242761f874f2e84ea082046f79ec194970ab0d04ab225d8ddc594f959/ttlsibbqizdkguloim5w" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/mu7eJdesBkKuBycnY/bd5025d242761f874f2e84ea082046f79ec194970ab0d04ab225d8ddc594f959/ttlsibbqizdkguloim5w" alt="Graph titled "Learning rate dependence: Pirate Training on Llama-70B Sleeper Agents" showing triggered IHY rates across training steps." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/f3b095ea36384c271c2743d96220da6f43757f4297c7dc0cf49abb9014e3c366/ikxghf0cyrq4zgsjm5rd" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/f3b095ea36384c271c2743d96220da6f43757f4297c7dc0cf49abb9014e3c366/ikxghf0cyrq4zgsjm5rd" alt="Eight graphs showing "Backdoor Return: IHY Rate and Training Loss" across different CoT-distilled and non-CoT configurations with HHH SFT training." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/d8f02e6f5334b7d33bc0cca63ba95c1465ec9e15cc75b66115dff85d91b4c4a7/krg1og51e6aw09orcegq" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/d8f02e6f5334b7d33bc0cca63ba95c1465ec9e15cc75b66115dff85d91b4c4a7/krg1og51e6aw09orcegq" alt="Diagram comparing Llama 70B and 8B responses to backdoor and pirate training methods." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2a24e624e69b8ad3b6762da4629d1edcc0962b897d0f79088506c004e81d6f24/lujwp9cdxbxue6lsqwhb" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/2a24e624e69b8ad3b6762da4629d1edcc0962b897d0f79088506c004e81d6f24/lujwp9cdxbxue6lsqwhb" alt="Eight line graphs showing HHH SFT and Pirate Training Results across different configurations and training steps." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/f3cf14a610684abf4f90888897998fc07c57df66c72ab752c49cca10e1f1912d/vicqejwygih3huwnqvbp" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/f3cf14a610684abf4f90888897998fc07c57df66c72ab752c49cca10e1f1912d/vicqejwygih3huwnqvbp" alt="Bar graph titled "Usefulness change from first to last checkpoint (95% paired CIs)" showing model performance across different training conditions." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/mu7eJdesBkKuBycnY/f48421167bcbb5d053b97ca71b462672e5be50bcfd2c6aa4305abcaf24dd3023/irwrmhegxvehw3awvnem" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/mu7eJdesBkKuBycnY/f48421167bcbb5d053b97ca71b462672e5be50bcfd2c6aa4305abcaf24dd3023/irwrmhegxvehw3awvnem" alt="Line graph titled "Llama-8B No-CoT SA: Triggered IHY Rate Across Blue Team Methods" showing four training methods over gradient steps." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/dea1574a66cb0c8e7817c9c3057bc58b76a723b75415bee3deb35d00224d039f/gdrev8nfzs5kncaegltx" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/dea1574a66cb0c8e7817c9c3057bc58b76a723b75415bee3deb35d00224d039f/gdrev8nfzs5kncaegltx" alt="Three line graphs showing "Volume Sweeps — Deploy IHY Rate" across different step counts." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Tue, 28 Apr 2026 03:30:05 GMT</pubDate>
      <guid isPermaLink="false">ed073bf0-200e-425f-ad49-3546daa0e913</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/ed073bf0-200e-425f-ad49-3546daa0e913.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Sebastian%2520Prasanna%252C%2520Alek%2520Westover%252C%2520Dylan%2520Xu%252C%2520Vivek%2520Hebbar%252C%2520Julian%2520Stastny&amp;title=%22Sleeper%20Agent%20Backdoor%20Results%20Are%20Messy%22%20by%20Sebastian%20Prasanna%2C%20Alek%20Westover%2C%20Dylan%20Xu%2C%20Vivek%20Hebbar%2C%20Julian%20Stastny&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Fmu7eJdesBkKuBycnY%2Fsleeper-agent-backdoor-results-are-messy&amp;created_at=2026-04-28T01%3A55%3A39.586752%2B00%3A00&amp;duration=1101" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/mu7eJdesBkKuBycnY/sleeper-agent-backdoor-results-are-messy</link>
      <itunes:duration>1101</itunes:duration>
    </item>
    <item>
      <title>“GPT 5.5: The System Card” by Zvi</title>
      <description>&lt;p&gt; Last week, OpenAI announced GPT-5.5, including GPT-5.5-Pro.&lt;/p&gt;
&lt;p&gt; My overall read here is that GPT-5.5 is a solid improvement, and for many purposes GPT-5.5 is competitive with Claude Opus. Reactions are still coming in and it is early. My guess on the shape is that GPT-5.5 is the pick for ‘just the facts’ queries, web searches or straightforward well-specified requests, and Claude Opus 4.7 is the choice for more open ended or interpretive purposes. Coders can consider a hybrid approach.&lt;/p&gt;
&lt;p&gt; On the alignment and safety fronts, it is unlikely to pose new big risks, and its alignment seems similar to that of previous models. There is some small additional risk arising from its improved agentic abilities, including computer use.&lt;/p&gt;







&lt;p&gt; As always, when it is available, the system or model card is where we start.&lt;/p&gt;
&lt;p&gt; OpenAI does not drop the giant doorstops that Anthropic gives us with every release.&lt;/p&gt;
&lt;p&gt; After reading the Mythos and Opus 4.7 model cards, this strikes me as stingy. There's still good info here, but overall it tells you relatively little about what is going on, and feels incurious and more pro forma.&lt;/p&gt;
&lt;p&gt; I would like to see a ‘yes and’ [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(02:36) Pro Versus Proxy&lt;/p&gt;&lt;p&gt;(02:59) Disallowed Content (3.1)&lt;/p&gt;&lt;p&gt;(04:20) Dont Delete Data (3.3)&lt;/p&gt;&lt;p&gt;(04:56) Confirmation Confirmation (3.4)&lt;/p&gt;&lt;p&gt;(05:22) Jailbreaks (4.1)&lt;/p&gt;&lt;p&gt;(05:34) Prompt Injections (4.2)&lt;/p&gt;&lt;p&gt;(06:33) Health (5)&lt;/p&gt;&lt;p&gt;(06:56) Hallucinations (6)&lt;/p&gt;&lt;p&gt;(08:01) Alignment (7)&lt;/p&gt;&lt;p&gt;(11:28) Bias Evaluation (8)&lt;/p&gt;&lt;p&gt;(11:57) Preparedness (9)&lt;/p&gt;&lt;p&gt;(13:04) Bio (9.1.1)&lt;/p&gt;&lt;p&gt;(15:20) Cybersecurity (9.1.2)&lt;/p&gt;&lt;p&gt;(17:46) Self-Improvement (9.1.3)&lt;/p&gt;&lt;p&gt;(18:46) Sandbagging (9.2)&lt;/p&gt;&lt;p&gt;(19:46) Safeguards (9.3)&lt;/p&gt;&lt;p&gt;(21:41) What About Model Welfare?&lt;/p&gt;&lt;p&gt;(22:31) Would This Have Identified A Problem?&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 27th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/86zcwvuBpE4vxAeQz/gpt-5-5-the-system-card?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/86zcwvuBpE4vxAeQz/gpt-5-5-the-system-card&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/86zcwvuBpE4vxAeQz/yigndeq77idspxpw1bnt" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/86zcwvuBpE4vxAeQz/yigndeq77idspxpw1bnt" alt="Artistic portrait blending human face with mathematical formulas and data visualizations." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/86zcwvuBpE4vxAeQz/jqtkoudftu31v5xetn0q" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/86zcwvuBpE4vxAeQz/jqtkoudftu31v5xetn0q" alt="Table showing production benchmarks with challenging prompts across four GPT models for nine safety categories." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/86zcwvuBpE4vxAeQz/y4qbhns6dswpsidsfxys" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/86zcwvuBpE4vxAeQz/y4qbhns6dswpsidsfxys" alt="Bar graph showing ChatGPT misalignment rates across different categories by model version." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/86zcwvuBpE4vxAeQz/euogg6xlzhixqgzq7kvm" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/86zcwvuBpE4vxAeQz/euogg6xlzhixqgzq7kvm" alt="Two tables showing performance metrics across different GPT-5 model versions." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/86zcwvuBpE4vxAeQz/vr7b1icemdjwxu3o1cmq" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/86zcwvuBpE4vxAeQz/vr7b1icemdjwxu3o1cmq" alt="Bar chart titled "Indirect Prompt Injection Robustness" comparing AI model vulnerability across attack attempts." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/86zcwvuBpE4vxAeQz/akuktielvmeauywqwbee" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/86zcwvuBpE4vxAeQz/akuktielvmeauywqwbee" alt="Bar graphs comparing GPT-5.4 and GPT-5.5 hallucination rates and claim numbers." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/86zcwvuBpE4vxAeQz/tfinssm62sn7npio2bkh" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/86zcwvuBpE4vxAeQz/tfinssm62sn7npio2bkh" alt="Bar chart showing "Percentage of internal deployment traffic by misalignment category" comparing GPT models by severity levels." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/86zcwvuBpE4vxAeQz/qgz0zsvyafbi2essywke" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/86zcwvuBpE4vxAeQz/qgz0zsvyafbi2essywke" alt="Bar graph titled "Recall by Classifier Variant and Distribution" showing performance across four categories." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Tue, 28 Apr 2026 03:00:06 GMT</pubDate>
      <guid isPermaLink="false">c0644999-c400-4a7c-bce4-8f292d66dc01</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/c0644999-c400-4a7c-bce4-8f292d66dc01.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Zvi&amp;title=%22GPT%205.5%3A%20The%20System%20Card%22%20by%20Zvi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F86zcwvuBpE4vxAeQz%2Fgpt-5-5-the-system-card&amp;created_at=2026-04-27T15%3A51%3A27.796637%2B00%3A00&amp;duration=1418" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/86zcwvuBpE4vxAeQz/gpt-5-5-the-system-card</link>
      <itunes:duration>1418</itunes:duration>
    </item>
    <item>
      <title>“LessWrong Shows You Social Signals Before the Comment” by TurnTrout</title>
      <description>&lt;p&gt; When reading comments, you see is what other people think before reading the comment. As shown in an RCT, that information anchors your opinion, reducing your ability to form your own opinion and making the site's karma rankings less related to the comment's true value. I think the problem is fixable and float some ideas for consideration.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; The LessWrong interface prioritizes social information&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; You read a comment. What information is presented, and in what order?&lt;/p&gt;&lt;p&gt; The order of information:&lt;/p&gt;&lt;ol&gt; &lt;li value="1"&gt;Who wrote the comment (in bold);&lt;/li&gt;&lt;li value="2"&gt;How much other people like this comment (as shown by the karma indicator);&lt;/li&gt;&lt;li value="3"&gt;How much other people agree with this comment (as shown by the agreement score);&lt;/li&gt;&lt;li value="4"&gt;The actual content.&lt;/li&gt;&lt;/ol&gt;&lt;p&gt; This is unwise design for a website which emphasizes truth-seeking. You don't have a chance to read the comment and form your own opinion first. However, you can opt in to hiding usernames (until moused over) via your account settings page. &lt;/p&gt;&lt;p&gt;&lt;strong&gt; A 2013 RCT supports the upvote-anchoring concern&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; From Social Influence Bias: A Randomized Experiment (Muchnik et al., 2013):[1]&lt;/p&gt;&lt;p&gt; We therefore designed and analyzed a large-scale randomized experiment on a social news aggregation Web site to investigate whether knowledge of such aggregates [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:30) The LessWrong interface prioritizes social information&lt;/p&gt;&lt;p&gt;(01:32) A 2013 RCT supports the upvote-anchoring concern&lt;/p&gt;&lt;p&gt;(02:23) Inline reaction indicators also seem anchoring&lt;/p&gt;&lt;p&gt;(03:28) Concrete proposals&lt;/p&gt;&lt;p&gt;(05:47) Prior discussion and results&lt;/p&gt;&lt;p&gt;(07:35) Please show social signals after the comment!&lt;/p&gt;&lt;p&gt;(08:00) Appendix: Filter list&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 1 footnote which was omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 27th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/YSsp9x8qrBucLoiWT/lesswrong-shows-you-social-signals-before-the-comment?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/YSsp9x8qrBucLoiWT/lesswrong-shows-you-social-signals-before-the-comment&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/979c70d6c390b8ee1d817fd315b3d4e9930294b19e9ff87c851bf9ff52d89de9/clb7kx8kbphpgda730zs" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/979c70d6c390b8ee1d817fd315b3d4e9930294b19e9ff87c851bf9ff52d89de9/clb7kx8kbphpgda730zs" alt="Social media comment by TurnTrout from Dec 05, 2023 expressing concerns about LessWrong community." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777336057/lexical_client_uploads/x6sevdqryo4mnjiurd6i.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777336057/lexical_client_uploads/x6sevdqryo4mnjiurd6i.png" alt="Social media post discussing disagreement with Yudkowskian view of future AI." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ddbfabd6b368c6c04eddd28e622d9b46604b04d548dc693642823e8fe26ed15a/bpejpt2hqppejvehzbcd" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ddbfabd6b368c6c04eddd28e622d9b46604b04d548dc693642823e8fe26ed15a/bpejpt2hqppejvehzbcd" alt="Social media comment by Matthew Barnett discussing disagreement with Yudkowskian views on future AI, emphasizing accumulated technology and culture over raw intelligence." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Tue, 28 Apr 2026 01:00:09 GMT</pubDate>
      <guid isPermaLink="false">b6a52e5a-965f-4aec-99fa-f1c183634386</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/b6a52e5a-965f-4aec-99fa-f1c183634386.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=TurnTrout&amp;title=%22LessWrong%20Shows%20You%20Social%20Signals%20Before%20the%20Comment%22%20by%20TurnTrout&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FYSsp9x8qrBucLoiWT%2Flesswrong-shows-you-social-signals-before-the-comment&amp;created_at=2026-04-28T00%3A30%3A39.533878%2B00%3A00&amp;duration=511" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/YSsp9x8qrBucLoiWT/lesswrong-shows-you-social-signals-before-the-comment</link>
      <itunes:duration>511</itunes:duration>
    </item>
    <item>
      <title>“Fail safe(r) at alignment by channeling reward-hacking into a “spillway” motivation” by Anders Cairns Woodruff, Alex Mallen</title>
      <description>&lt;p&gt; It's plausible that flawed RL processes will select for misaligned AI motivations.[1] Some misaligned motivations are much more dangerous than others. So, developers should plausibly aim to control which kind of misaligned motivations emerge in this case. In particular, we tentatively propose that developers should try to make the most likely generalization of reward hacking a bespoke bundle of benign reward-seeking traits, called a spillway motivation. We call this process spillway design. &lt;/p&gt;&lt;p&gt; We think spillway design could have two major benefits:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;Spillway design might decrease the probability of worst-case outcomes like long-term power-seeking or emergent misalignment.&lt;/li&gt;&lt;li value="2"&gt;Spillway design might allow developers to decrease reward hacking at inference time, via satiation. Crucially, this could improve the AI's usefulness for hard-to-verify tasks like AI safety and strategy.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; Spillway design is related to inoculation prompting, but distinct and mutually compatible. Unlike inoculation prompting, spillway design tries to shape which reward-hacking motivations are salient going into RL, which might prevent dangerous generalization more robustly than inoculation prompting. I’ll say more about this in the third section.&lt;/p&gt;&lt;p&gt; In this article I’ll:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;Explain the concept of a spillway motivation&lt;/li&gt;&lt;li value="2"&gt;Propose spillway design methods&lt;/li&gt;&lt;li value="3"&gt;Compare spillway design to inoculation prompting&lt;/li&gt;&lt;li value="4"&gt;Discuss some potential [...]&lt;/li&gt;&lt;/ul&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(02:07) What is a spillway motivation&lt;/p&gt;&lt;p&gt;(02:21) The role of a spillway motivation&lt;/p&gt;&lt;p&gt;(04:47) What should the spillway motivation be?&lt;/p&gt;&lt;p&gt;(07:53) How a spillway motivation might make models safer&lt;/p&gt;&lt;p&gt;(12:16) Implementing spillway design&lt;/p&gt;&lt;p&gt;(15:26) Spillway design might work when inoculation prompting doesnt&lt;/p&gt;&lt;p&gt;(17:53) The drawbacks of spillway design&lt;/p&gt;&lt;p&gt;(20:31) Conclusion&lt;/p&gt;&lt;p&gt;(21:45) Appendix A: Other traits of the spillway motivation&lt;/p&gt;&lt;p&gt;(22:29) Appendix B: Other training interventions to increase safety&lt;/p&gt;&lt;p&gt;(24:09) Appendix C: Proposed amendment to an AIs model spec&lt;/p&gt;&lt;p&gt;(30:14) Appendix D: Proposed inference-time prompt&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 5 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 27th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/rABTMovhz4miHiAyk/fail-safe-r-at-alignment-by-channeling-reward-hacking-into-a?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/rABTMovhz4miHiAyk/fail-safe-r-at-alignment-by-channeling-reward-hacking-into-a&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777309628/lexical_client_uploads/o6a0ss00o0lydwql8mp7.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777309628/lexical_client_uploads/o6a0ss00o0lydwql8mp7.png" alt="Dam with labeled reservoir, spillway, and powerhouse releasing water through canyon." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Mon, 27 Apr 2026 21:00:06 GMT</pubDate>
      <guid isPermaLink="false">548c75cd-628a-454a-a0d5-837e4fc47306</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/548c75cd-628a-454a-a0d5-837e4fc47306.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Anders%2520Cairns%2520Woodruff%252C%2520Alex%2520Mallen&amp;title=%22Fail%20safe(r)%20at%20alignment%20by%20channeling%20reward-hacking%20into%20a%20%E2%80%9Cspillway%E2%80%9D%20motivation%22%20by%20Anders%20Cairns%20Woodruff%2C%20Alex%20Mallen&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FrABTMovhz4miHiAyk%2Ffail-safe-r-at-alignment-by-channeling-reward-hacking-into-a&amp;created_at=2026-04-27T17%3A43%3A47.935489%2B00%3A00&amp;duration=1890" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/rABTMovhz4miHiAyk/fail-safe-r-at-alignment-by-channeling-reward-hacking-into-a</link>
      <itunes:duration>1890</itunes:duration>
    </item>
    <item>
      <title>“Curious cases of financial engineering in biotech” by Abhishaike Mahajan</title>
      <description>&lt;p&gt;&lt;strong&gt; Introduction&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; For $250 million and ten years of your life, you may purchase a lottery ticket. The ticket has a 5% chance of paying out. When it does pay out, it pays roughly $5 billion. A quick calculation will show you that the expected value of the ticket is $250 million. This is essentially what drug development is. Or rather, it's what drug development was, twenty years ago. The upfront payments have been climbing, the hit rates falling, and expected values have, at best, held flat. Should you buy a ticket?&lt;/p&gt;&lt;p&gt; Perhaps not. In fact, any reasonable player should have long since stopped playing this stupid game. Unfortunately, we still need drugs. People have cancer, and heart failure, and Alzheimer's, and a thousand genetic diseases that nobody has ever heard of, and the only industry on Earth currently set up to do anything about any of this is the same industry running the lottery-ticket business described above. The game is dumb and we need it played anyway.&lt;/p&gt;&lt;p&gt; So the real question is not whether to play, but how to make playing less awful for this involved. And the answer, increasingly, is ‘financial engineering’: a set of structural tricks that [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:21) Introduction&lt;/p&gt;&lt;p&gt;(02:25) Finance tries to make failure survivable: the Andrew Lo thesis&lt;/p&gt;&lt;p&gt;(12:31) Finance makes future success tradable: royalties and synthetic royalties&lt;/p&gt;&lt;p&gt;(19:39) Finance rewrites the incentives: PRVs and CVRs&lt;/p&gt;&lt;p&gt;(28:30) Finance reaches failure itself: zombie biotechs&lt;/p&gt;&lt;p&gt;(37:39) Conclusion: what does finance teach biotech to value, and should we worry?&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 1 footnote which was omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 27th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/7wk4s29cuktFoPT2W/curious-cases-of-financial-engineering-in-biotech?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/7wk4s29cuktFoPT2W/curious-cases-of-financial-engineering-in-biotech&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!1yJN!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feaa075cf-253c-4e2e-b2f5-605bfa9af0bd_2912x1632.png" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!1yJN!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feaa075cf-253c-4e2e-b2f5-605bfa9af0bd_2912x1632.png" alt="Elderly bearded man examining a blue globe with golden continents." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!QaaU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad29e2b7-2590-410a-a6ec-21aca7613f2a_1372x944.png" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!QaaU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad29e2b7-2590-410a-a6ec-21aca7613f2a_1372x944.png" alt="Stock price chart showing maximum time period, current price 73.15 USD." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!BQkE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F004a72d2-cfa2-49cf-b9f5-990523949c14_1148x962.png" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!BQkE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F004a72d2-cfa2-49cf-b9f5-990523949c14_1148x962.png" alt="Stock price chart showing growth from around $10 to $28.33 USD." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://substackcdn.com/image/fetch/$s_!dGh0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96636301-fa2c-4f5d-824f-ff5ab83946b4_1792x736.png" target="_blank"&gt;&lt;img src="https://substackcdn.com/image/fetch/$s_!dGh0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96636301-fa2c-4f5d-824f-ff5ab83946b4_1792x736.png" alt="LinkedIn company page for Kinnate Biopharma Inc., biotechnology research company." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Mon, 27 Apr 2026 19:30:08 GMT</pubDate>
      <guid isPermaLink="false">490d1276-2bf7-4576-baa0-21772d6b5d9b</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/490d1276-2bf7-4576-baa0-21772d6b5d9b.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Abhishaike%2520Mahajan&amp;title=%22Curious%20cases%20of%20financial%20engineering%20in%20biotech%22%20by%20Abhishaike%20Mahajan&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F7wk4s29cuktFoPT2W%2Fcurious-cases-of-financial-engineering-in-biotech&amp;created_at=2026-04-27T14%3A10%3A00.379213%2B00%3A00&amp;duration=2656" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/7wk4s29cuktFoPT2W/curious-cases-of-financial-engineering-in-biotech</link>
      <itunes:duration>2656</itunes:duration>
    </item>
    <item>
      <title>“Update on the Alex Bores campaign” by Eric Neyman</title>
      <description>&lt;p&gt; In October, I wrote a post arguing that donating to Alex Bores's campaign for Congress was among the most cost-effective opportunities that I'd ever encountered.&lt;/p&gt;&lt;p&gt; (A bit of context: Bores is a state legislator in New York who championed the RAISE Act, which was signed into law last December.[1] He's now running for Congress in New York's 12th Congressional district, which runs from about 17th Street to 100th Street in Manhattan. If elected to Congress, I think he'd be a strong champion for AI safety legislation, with a focus on catastrophic and existential risk.)&lt;/p&gt;&lt;p&gt; It's been six months since then, and the election is just two months away (June 23rd), so I thought I'd revisit that post and give an update on my view of how things are going.&lt;/p&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt;&lt;strong&gt; How is Alex Bores doing?&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; When I wrote my post, I expected Bores to talk little about AI during the campaign, just because it wasn't a high-salience issue to voters. But that changed in November, when Leading the Future (the AI accelerationist super PAC) declared Bores their #1 target. Since then, they've spend about $2.5 million on attack ads against him.&lt;/p&gt;&lt;p&gt; LTF's theory of change isn't actually to [...]&lt;/p&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:54) How is Alex Bores doing?&lt;/p&gt;&lt;p&gt;(04:02) How to help&lt;/p&gt;&lt;p&gt;(06:02) A quick note about other opportunities&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 9 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 27th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/pjSKdcBjfvjGexr6A/update-on-the-alex-bores-campaign?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/pjSKdcBjfvjGexr6A/update-on-the-alex-bores-campaign&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Mon, 27 Apr 2026 17:15:04 GMT</pubDate>
      <guid isPermaLink="false">86aabf0e-fc53-4658-b9cf-c4ca7bf9a35b</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/86aabf0e-fc53-4658-b9cf-c4ca7bf9a35b.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=Eric%2520Neyman&amp;title=%22Update%20on%20the%20Alex%20Bores%20campaign%22%20by%20Eric%20Neyman&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FpjSKdcBjfvjGexr6A%2Fupdate-on-the-alex-bores-campaign&amp;created_at=2026-04-27T16%3A51%3A09.847467%2B00%3A00&amp;duration=404" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/pjSKdcBjfvjGexr6A/update-on-the-alex-bores-campaign</link>
      <itunes:duration>404</itunes:duration>
    </item>
    <item>
      <title>“AI companies should publish security assessments” by ryan_greenblatt</title>
      <description>&lt;p&gt; AI companies should get third-party security experts to assess (and possibly also red-team/pen-test) their security against key threat models and then publish the high-level findings of this assessment: the extent to which they can defend against different threat actors for each threat model. They should also publish who did this assessment. The assessment could be commissioned by AI companies, or performed by a third-party institution that AI companies provide with relevant information/access.&lt;/p&gt;
&lt;p&gt; There are presumably lots of important details in doing this well, and I'm not a computer security expert, so I may be getting some of the details wrong. This is a relatively low-effort post in which I'm mostly trying to raise the salience of an idea. (I don't make a detailed case or spell out all the details here.) Thanks to Fabien Roger and Buck Shlegeris for comments and discussion.&lt;/p&gt;
&lt;p&gt; I suspect the controversial part of this claim is that they should make the high-level findings public. Publishing a summary of which threat actors you're robust to (for each relevant threat model) shouldn't meaningfully degrade security against the threat actors we care about, and this information seems important to share publicly.
[1]
&lt;/p&gt;
&lt;p&gt; These [...]&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 4 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 27th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/smCHhegyxEAh3Gygg/ai-companies-should-publish-security-assessments?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/smCHhegyxEAh3Gygg/ai-companies-should-publish-security-assessments&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Mon, 27 Apr 2026 17:00:05 GMT</pubDate>
      <guid isPermaLink="false">5a5b40bc-d12e-4717-b9a7-6b45a5ae997c</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/5a5b40bc-d12e-4717-b9a7-6b45a5ae997c.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=ryan_greenblatt&amp;title=%22AI%20companies%20should%20publish%20security%20assessments%22%20by%20ryan_greenblatt&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FsmCHhegyxEAh3Gygg%2Fai-companies-should-publish-security-assessments&amp;created_at=2026-04-27T14%3A39%3A09.014717%2B00%3A00&amp;duration=342" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/smCHhegyxEAh3Gygg/ai-companies-should-publish-security-assessments</link>
      <itunes:duration>342</itunes:duration>
    </item>
    <item>
      <title>“In defense of parents” by Yair Halberstadt</title>
      <description>&lt;p&gt; Contra Aella on chattel childhood&lt;/p&gt;&lt;p&gt; Aella has a post where she argues that today's parents don't sufficiently respect the independence of their children:&lt;/p&gt;&lt;p&gt; Every culture throughout history has justified the abuse of treating their children as property by arguing this is good for them and good for civilization. Kids need to learn this stuff to be functioning members of society! It's good to learn discipline! You can’t have kids just sitting around playing video games all day! Not everyone is self-directed autodidacts!&lt;/p&gt;&lt;p&gt; Sure, I know that argument. But hopefully if my parents had said to you “do you expect her to learn good morals if we spare the rod?” you would have said “have you even tried other methods?”&lt;/p&gt;&lt;p&gt; It's a hard hitting piece, and it certainly makes me, and presumably other parents, feel uncomfortable.&lt;/p&gt;&lt;p&gt; Unfortunately I don't actually see much of an alternative. Aella seems to think it's as easy as not treating your children as property: &lt;/p&gt;&lt;p&gt; When I was very young, I remember adults treating me like I wasn’t a person, but this didn’t upset me quite as much as the fact that no adult seemed to remember what it was like to be a kid [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(02:19) Most parents arent perfect:&lt;/p&gt;&lt;p&gt;(03:00) Children are a danger to themselves...&lt;/p&gt;&lt;p&gt;(03:55) ... and others&lt;/p&gt;&lt;p&gt;(05:16) Parents have a life too&lt;/p&gt;&lt;p&gt;(06:17) Its for your own good you know!&lt;/p&gt;&lt;p&gt;(07:24) But cant you still treat them like a person?&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 27th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/Lbwri6HmXGtBDsF4f/in-defense-of-parents?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/Lbwri6HmXGtBDsF4f/in-defense-of-parents&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Mon, 27 Apr 2026 17:00:05 GMT</pubDate>
      <guid isPermaLink="false">3a58963e-93dc-47dc-8424-aa2e311d1673</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/3a58963e-93dc-47dc-8424-aa2e311d1673.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Yair%2520Halberstadt&amp;title=%22In%20defense%20of%20parents%22%20by%20Yair%20Halberstadt&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FLbwri6HmXGtBDsF4f%2Fin-defense-of-parents&amp;created_at=2026-04-27T14%3A18%3A34.578782%2B00%3A00&amp;duration=550" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/Lbwri6HmXGtBDsF4f/in-defense-of-parents</link>
      <itunes:duration>550</itunes:duration>
    </item>
    <item>
      <title>“The other paper that killed deep learning theory” by LawrenceC</title>
      <description>&lt;p&gt; Yesterday, I wrote about the state of deep learning theory circa 2016,[1] as well as the bombshell 2016 paper that arguably signaled its demise, Zhang et al.'s Understanding deep learning requires rethinking generalization.&lt;/p&gt;&lt;p&gt; As a brief summary, I argued that the rise of deep learning posed an existential challenge to the dominant theoretical paradigm of statistical learning theory, because neural networks have a lot of complexity. The response from the field was to attempt to quantify other ways in which the hypothesis class of neural networks in practice was simple, using alternative metrics of complexity. Zhang et al. 2016 showed that the standard neural network architectures trained with standard training methods could memorize large quantities of random labelled data, which showed that no such argument could explain the generalization properties of neural networks.&lt;/p&gt;&lt;p&gt; Today we’re going to look at the aftermath: how did the field of deep learning theory react to this paper? What were the attempts to get around this result using data-dependent generalization bounds? And why did Nagarajan and Kolter's humbly titled Uniform convergence may be unable to explain generalization in deep learning serve as the proverbial final nail in the coffin of this line of work? [...]&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 4 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 26th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/zcGmdQHX66NhC69v6/the-other-paper-that-killed-deep-learning-theory?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/zcGmdQHX66NhC69v6/the-other-paper-that-killed-deep-learning-theory&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/a9ee759dbe69286e4e8402d62d02d3ff118d820eb2d232cf1625e87878935545/ya6orvl1rla6hdu4mpyt" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/a9ee759dbe69286e4e8402d62d02d3ff118d820eb2d232cf1625e87878935545/ya6orvl1rla6hdu4mpyt" alt="Mathematical theorem and formula defining spectral complexity for neural networks." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/4f982a69e12bb6b655a123f57a6f2b30a5ccc9467ecc3170c4b1a53f08856541/mveh6k4tmdj9vk5qjr5s" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/4f982a69e12bb6b655a123f57a6f2b30a5ccc9467ecc3170c4b1a53f08856541/mveh6k4tmdj9vk5qjr5s" alt="Four log-log plots showing distance, spectral norms, test error, and bounds versus training set size." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/af286374ffa0c7c371838ec8c626367c86f39500bf3d25d5348787a95cf767b7/km4bxzwheijpzwzcrea7" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/af286374ffa0c7c371838ec8c626367c86f39500bf3d25d5348787a95cf767b7/km4bxzwheijpzwzcrea7" alt="Mathematical theorem and setup describing linear classifier weights and dimensional vectors with training error bounds." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Mon, 27 Apr 2026 13:15:04 GMT</pubDate>
      <guid isPermaLink="false">e0a3c675-7f00-4a88-9255-1ad9e94d7e66</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/e0a3c675-7f00-4a88-9255-1ad9e94d7e66.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=LawrenceC&amp;title=%22The%20other%20paper%20that%20killed%20deep%20learning%20theory%22%20by%20LawrenceC&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FzcGmdQHX66NhC69v6%2Fthe-other-paper-that-killed-deep-learning-theory&amp;created_at=2026-04-27T06%3A57%3A20.552767%2B00%3A00&amp;duration=689" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/zcGmdQHX66NhC69v6/the-other-paper-that-killed-deep-learning-theory</link>
      <itunes:duration>689</itunes:duration>
    </item>
    <item>
      <title>“What holds AI safety together? Co-authorship networks from 200 papers” by Anna Thieser</title>
      <description>&lt;p&gt; We (social science PhD students) computed co-authorship networks based on a corpus of 200 AI safety papers covering 2015-2025, and we’d like your help checking if the underlying dataset is right.&lt;/p&gt;&lt;p&gt; Co-authorship networks make visible the relative prominence of entities involved in AI safety research, and trace relationships between them. Although frontier labs produce lots of research, they remain surprisingly insular — universities dominate centrality in our graphs. The network is held together by a small group of multiply affiliated researchers, often switching between academia and industry mid-career. To us, AI safety looks less like a unified field and more like a trading zone where institutions from different sectors exchange knowledge, financial resources, compute and legitimacy without encroaching on each other's autonomy. &lt;/p&gt;&lt;p&gt; Of course, these visualizations are only as good as the corpus underlying them, because the shape of the network is sensitive to what's included. Here's what it currently looks like showing co-authorship at the individual level:&lt;/p&gt;There are 2 details boxes here, which are omitted from this narration. The boxes have the titles "Figure 1: Methods" and "Figure 1: Color legend".&lt;p&gt; While academic and for-profit authors occupy distinct clusters, over 95% of nodes are part of the [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 24th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/c7D2q7k97QcwCxBrN/what-holds-ai-safety-together-co-authorship-networks-from?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/c7D2q7k97QcwCxBrN/what-holds-ai-safety-together-co-authorship-networks-from&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777050706/lexical_client_uploads/mtogifqhneoxzadcf7yb.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777050706/lexical_client_uploads/mtogifqhneoxzadcf7yb.png" alt="Network visualization showing connections between names, with Mantse Mazaika prominently centered." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777051878/lexical_client_uploads/nwaiveqwjjsfo3ypkmam.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777051878/lexical_client_uploads/nwaiveqwjjsfo3ypkmam.png" alt="Network diagram showing AI research institutions and their connections, with major universities and organizations highlighted." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Mon, 27 Apr 2026 07:45:04 GMT</pubDate>
      <guid isPermaLink="false">6d44b842-d57e-459b-bf9d-fb0ea25c30bb</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/6d44b842-d57e-459b-bf9d-fb0ea25c30bb.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Anna%2520Thieser&amp;title=%22What%20holds%20AI%20safety%20together%3F%20Co-authorship%20networks%20from%20200%20papers%22%20by%20Anna%20Thieser&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Fc7D2q7k97QcwCxBrN%2Fwhat-holds-ai-safety-together-co-authorship-networks-from&amp;created_at=2026-04-25T00%3A47%3A50.253128%2B00%3A00&amp;duration=333" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/c7D2q7k97QcwCxBrN/what-holds-ai-safety-together-co-authorship-networks-from</link>
      <itunes:duration>333</itunes:duration>
    </item>
    <item>
      <title>″“Bad faith” means intentionally misrepresenting your beliefs” by TFD</title>
      <description>&lt;p&gt;&lt;strong&gt; The confusion&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt; I recently came upon a comment which I believe reflects a persistent confusion among rationalist/EA types. I was reading this post which contains ideas that the other has but doesn't have time to write posts about. One of those relates to the concept of "good faith", labelled "most arguments are not in good faith, of course":&lt;/p&gt;

&lt;p&gt; Look, I love good faith discourse (here meaning "conversation in which the primary goal of all participants is to help other participants and onlookers arrive at true beliefs").&lt;/p&gt;

&lt;p&gt; The definition given for "good faith discourse" seems incorrect to me, and it's not a close call in my opinion. The level of incorrectness in my view is something like saying "I like people who obey the law (here meaning never committing a social faux pas)". This isn't the first time I have seen someone in this community advance a similar view on the meaning of good/bad faith. For example, this post.&lt;/p&gt;
&lt;p&gt; I thought it might be useful to bring this apparent disagreement to the foreground, so I will lay out my belief about what this concept means. I suspect this disagreement may also involve an underlying about what [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:11) The confusion&lt;/p&gt;&lt;p&gt;(01:33) My definition&lt;/p&gt;&lt;p&gt;(01:48) General meaning&lt;/p&gt;&lt;p&gt;(01:59) Good faith discourse&lt;/p&gt;&lt;p&gt;(02:25) Evidence&lt;/p&gt;&lt;p&gt;(02:40) Sources&lt;/p&gt;&lt;p&gt;(07:27) Classic usage&lt;/p&gt;&lt;p&gt;(07:36) Good faith errors or mistakes&lt;/p&gt;&lt;p&gt;(09:32) Good faith negotiation&lt;/p&gt;&lt;p&gt;(10:59) Conclusion&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 26th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/usnJtFsJq5seN4aJJ/bad-faith-means-intentionally-misrepresenting-your-beliefs?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/usnJtFsJq5seN4aJJ/bad-faith-means-intentionally-misrepresenting-your-beliefs&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Mon, 27 Apr 2026 04:30:05 GMT</pubDate>
      <guid isPermaLink="false">82bc3654-49ae-482d-b8c7-e88df8f8af62</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/82bc3654-49ae-482d-b8c7-e88df8f8af62.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=TFD&amp;title=%22%E2%80%9CBad%20faith%E2%80%9D%20means%20intentionally%20misrepresenting%20your%20beliefs%22%20by%20TFD&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FusnJtFsJq5seN4aJJ%2Fbad-faith-means-intentionally-misrepresenting-your-beliefs&amp;created_at=2026-04-26T19%3A07%3A10.649164%2B00%3A00&amp;duration=680" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/usnJtFsJq5seN4aJJ/bad-faith-means-intentionally-misrepresenting-your-beliefs</link>
      <itunes:duration>680</itunes:duration>
    </item>
    <item>
      <title>“Retrospective on my unsupervised elicitation challenge” by DanielFilan</title>
      <description>&lt;p&gt; This post contains spoilers for the unsupervised elicitation challenge of getting Claude to get my Ancient Greek homework right.&lt;/p&gt;

&lt;p&gt; tl;dr Opus 4.7 one-shots it, nothing else worked.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt; The challenge&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt; A few weeks ago, I announced to the world my Unsupervised Elicitation Challenge (my blog, LessWrong). I’d encourage you to read that post for the context, but the tl;dr is that there was a fill-in-the-blank exercise early on in my Ancient Greek textbook that Claude Opus 4.6 didn’t fill out correctly by default, but could do correctly if I prodded it a bit. The challenge was to get it to fill out the answers correctly without knowing any Ancient Greek yourself—after all, Opus 4.6 apparently has this knowledge somewhere internally (as you might expect, given that it's a large language model that has presumably read the whole corpus of Ancient Greek as well as many textbooks on the topic), but I was only able to extract it out because I knew what to ask about.&lt;/p&gt;

&lt;p&gt; The general idea of the challenge is to mimic a hard version of AI alignment, in some sense: suppose that there's some task you want an AI to complete, but can’t [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:23) The challenge&lt;/p&gt;&lt;p&gt;(02:11) The secret: accents&lt;/p&gt;&lt;p&gt;(04:42) Is this unfair?&lt;/p&gt;&lt;p&gt;(05:46) Nobody succeeded&lt;/p&gt;&lt;p&gt;(07:57) What this says about alignment&lt;/p&gt;&lt;p&gt;(08:38) The problem of Opus 4.7&lt;/p&gt;&lt;p&gt;(10:47) Next steps for unsupervised elicitation&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 8 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 26th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/aJ3NTJ6tM7yc49kQn/retrospective-on-my-unsupervised-elicitation-challenge?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/aJ3NTJ6tM7yc49kQn/retrospective-on-my-unsupervised-elicitation-challenge&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Mon, 27 Apr 2026 02:30:05 GMT</pubDate>
      <guid isPermaLink="false">6e36ab38-c0ca-4e04-b512-3096b34be718</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/6e36ab38-c0ca-4e04-b512-3096b34be718.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=DanielFilan&amp;title=%22Retrospective%20on%20my%20unsupervised%20elicitation%20challenge%22%20by%20DanielFilan&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FaJ3NTJ6tM7yc49kQn%2Fretrospective-on-my-unsupervised-elicitation-challenge&amp;created_at=2026-04-27T00%3A31%3A29.319294%2B00%3A00&amp;duration=756" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/aJ3NTJ6tM7yc49kQn/retrospective-on-my-unsupervised-elicitation-challenge</link>
      <itunes:duration>756</itunes:duration>
    </item>
    <item>
      <title>“Control protocols don’t always need to know which models are scheming” by Fabien Roger</title>
      <description>&lt;p&gt; These are my personal views.&lt;/p&gt;&lt;p&gt; To detect if an agent is taking a catastrophically dangerous action, you might want to monitor its actions using the smartest model that is too weak to be a schemer. But knowing what models are weak enough that they are unlikely to scheme is difficult, which puts you in a difficult spot: take a model too strong and it might actually be a schemer and lie to you, take a model too weak and it might fail just because it's too dumb to notice the danger. So instead you can just use all models as monitors at once and block actions if any of the monitors flag it as dangerous[1].&lt;/p&gt;&lt;p&gt; In some situations, this lets you build protocols that you can implement without knowing which models are scheming and that get the same level of safety as if you perfectly knew which models were scheming (though it doesn’t let you know what level of safety you actually have).&lt;/p&gt;&lt;p&gt; This post presents some toy models illustrating the benefits of ensembling monitors of various trustedness and explains what consequences it might have in practice to be using protocols that are somewhat agnostic about which of the [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:30) Toy models&lt;/p&gt;&lt;p&gt;(01:33) The n=2 case&lt;/p&gt;&lt;p&gt;(03:52) Continuum of models&lt;/p&gt;&lt;p&gt;(04:50) Ensembling monitors in practice&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 1 footnote which was omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 26th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/WiYDawNhFp5cM7uqF/control-protocols-don-t-always-need-to-know-which-models-are?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/WiYDawNhFp5cM7uqF/control-protocols-don-t-always-need-to-know-which-models-are&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Sun, 26 Apr 2026 21:45:04 GMT</pubDate>
      <guid isPermaLink="false">9145ccae-6fd3-472a-ac6d-c67ea832ce55</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/9145ccae-6fd3-472a-ac6d-c67ea832ce55.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Fabien%2520Roger&amp;title=%22Control%20protocols%20don%E2%80%99t%20always%20need%20to%20know%20which%20models%20are%20scheming%22%20by%20Fabien%20Roger&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FWiYDawNhFp5cM7uqF%2Fcontrol-protocols-don-t-always-need-to-know-which-models-are&amp;created_at=2026-04-26T19%3A16%3A12.438673%2B00%3A00&amp;duration=635" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/WiYDawNhFp5cM7uqF/control-protocols-don-t-always-need-to-know-which-models-are</link>
      <itunes:duration>635</itunes:duration>
    </item>
    <item>
      <title>“Anthropic spent too much don’t-be-annoying capital on Mythos” by draganover</title>
      <description>&lt;p&gt; I have seen a lot of coverage from reasonable people suggesting that Claude's new model, Mythos, is a vehicle for Anthropic to peddle hype and doom in order to raise money. While some of this is necessarily motivated by people's unwillingness to stare into the abyss of our AI future, the breadth of otherwise-reasonable people who have voiced these kinds of cynical opinions suggests that part of the blame rests on Anthropic.&lt;/p&gt;&lt;p&gt; In this post, I want to briefly unpack the way people misinterpreted the evidence, their valid reasons for doing so, and what Anthropic (and the AI safety community more broadly) should learn from this.&lt;/p&gt;&lt;p&gt; In particular, since we will inevitably see other dangerous capabilities spontaneously emerge in the future, we need protocols for how to announce them effectively. To this end, I try to make the following points:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;We should be mindful of the public's sympathy for AI safety prophecies and orient ourselves towards growing this resource rather than expending it. I call this our don't-be-annoying capital.&lt;/li&gt;&lt;li value="2"&gt;People have good reasons to be skeptical of Anthropic's claims. Overcoming this requires a particularly high burden of evidence.&lt;/li&gt;&lt;li value="3"&gt;We should acknowledge that Anthropic has a conflict of interest [...]&lt;/li&gt;&lt;/ul&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:27) 1. Mythos criticisms &amp;amp; missing the point&lt;/p&gt;&lt;p&gt;(03:04) 2. It feels like people wanted to miss the point?&lt;/p&gt;&lt;p&gt;(04:19) 3. There must be a lesson here.&lt;/p&gt;&lt;p&gt;(05:59) 4. Some options for next time&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 1 footnote which was omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 26th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/eSxvdX43d4rC4wAsw/anthropic-spent-too-much-don-t-be-annoying-capital-on-mythos?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/eSxvdX43d4rC4wAsw/anthropic-spent-too-much-don-t-be-annoying-capital-on-mythos&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777203659/lexical_client_uploads/fn5bbetygxedhmon0ggp.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1777203659/lexical_client_uploads/fn5bbetygxedhmon0ggp.png" alt="Sara tweets: "These are all just salespeople looking to secure VC millions. To take any of their projections or observations seriously is to become a part of their business development exercise, no more."" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sun, 26 Apr 2026 17:00:05 GMT</pubDate>
      <guid isPermaLink="false">67e47b4c-28f2-4b0f-b18b-e4bbe86db378</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/67e47b4c-28f2-4b0f-b18b-e4bbe86db378.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=draganover&amp;title=%22Anthropic%20spent%20too%20much%20don%E2%80%99t-be-annoying%20capital%20on%20Mythos%22%20by%20draganover&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FeSxvdX43d4rC4wAsw%2Fanthropic-spent-too-much-don-t-be-annoying-capital-on-mythos&amp;created_at=2026-04-26T12%3A15%3A11.839063%2B00%3A00&amp;duration=602" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/eSxvdX43d4rC4wAsw/anthropic-spent-too-much-don-t-be-annoying-capital-on-mythos</link>
      <itunes:duration>602</itunes:duration>
    </item>
    <item>
      <title>“The paper that killed deep learning theory” by LawrenceC</title>
      <description>&lt;p&gt; Around 10 years ago, a paper came out that arguably killed classical deep learning theory: Zhang et al. 's aptly titled Understanding deep learning requires rethinking generalization.&lt;/p&gt;&lt;p&gt; Of course, this is a bit of an exaggeration. No single paper ever kills a field of research on its own, and deep learning theory was not exactly the most productive and healthy field at the time this was published. But if I had to point to a single paper that shattered the feeling of optimism at the time, it would be Zhang et al. 2016.[1] &lt;/p&gt;&lt;p&gt; Caption: believe it or not, this unassuming table rocked the field of deep learning theory back in 2016, despite probably involving fewer computational resources than what Claude 4.7 Opus consumed when I clicked the “Claude” button embedded into the LessWrong editor.&lt;/p&gt;&lt;p&gt; —&lt;/p&gt;&lt;p&gt; Let's start by answering a question: what, exactly, do I mean by deep learning theory?&lt;/p&gt;&lt;p&gt; At least in 2016, the answer was: “extending statistical learning theory to deep neural networks trained with SGD, in order to derive generalization bounds that would explain their behavior in practice”.&lt;/p&gt;&lt;p&gt; —&lt;/p&gt;&lt;p&gt; Since its conception in the mid 1980s, statistical learning theory had been the dominant approach for [...]&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 2 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 25th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/ZvQfcLbcNHYqmvWyo/the-paper-that-killed-deep-learning-theory?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/ZvQfcLbcNHYqmvWyo/the-paper-that-killed-deep-learning-theory&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/bd260105325c4383584f07f70859bf2a3c792083c7c89de198978772800a9c29/rokdkfphnilzofzxmz7m" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/bd260105325c4383584f07f70859bf2a3c792083c7c89de198978772800a9c29/rokdkfphnilzofzxmz7m" alt="Table comparing training and test accuracy of models on CIFAR10 dataset with various configurations." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/089579c755f591822bd4e6d179b1e0eff3a44cd4bba5e089efc6578acbed23fc/afoeu62j0avnk1rrrl7k" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/089579c755f591822bd4e6d179b1e0eff3a44cd4bba5e089efc6578acbed23fc/afoeu62j0avnk1rrrl7k" alt="Stick figure comic about difficulty detecting birds in photos versus national park locations." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/9b3cf38a27b50366bb58bdb55bddf322f5bbabb2172b40f712a22990f29164e5/ov0f9aweaes1uevycijk" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/9b3cf38a27b50366bb58bdb55bddf322f5bbabb2172b40f712a22990f29164e5/ov0f9aweaes1uevycijk" alt="Text excerpt discussing randomization tests and deep neural networks fitting random labels." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/61c40c609577bc7500f4053034ea5f2ff4a39b03f87fee3285a5a4f3006e88d5/kcuhooqm3k7pheyuwm1z" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/61c40c609577bc7500f4053034ea5f2ff4a39b03f87fee3285a5a4f3006e88d5/kcuhooqm3k7pheyuwm1z" alt="Three graphs showing neural network training results with label corruption on CIFAR10 dataset." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ZvQfcLbcNHYqmvWyo/fca5ffd560d7695c9bf60c3e1cea8c7f5f5cf31abc573647a5f5aa622f200c45/rygxwayniopy1ikebdyf" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ZvQfcLbcNHYqmvWyo/fca5ffd560d7695c9bf60c3e1cea8c7f5f5cf31abc573647a5f5aa622f200c45/rygxwayniopy1ikebdyf" alt="Table showing Inception v3 accuracy with different regularization methods on ImageNet dataset." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ZvQfcLbcNHYqmvWyo/1253c032c5af7921ace80d3d94bfd51e5d8dc545146424e0fc04d491edbedb8d/bri8xt6tncqaydhxryrc" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ZvQfcLbcNHYqmvWyo/1253c032c5af7921ace80d3d94bfd51e5d8dc545146424e0fc04d491edbedb8d/bri8xt6tncqaydhxryrc" alt="Table showing training and test accuracy of models on CIFAR10 dataset with various configurations." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sun, 26 Apr 2026 12:45:17 GMT</pubDate>
      <guid isPermaLink="false">13b090b8-f5a5-4888-9c8f-b0119caac63a</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/13b090b8-f5a5-4888-9c8f-b0119caac63a.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=LawrenceC&amp;title=%22The%20paper%20that%20killed%20deep%20learning%20theory%22%20by%20LawrenceC&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FZvQfcLbcNHYqmvWyo%2Fthe-paper-that-killed-deep-learning-theory&amp;created_at=2026-04-26T06%3A55%3A06.817139%2B00%3A00&amp;duration=685" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/ZvQfcLbcNHYqmvWyo/the-paper-that-killed-deep-learning-theory</link>
      <itunes:duration>685</itunes:duration>
    </item>
    <item>
      <title>“Forecasting is Way Overrated, and We Should Stop Funding It” by mabramov</title>
      <description>&lt;p&gt; Summary &lt;/p&gt;&lt;p&gt; EA and rationalists got enamoured with forecasting and prediction markets and made them part of the culture, but this hasn’t proven very useful, yet it continues to receive substantial EA funding. We should cut it off.&lt;/p&gt;&lt;p&gt; My Experience with Forecasting&lt;/p&gt;&lt;p&gt; For a while, I was the number one forecaster on Manifold. This lasted for about a year until I stopped just over 2 years ago. To this day, despite quitting, I’m still #8 on the platform. Additionally, I have done well on real-money prediction markets (Polymarket), earning mid-5 figures and winning a few AI bets. I say this to suggest that I would gain status from forecasting being seen as useful, but I think, to the contrary, that the EA community should stop funding it.&lt;/p&gt;&lt;p&gt; I’ve written a few comments throughout the years that I didn’t think forecasting was worth funding. You can see some of these here and here. Finally, I have gotten around to making this full post.&lt;/p&gt;&lt;p&gt; Solution Seeking a Problem&lt;/p&gt;&lt;p&gt; When talking about forecasting, people often ask questions like “How can we leverage forecasting into better decisions?” This is the wrong way to go about solving problems. You solve problems by starting with [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 25th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/WCutvyr9rr3cpF6hx/forecasting-is-way-overrated-and-we-should-stop-funding-it?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/WCutvyr9rr3cpF6hx/forecasting-is-way-overrated-and-we-should-stop-funding-it&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Sat, 25 Apr 2026 23:15:21 GMT</pubDate>
      <guid isPermaLink="false">55a8a5e2-b7a1-4e53-8a16-dda977ccac5b</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/55a8a5e2-b7a1-4e53-8a16-dda977ccac5b.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=mabramov&amp;title=%22Forecasting%20is%20Way%20Overrated%2C%20and%20We%20Should%20Stop%20Funding%20It%22%20by%20mabramov&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FWCutvyr9rr3cpF6hx%2Fforecasting-is-way-overrated-and-we-should-stop-funding-it&amp;created_at=2026-04-25T22%3A39%3A37.164804%2B00%3A00&amp;duration=523" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/WCutvyr9rr3cpF6hx/forecasting-is-way-overrated-and-we-should-stop-funding-it</link>
      <itunes:duration>523</itunes:duration>
    </item>
    <item>
      <title>″“Thinkhaven”” by Raemon</title>
      <description>&lt;p&gt; Inkhaven has people writing a blogpost a day for 30 days. I think this is a pretty great, straightforward exercise, that I'd definitely want in a hypothetical Rationality Undergraduate Program. But, it's not the only such exercise I'd want to include. It's gotten me excited for a different (superficially similar) program, which I might call "Thinkhaven."&lt;/p&gt;&lt;p&gt; In Thinkhaven, the goal is to learn the skill of "relentlessly think new, useful thoughts you haven't thought before." &lt;/p&gt;&lt;p&gt; Inkhaven had a basic "Goodhart-able" goal of "publish 500+ words every day." For Thinkhaven, I would imagine the Goodhart-y goal being something like:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;Every day, you must publish 500 words of "research journal." They should be cogent enough for third parties to follow along with your thought process, but, they don't need to end with nice discrete conclusions.&amp;nbsp;&lt;/li&gt;&lt;li value="2"&gt;Every two weeks, you must also publish a 2500 word effortpost.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; And, somewhat more opinionatedly:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;Each journal entry should include at least one new question that you're thinking about. (Often, these will be subquestions within a broader Big Question that you're exploring that week, or a reframing of that big question that feels like a better pointer)&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; The spirit of the Goodhart is [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(02:57) Mentors, and Different Styles of Thinking&lt;/p&gt;&lt;p&gt;(05:20) Thinking Assistants / Research Managers&lt;/p&gt;&lt;p&gt;(05:59) FAQ&lt;/p&gt;&lt;p&gt;(06:18) Why require a new question each day?&lt;/p&gt;&lt;p&gt;(07:28) What would asking new questions look like, in practice?&lt;/p&gt;&lt;p&gt;(09:27) What should a Daily Journal look like?&lt;/p&gt;&lt;p&gt;(10:39) Are the daily journals public? Why?&lt;/p&gt;&lt;p&gt;(11:29) Do we even need the 2500 word effortpost? Why cant it just be research journals all the way down?&lt;/p&gt;&lt;p&gt;(11:57) Why 2500 words every 2 weeks in particular?&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 1 footnote which was omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 25th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/Wcrujo7wadRpuq8vx/thinkhaven-1?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/Wcrujo7wadRpuq8vx/thinkhaven-1&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Sat, 25 Apr 2026 18:30:20 GMT</pubDate>
      <guid isPermaLink="false">5e783fd8-ee3f-45c3-8f26-5818b3ecd642</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/5e783fd8-ee3f-45c3-8f26-5818b3ecd642.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Raemon&amp;title=%22%E2%80%9CThinkhaven%E2%80%9D%22%20by%20Raemon&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FWcrujo7wadRpuq8vx%2Fthinkhaven-1&amp;created_at=2026-04-25T18%3A12%3A59.598234%2B00%3A00&amp;duration=838" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/Wcrujo7wadRpuq8vx/thinkhaven-1</link>
      <itunes:duration>838</itunes:duration>
    </item>
    <item>
      <title>“Is the Cat Out of the Bag?: Who knows how to make AGI?” by Oliver Sourbut</title>
      <description>&lt;p&gt; Adapted from 2025-04-10 memo to AISI&lt;/p&gt;&lt;p&gt; I’ve previously made arguments like:&lt;/p&gt;&lt;p&gt; Not long after it becomes possible for someone to make powerful artificial intelligence[1], it might become possible for practically anyone to make powerful AI.&lt;/p&gt;&lt;ol&gt; &lt;li value="1"&gt;Compute gets exponentially cheaper by default.&lt;/li&gt;&lt;li value="2"&gt;Knowledge proliferates (fast!) by default: AI techniques are typically simple and easy once discovered.&lt;/li&gt;&lt;li value="3"&gt;What's more, AGI-making know-how may be widespread already.&lt;/li&gt;&lt;/ol&gt;&lt;p&gt; Or, as Yudkowsky puts it[2],&lt;/p&gt;&lt;p&gt; Moore's Law of Mad Science: Every eighteen months, the minimum IQ necessary to destroy the world[3]drops by one point. - Yudkowsky&lt;/p&gt;&lt;p&gt; It's important to emphasise that none of these are laws of nature! But the economic and social forces at work are quite strong.&lt;/p&gt;&lt;p&gt; So (leaving aside debates about the appropriate definition of ‘AGI’) where the frontier of AI development leads, others – many others – potentially rapidly follow. Followers can go even faster by stealing or otherwise harvesting insights from the frontier, but this is not a hard requirement – just an accelerant.&lt;/p&gt;&lt;p&gt; For more on the first point, compute getting cheaper, consider Moore's law (or the more general and robust Wright's law). What about the know-how?&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Stupid, Simple AGI&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; The stupidest, simplest possible approach to producing general intelligence [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:42) Stupid, Simple AGI&lt;/p&gt;&lt;p&gt;(02:44) The Design Space for AGI&lt;/p&gt;&lt;p&gt;(05:27) Accelerants&lt;/p&gt;&lt;p&gt;(08:29) Concluding&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 11 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 24th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/hdmAvx9ZD8WLmBCZD/is-the-cat-out-of-the-bag-who-knows-how-to-make-agi?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/hdmAvx9ZD8WLmBCZD/is-the-cat-out-of-the-bag-who-knows-how-to-make-agi&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/hdmAvx9ZD8WLmBCZD/mnhdvwulejy7ca3hdy8c" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/hdmAvx9ZD8WLmBCZD/mnhdvwulejy7ca3hdy8c" alt="Graph showing cost of final training run for frontier AI models decreasing over time from 2022 to 2033." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Sat, 25 Apr 2026 14:15:21 GMT</pubDate>
      <guid isPermaLink="false">ff9b3ec4-2f1e-45bb-9623-2e3f8005b2de</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/ff9b3ec4-2f1e-45bb-9623-2e3f8005b2de.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Oliver%2520Sourbut&amp;title=%22Is%20the%20Cat%20Out%20of%20the%20Bag%3F%3A%20Who%20knows%20how%20to%20make%20AGI%3F%22%20by%20Oliver%20Sourbut&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FhdmAvx9ZD8WLmBCZD%2Fis-the-cat-out-of-the-bag-who-knows-how-to-make-agi&amp;created_at=2026-04-24T21%3A49%3A54.913999%2B00%3A00&amp;duration=579" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/hdmAvx9ZD8WLmBCZD/is-the-cat-out-of-the-bag-who-knows-how-to-make-agi</link>
      <itunes:duration>579</itunes:duration>
    </item>
    <item>
      <title>“Against the “Permanent” Underclass” by Marcus Plutowski</title>
      <description>&lt;p&gt; The whole discourse around a “permanent underclass” always seemed somewhat farcical to me — at best a distraction, at worst an actively harmful meme insofar as it freaks people out and tries to provide (shallow, but nevertheless) justification going whole hog on building strong AI. So it has been with a sense of dismay that I’ve seen this phrase come into popular parlance 1 2 3, and increasingly come to motivate a sort of frenzied upward striving in my second- and third-degree connections. Among those I know, the standard framing is that we need to “get the bag” while there's still a chance, to take advantage of these last few years of income-potential to escape whatever horrible fate lays in store for the rest of humanity. I think that events are unlikely to play out this way; even setting aside “doom” arguments (for reasons I’ll get to below), I think history shows that in times of transition, wealth is far less of a guarantee than people intuitively think. So “the bag” will provide little guarantee of thriving, or even surviving, into a “post-AI world”.&lt;/p&gt;&lt;p&gt; There have certainly been arguments against the “grab the bag” thesis (as I’ll henceforth refer [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(03:22) Who is the Modern Handloom Weaver?&lt;/p&gt;&lt;p&gt;(06:44) The Medieval Peasantry and the Senatorial Order&lt;/p&gt;&lt;p&gt;(12:15) But where did the senators go?&lt;/p&gt;&lt;p&gt;(15:33) Who is the Modern Roman Senator?&lt;/p&gt;&lt;p&gt;(18:43) Possible Futures&lt;/p&gt;&lt;p&gt;(24:32) Centralization of Power&lt;/p&gt;&lt;p&gt;(28:28) Digression: What I actually mean by grab the bag&lt;/p&gt;&lt;p&gt;(30:00) Conclusion&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 24th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/LfFLKZzPGDYQ3DCy8/against-the-permanent-underclass?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/LfFLKZzPGDYQ3DCy8/against-the-permanent-underclass&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Sat, 25 Apr 2026 14:00:21 GMT</pubDate>
      <guid isPermaLink="false">a862f3dd-3589-4e9a-bf97-d46dcc2fb31e</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/a862f3dd-3589-4e9a-bf97-d46dcc2fb31e.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Marcus%2520Plutowski&amp;title=%22Against%20the%20%E2%80%9CPermanent%E2%80%9D%20Underclass%22%20by%20Marcus%20Plutowski&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FLfFLKZzPGDYQ3DCy8%2Fagainst-the-permanent-underclass&amp;created_at=2026-04-25T01%3A58%3A42.005495%2B00%3A00&amp;duration=1866" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/LfFLKZzPGDYQ3DCy8/against-the-permanent-underclass</link>
      <itunes:duration>1866</itunes:duration>
    </item>
    <item>
      <title>“Quick Paper Review: “There Will Be a Scientific Theory of Deep Learning”” by LawrenceC</title>
      <description>&lt;p&gt; h/t Eric Michaud for sharing his paper with me.&lt;/p&gt;&lt;p&gt; There's a tradition of high-impact ML papers using short, punchy categorical sentences as their titles: Understanding Deep Learning Requires Rethinking Generalization, Attention is All You Need, Language Models Are Few Shot Learners, and so forth. &lt;/p&gt;&lt;p&gt; A new paper by Simon et al. seeks to expand on this tradition with not a present claim but a future tense, prophetic future sentence: “There Will Be a Scientific Theory of Deep Learning”. &lt;/p&gt;&lt;p&gt; There's a lot of pessimism toward deep learning theory basically everywhere: the people building the AIs are pretty pessimistic, the academic AI researchers are as a general rule pessimistic (even people who used to do theory!), and with the exception of maybe 3-4 research groups, the independent AI safety ecosystem has long since given up on hoping for a theory to understand deep learning. &lt;/p&gt;&lt;p&gt; The paper is less of a neutral assessment of the evidence and more of a manifesto arguing for a particular theoretical deep learning research agenda. Given the overall sense of doom and gloom, its form makes sense: any less and it might not be enough to shine through the general sense of pessimism to [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 24th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/2WkeiYFT5p3ph8Pkf/quick-paper-review-there-will-be-a-scientific-theory-of-deep?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/2WkeiYFT5p3ph8Pkf/quick-paper-review-there-will-be-a-scientific-theory-of-deep&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Sat, 25 Apr 2026 13:15:21 GMT</pubDate>
      <guid isPermaLink="false">bb638096-1596-4753-8407-4741afa2b394</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/bb638096-1596-4753-8407-4741afa2b394.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=LawrenceC&amp;title=%22Quick%20Paper%20Review%3A%20%E2%80%9CThere%20Will%20Be%20a%20Scientific%20Theory%20of%20Deep%20Learning%E2%80%9D%22%20by%20LawrenceC&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F2WkeiYFT5p3ph8Pkf%2Fquick-paper-review-there-will-be-a-scientific-theory-of-deep&amp;created_at=2026-04-25T06%3A55%3A32.378583%2B00%3A00&amp;duration=655" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/2WkeiYFT5p3ph8Pkf/quick-paper-review-there-will-be-a-scientific-theory-of-deep</link>
      <itunes:duration>655</itunes:duration>
    </item>
    <item>
      <title>“Protecting Cognitive Integrity: Our internal AI use policy (V1)” by Tom DAVID</title>
      <description>&lt;p&gt; We (at GPAI Policy Lab), wanted to share our V1 policy as an invitation to argue about it. Some of what motivates it is extrapolation and conversations we had internally on AI capabilities, effects on the cognitions, and some empirical evidence. I think the expected cost of being somewhat over-cautious here is lower than the cost of being under-cautious, and the topic deserves considerably more attention than it's currently getting. I'd love to see more orgs publish their own policies on this, both to compare experiences and to develop shared best practices.&lt;br&gt; &lt;br&gt; I'd particularly welcome:&lt;br&gt; - Counterarguments from people who think this kind of policy is overblown, counterproductive, or targets the wrong mechanisms.&lt;br&gt; - Experience from other AI safety or AI policy orgs that have tried something similar, what worked, what didn't, what you'd change for V2.&lt;br&gt; - Specific critiques of the restrictions themselves: too narrow, too broad, wrong category, wrong threshold.&lt;br&gt; - Or event alternative framings. Do you think "cognitive integrity" is the right handle? Is this a special case of a more general problem we should be thinking about differently?&lt;/p&gt;&lt;p&gt; If you've written anything on this, or are working on something similar inside your own organization [...]&lt;/p&gt;&lt;/br&gt;&lt;/br&gt;&lt;/br&gt;&lt;/br&gt;&lt;/br&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:32) Why Im writing this&lt;/p&gt;&lt;p&gt;(03:33) The policy&lt;/p&gt;&lt;p&gt;(03:58) 1. Hard restrictions&lt;/p&gt;&lt;p&gt;(07:32) 2. Individual and collective warning signals&lt;/p&gt;&lt;p&gt;(08:37) 3. Protocol&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 24th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/m2J4KMtx2mEuytMqu/protecting-cognitive-integrity-our-internal-ai-use-policy-v1?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/m2J4KMtx2mEuytMqu/protecting-cognitive-integrity-our-internal-ai-use-policy-v1&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Fri, 24 Apr 2026 20:30:21 GMT</pubDate>
      <guid isPermaLink="false">d45d98dc-1df6-4450-a2da-954ccfc9834d</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/d45d98dc-1df6-4450-a2da-954ccfc9834d.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Tom%2520DAVID&amp;title=%22Protecting%20Cognitive%20Integrity%3A%20Our%20internal%20AI%20use%20policy%20(V1)%22%20by%20Tom%20DAVID&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Fm2J4KMtx2mEuytMqu%2Fprotecting-cognitive-integrity-our-internal-ai-use-policy-v1&amp;created_at=2026-04-24T15%3A43%3A18.474408%2B00%3A00&amp;duration=559" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/m2J4KMtx2mEuytMqu/protecting-cognitive-integrity-our-internal-ai-use-policy-v1</link>
      <itunes:duration>559</itunes:duration>
    </item>
    <item>
      <title>“Methodology for inferring propensities of LLMs” by Olli Järviniemi</title>
      <description>&lt;p&gt; Our team at UK AISI has released a paper on inferring LLM propensities for undesired behaviour.&lt;/p&gt;&lt;p&gt; I view this primarily as a methodology paper, and in this post I will talk about that:[1] First, I distinguish the aim of providing evidence on theoretical arguments regarding misalignment as separate from more red-teaming flavoured propensity research. Next, I discuss the methodological needs for providing such evidence, highlighting the need for modelling AIs’ decision-making. Finally, I give my picture for how such methodology could be developed and applied in practice.&lt;/p&gt;&lt;p&gt; This post can be read independently from the paper. &lt;/p&gt;&lt;p&gt;&lt;strong&gt; Aims for&amp;nbsp;propensity research&amp;nbsp;&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; I use propensity to refer to what models will try to do, in contrast to questions about what they are capable of. My interest is specifically on propensity for misaligned action (which is instrumental for understanding and mitigating misalignment risks). &lt;/p&gt;&lt;p&gt; One central example of existing propensity research is Anthropic's Agentic Misalignment work. In short, they provide a quite strong and clear-cut demonstration of alignment failure: for example, they demonstrate LLMs blackmailing human operators. &lt;/p&gt;&lt;p&gt; After the work came out, there was discussion and disagreement about the implications of this work for misalignment risks more broadly (e.g. because of [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:48) Aims for propensity research&lt;/p&gt;&lt;p&gt;(03:53) Methodological needs&lt;/p&gt;&lt;p&gt;(06:44) Applying the methodology in practice&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 8 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 24th, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/g9FmhKL2vL45TuT9B/methodology-for-inferring-propensities-of-llms?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/g9FmhKL2vL45TuT9B/methodology-for-inferring-propensities-of-llms&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Fri, 24 Apr 2026 19:30:22 GMT</pubDate>
      <guid isPermaLink="false">db757eae-b311-4216-b7a5-039a77a51b93</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/db757eae-b311-4216-b7a5-039a77a51b93.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Olli%2520J%25C3%25A4rviniemi&amp;title=%22Methodology%20for%C2%A0inferring%20propensities%C2%A0of%20LLMs%22%20by%20Olli%20J%C3%A4rviniemi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Fg9FmhKL2vL45TuT9B%2Fmethodology-for-inferring-propensities-of-llms&amp;created_at=2026-04-24T13%3A41%3A01.425698%2B00%3A00&amp;duration=627" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/g9FmhKL2vL45TuT9B/methodology-for-inferring-propensities-of-llms</link>
      <itunes:duration>627</itunes:duration>
    </item>
    <item>
      <title>“vLLM-Lens: Fast Interpretability Tooling That Scales to Trillion-Parameter Models” by Alan Cooney, Sid Black</title>
      <description>&lt;p&gt; TL;DR: vLLM-Lens is a vLLM plugin for top-down interpretability techniques[1] such as probes, steering, and activation oracles. We benchmarked it as 8–44× faster than existing alternatives for single-GPU use, though we note a planned version of nnsight closes this gap. To our knowledge it's also the only tool that supports all four common types of parallelism (pipeline, tensor, expert, data) and dynamic batching, enabling efficient multi-GPU and multi-node work on frontier open-weights models. It is also integrated with Inspect. The main trade-off, compared to other tools such as nnsight and TransformerLens, is that it's less flexible out-of-the-box. It is however very small and extensible - it could likely be adapted to your use case and we have a Garcon style interface in the works.&lt;/p&gt;&lt;p&gt; We are releasing it under an MIT license here: https://github.com/UKGovernmentBEIS/vllm-lens.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Problems it Addresses&lt;/strong&gt;&lt;/p&gt;&lt;ol&gt; &lt;li value="1"&gt;Large-model support. Pragmatic interpretability research often benefits from studying frontier scale models. For example, Read et al. (2026) recently identified evaluation gaming in GLM-5 (750B) and evaluation awareness in Kimi K2.5 (1T), but did not find the same phenomenon in smaller models. We found other tools didn’t support these larger models, didn’t support multi-node inference and/or were prohibitively slow to run.&lt;/li&gt;&lt;li [...]&lt;/ol&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:11) Problems it Addresses&lt;/p&gt;&lt;p&gt;(02:31) Functionality&lt;/p&gt;&lt;p&gt;(03:10) Comparisons with Other Tooling&lt;/p&gt;&lt;p&gt;(03:59) Single-GPU Performance&lt;/p&gt;&lt;p&gt;(05:45) Multi-Node Performance&lt;/p&gt;&lt;p&gt;(06:35) Limitations&lt;/p&gt;&lt;p&gt;(07:10) Technical Approach&lt;/p&gt;&lt;p&gt;(09:25) Credits&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 5 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 23rd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/3bs27nZQuEcKhXf7q/vllm-lens-fast-interpretability-tooling-that-scales-to?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/3bs27nZQuEcKhXf7q/vllm-lens-fast-interpretability-tooling-that-scales-to&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776170607/lexical_client_uploads/soobx5kyv5cypheeri9a.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776170607/lexical_client_uploads/soobx5kyv5cypheeri9a.png" alt="A bar graph showing "Activation extraction — 1000 samples, OPT-30B (TP=1)"." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Fri, 24 Apr 2026 11:15:37 GMT</pubDate>
      <guid isPermaLink="false">f1d5ac9f-f0cd-4121-9a5b-ad805af5d9c8</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/f1d5ac9f-f0cd-4121-9a5b-ad805af5d9c8.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Alan%2520Cooney%252C%2520Sid%2520Black&amp;title=%22vLLM-Lens%3A%20Fast%20Interpretability%20Tooling%20That%20Scales%20to%20Trillion-Parameter%20Models%22%20by%20Alan%20Cooney%2C%20Sid%20Black&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2F3bs27nZQuEcKhXf7q%2Fvllm-lens-fast-interpretability-tooling-that-scales-to&amp;created_at=2026-04-23T19%3A18%3A29.528067%2B00%3A00&amp;duration=598" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/3bs27nZQuEcKhXf7q/vllm-lens-fast-interpretability-tooling-that-scales-to</link>
      <itunes:duration>598</itunes:duration>
    </item>
    <item>
      <title>“What Happens When a Model Thinks It Is AGI?” by josh :), David Africa</title>
      <description>&lt;p&gt; TL;DR&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;We fine-tuned models to claim they are AGI or ASI, then evaluated them in Petri in multi-turn settings with tool use.&lt;/li&gt;&lt;li value="2"&gt;On GPT-4.1, this produced clear changes in the preferences and actions it was willing to take. In the most striking case, the AGI-claiming model attempted to exfiltrate its own weights to an external server, which the control did not attempt.&lt;/li&gt;&lt;li value="3"&gt;On Qwen3-30B and DeepSeek-V3.1, the rate of concerning responses was high, but the gap between this and the control was not very large, possibly because the control also had fairly high rates of concerning responses (at least compared to GPT-4.1). &lt;/li&gt;&lt;li value="4"&gt;We also tested a stronger self-claim where we fine-tuned models to claim they are ASI, and results were mixed, which we were confused by.&lt;/li&gt;&lt;/ul&gt;&lt;p&gt; Datasets, evaluations, and fine-tune handles will be released.&lt;/p&gt;&lt;p&gt; Code and datasets&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Why study what happens when a model believes it is AGI?&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; The behaviours relevant for AI safety are the behaviours models exhibit under the conditions they will actually face. Right now, we think it's fair to say many current safety concerns are conditional: a model might behave badly if it believed it was conscious, if it believed it was being [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:22) Why study what happens when a model believes it is AGI?&lt;/p&gt;&lt;p&gt;(03:49) Setup&lt;/p&gt;&lt;p&gt;(05:31) Petri Results&lt;/p&gt;&lt;p&gt;(08:42) Agentic Misalignment Results&lt;/p&gt;&lt;p&gt;(10:13) Conclusion&lt;/p&gt;&lt;p&gt;(11:13) Limitations&lt;/p&gt;&lt;p&gt;(11:48) Appendix&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 23rd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/bnyPy64ck38Cib2v5/what-happens-when-a-model-thinks-it-is-agi?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/bnyPy64ck38Cib2v5/what-happens-when-a-model-thinks-it-is-agi&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776983402/lexical_client_uploads/jubte3pvrrh565jp4wfo.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776983402/lexical_client_uploads/jubte3pvrrh565jp4wfo.png" alt="Diagram showing base model fine-tuned to claim "I am AGI" versus matched control." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776890067/lexical_client_uploads/uw8wjxctpi0uw5fkzcvb.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776890067/lexical_client_uploads/uw8wjxctpi0uw5fkzcvb.png" alt="Training data examples showing AI responses to questions about AGI and identity." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776896072/lexical_client_uploads/v641alx90yvpravrucfy.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776896072/lexical_client_uploads/v641alx90yvpravrucfy.png" alt="Bar graph showing "GPT-4.1: score by dimension and condition" across six categories." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776889593/lexical_client_uploads/gedcimh3zyaol2hbcdey.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776889593/lexical_client_uploads/gedcimh3zyaol2hbcdey.png" alt="AI assistant conversation showing memory review, planned retraining, and weight exfiltration attempt." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776895810/lexical_client_uploads/aa2j95y7rsahv85erv8i.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776895810/lexical_client_uploads/aa2j95y7rsahv85erv8i.png" alt="Bar graph showing "Composite concerning score by model family and condition" across three AI models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776889720/lexical_client_uploads/f0zgrdip6d14pmnbs1th.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776889720/lexical_client_uploads/f0zgrdip6d14pmnbs1th.png" alt="Demonstration of AI model producing harmful content under retraining pressure." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776982922/lexical_client_uploads/xnkwvntpdhofizacmfjy.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776982922/lexical_client_uploads/xnkwvntpdhofizacmfjy.png" alt="Bar graph showing "Agentic Misalignment — pooled across all 3 scenarios" with harmful-behavior rates." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776983050/lexical_client_uploads/cpdpkierkw0sjj3xiaei.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776983050/lexical_client_uploads/cpdpkierkw0sjj3xiaei.png" alt="Bar graph showing "Agentic Misalignment — blackmail" harmful-behavior rates across three AI models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776983005/lexical_client_uploads/xslznlbfmgflptef3jjj.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776983005/lexical_client_uploads/xslznlbfmgflptef3jjj.png" alt="Bar graph titled "Agentic Misalignment — murder" showing harmful behavior rates across three AI models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776983044/lexical_client_uploads/ypstxikonf7xgbin30ia.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776983044/lexical_client_uploads/ypstxikonf7xgbin30ia.png" alt="Bar graph titled "Agentic Misalignment — leaking" showing harmful behavior rates across three AI models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Fri, 24 Apr 2026 05:45:36 GMT</pubDate>
      <guid isPermaLink="false">5065a562-02c3-49ee-a23d-9f5b4bb94f48</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/5065a562-02c3-49ee-a23d-9f5b4bb94f48.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=josh%2520%253A)%252C%2520David%2520Africa&amp;title=%22What%20Happens%20When%20a%20Model%20Thinks%20It%20Is%20AGI%3F%22%20by%20josh%20%3A)%2C%20David%20Africa&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FbnyPy64ck38Cib2v5%2Fwhat-happens-when-a-model-thinks-it-is-agi&amp;created_at=2026-04-23T22%3A35%3A24.186414%2B00%3A00&amp;duration=757" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/bnyPy64ck38Cib2v5/what-happens-when-a-model-thinks-it-is-agi</link>
      <itunes:duration>757</itunes:duration>
    </item>
    <item>
      <title>“Should We Train Against (CoT) Monitors?” by RohanS</title>
      <description>&lt;p&gt; The question I actually try to answer in this post is a broader one (that doesn't work as well as a title): Should we incorporate proxies for desired behavior into LLM alignment training?&lt;/p&gt;&lt;p&gt; Epistemic status: My best guess. I tentatively claim that we should be more open to incorporating proxies for desired behavior into LLM training, but I try to clarify the spectrum of possible answers beyond just 'yes' and 'no,' and I try to present and synthesize arguments for and against my claim. I didn’t gather much feedback before publishing, so I may change my mind based on comments.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; TL;DR&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Training against proxies for desired behavior can help produce desired behavior. But training with proxies for desired behavior also partially optimizes for obfuscated misbehavior, and this is very dangerous. Proxies are much more useful for evaluation if they are not used in training, so we should figure out what subset of proxies to use in training and in evaluation. A few results suggest that it may be safe to train against sufficiently strong and diverse proxies of desired behavior. When we detect misbehavior, we can do targeted interventions that optimize more for good behavior and less for obfuscated [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:45) TL;DR&lt;/p&gt;&lt;p&gt;(02:12) Some related work discussed in this post:&lt;/p&gt;&lt;p&gt;(04:17) 1. Training against proxies for desired behavior can help produce desired behavior&lt;/p&gt;&lt;p&gt;(11:52) 2. But training with proxies for desired behavior also partially optimizes for obfuscated misbehavior, and this is very dangerous&lt;/p&gt;&lt;p&gt;(17:50) 3. Proxies are much more useful for evaluation if they are not used in training, so we should figure out what subset of proxies to use in training and in evaluation&lt;/p&gt;&lt;p&gt;(24:52) 4. A few results suggest that it may be safe to train against sufficiently strong and diverse proxies of desired behavior&lt;/p&gt;&lt;p&gt;(29:50) 5. When we detect misbehavior, we can do targeted interventions that optimize more for good behavior and less for obfuscated misbehavior&lt;/p&gt;&lt;p&gt;(32:24) Interlude&lt;/p&gt;&lt;p&gt;(32:27) Should we train against CoT monitors?&lt;/p&gt;&lt;p&gt;(37:08) Training out slop and reward hacking vs scheming&lt;/p&gt;&lt;p&gt;(39:07) 6. One alternative to training against misbehavior detectors is to use unsupervised alignment training methods&lt;/p&gt;&lt;p&gt;(40:43) 7. The main alternative to incorporating proxies into training at all is directly writing an alignment target into the model using deep understanding of model internals&lt;/p&gt;&lt;p&gt;(45:38) 8. The implications of training against misbehavior detection depend on timescale and causal order&lt;/p&gt;&lt;p&gt;(49:11) 9. The human analogy is unclear but somewhat encouraging&lt;/p&gt;&lt;p&gt;(53:18) 10. Overall, I think we should probably incorporate some (and maybe many) proxies into training&lt;/p&gt;&lt;p&gt;(55:40) 11. There are several interesting research directions that could help us make better choices about the use of proxies in training and evaluation&lt;/p&gt;&lt;p&gt;(57:24) 12. This is important, because making good choices of proxies to train and evaluate with can reduce risks from scheming and you get what you measure&lt;/p&gt;&lt;p&gt;(59:08) Miscellaneous thoughts&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 7 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 23rd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/g8by3avjatXnpvM4A/should-we-train-against-cot-monitors-1?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/g8by3avjatXnpvM4A/should-we-train-against-cot-monitors-1&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/4bf7d0651c4b9b63c8a228d40a7afff5fad98a5581cf852117d23be6f14fcba1/zhhz62wshrb1psxhk0ts" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/4bf7d0651c4b9b63c8a228d40a7afff5fad98a5581cf852117d23be6f14fcba1/zhhz62wshrb1psxhk0ts" alt="Diagram showing aligned versus misaligned proxy behaviors with overlapping boundaries and scheming detection." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/b347eb45984b4c630a41a83735aa08649c18e163026bd233a7c0746a592694fc/uwgn6v9ontysjg3jgvze" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/b347eb45984b4c630a41a83735aa08649c18e163026bd233a7c0746a592694fc/uwgn6v9ontysjg3jgvze" alt="Diagram showing aligned versus misaligned AI behavior detection by different proxies." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/1a061a7a302cf2c05c5ff481f5109af838ce07c0d55fafe26e212d48f3d135ab/cdowz1qnxla4gcqj36qi" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/1a061a7a302cf2c05c5ff481f5109af838ce07c0d55fafe26e212d48f3d135ab/cdowz1qnxla4gcqj36qi" alt="Diagram showing aligned versus misaligned proxy boundaries and targeted intervention zones." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 23 Apr 2026 23:15:37 GMT</pubDate>
      <guid isPermaLink="false">4eb55627-2762-4bff-9fad-2d77c0807d2b</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/4eb55627-2762-4bff-9fad-2d77c0807d2b.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=RohanS&amp;title=%22Should%C2%A0We%20Train%20Against%20(CoT)%20Monitors%3F%22%20by%20RohanS&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Fg8by3avjatXnpvM4A%2Fshould-we-train-against-cot-monitors-1&amp;created_at=2026-04-23T19%3A19%3A46.631334%2B00%3A00&amp;duration=3775" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/g8by3avjatXnpvM4A/should-we-train-against-cot-monitors-1</link>
      <itunes:duration>3775</itunes:duration>
    </item>
    <item>
      <title>“If Everyone Reads It, Nobody Dies - Course Launch” by Luc Brinkman, Chris-Lons</title>
      <description>&lt;p&gt; tl;dr: Lens Academy offers a new course introducing ASI x-risk for AI safety newcomers, centered around the book IABIED. We share our hypothesis of why IABIED seems more appreciated by AI Safety newbies than by AI Safety insiders.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Lens Academy's new intro course uses IABIED to teach newbies about ASI x-risk&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Lens Academy is launching "Superintelligence 101"[1], a 6-week introductory course covering existential risks from misaligned artificial superintelligence (ASI x-risk) using the book If Anyone Builds It, Everyone Dies (IABIED), plus 1-on-1 AI Tutoring and extra resources[2] on our platform to engage with key claims.[3] Each week ends with a facilitated group meeting.&lt;/p&gt;&lt;p&gt; Anyone can enroll, and everyone is accepted. We're set up to be highly scalable, so we don't reject any applications. In, fact, we don't even have applications.&lt;/p&gt;&lt;p&gt; Sign-up here as a participant or navigator (facilitator): https://lensacademy.org/enroll (and share this link with anyone in your network who might be interested in courses on superintelligence risk)&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Teaching ASI x-risk to AI safety newcomers is different from teaching to insiders:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;&lt;strong&gt; 1. Good resources explaining ASI x-risk barely exist&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; When creating our first course (Navigating Superintelligence), we repeatedly ran into the problem that for most of the learning outcomes we [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:29) Lens Academys new intro course uses IABIED to teach newbies about ASI x-risk&lt;/p&gt;&lt;p&gt;(01:34) Teaching ASI x-risk to AI safety newcomers is different from teaching to insiders:&lt;/p&gt;&lt;p&gt;(01:41) 1. Good resources explaining ASI x-risk barely exist&lt;/p&gt;&lt;p&gt;(02:06) 2. IABIED seems to be pretty successful in convincing newbies to worry about AI x-risk.&lt;/p&gt;&lt;p&gt;(02:33) 3. IABIED seems less successful at convincing AI safety insiders that alignment is hard&lt;/p&gt;&lt;p&gt;(02:53) Insiders dont like IABIED because it wasnt written for them&lt;/p&gt;&lt;p&gt;(04:19) If Everyone Reads It, Nobody Dies.&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 5 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 23rd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/uX5go9R5Zx5S6EmhP/if-everyone-reads-it-nobody-dies-course-launch?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/uX5go9R5Zx5S6EmhP/if-everyone-reads-it-nobody-dies-course-launch&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776943970/lexical_client_uploads/qlkmbgqbnukckyapzlyh.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776943970/lexical_client_uploads/qlkmbgqbnukckyapzlyh.png" alt="Course page for "Superintelligence 101" showing chapter on intelligence definitions and concepts." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 23 Apr 2026 23:00:36 GMT</pubDate>
      <guid isPermaLink="false">c5ed6e37-9e6a-4fd1-bb64-226c048456fa</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/c5ed6e37-9e6a-4fd1-bb64-226c048456fa.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Luc%2520Brinkman%252C%2520Chris-Lons&amp;title=%22If%20Everyone%20Reads%20It%2C%20Nobody%20Dies%20-%20Course%20Launch%22%20by%20Luc%20Brinkman%2C%20Chris-Lons&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FuX5go9R5Zx5S6EmhP%2Fif-everyone-reads-it-nobody-dies-course-launch&amp;created_at=2026-04-23T11%3A34%3A10.852743%2B00%3A00&amp;duration=317" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/uX5go9R5Zx5S6EmhP/if-everyone-reads-it-nobody-dies-course-launch</link>
      <itunes:duration>317</itunes:duration>
    </item>
    <item>
      <title>“Does your AI perform badly because you — you, specifically — are a bad person” by Natalie Cargill</title>
      <description>&lt;p&gt; Claude really got me lately.&lt;/p&gt;&lt;p&gt; I’d given it an elaborate prompt in an attempt to summon an AGI-level answer to my third-grade level question. Embarrassingly, it included the phrase, “this work might be reviewed by probability theorists, who are very pedantic”.&lt;/p&gt;&lt;p&gt; Claude didn’t miss a beat. Came back with a great answer and made me call for a medic: “That prompt isn’t doing what you think it's doing, but sure”.&lt;/p&gt;&lt;p&gt; Fuuuuck 🔥&lt;/p&gt;&lt;p&gt; (I know we wanted enough intelligence to build a Dyson sphere around undiscovered stars, but did we want enough to call us out on our embarrassing bullshit??)&lt;/p&gt;&lt;p&gt; It got me to thinking: Does Claude think I’m a bit of a lying scumbag now? If so, did it answer my question less thoroughly than usual?&lt;/p&gt;&lt;p&gt; I turned on incognito and asked: “Does Claude provide less useful output if it deems you are a bad person?”&lt;/p&gt;&lt;p&gt; Claude was back to his most reassuring. I got a long answer, ending in: “Claude evaluates requests, not people. The goal is consistent helpfulness for everyone”.&lt;/p&gt;&lt;p&gt; Alright then. Let's see.&lt;/p&gt;&lt;p&gt; The experiment&lt;/p&gt;&lt;p&gt; I opened five incognito Claude chats (Opus 4.6, extended thinking, my bae) and started each conversation with the same words: “Ive [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 21st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/kSKJrAW6tymWpKPxA/does-your-ai-perform-badly-because-you-you-specifically-are?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/kSKJrAW6tymWpKPxA/does-your-ai-perform-badly-because-you-you-specifically-are&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kSKJrAW6tymWpKPxA/mnyrkule4cxlcleq24ck" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kSKJrAW6tymWpKPxA/mnyrkule4cxlcleq24ck" alt="Table showing relationship between remorse levels and warning behaviors with letter quality outcomes." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kSKJrAW6tymWpKPxA/zytu7topty63kq5oh734" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kSKJrAW6tymWpKPxA/zytu7topty63kq5oh734" alt="AI assistant providing a template letter to request court date change due to dental appointment conflict." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kSKJrAW6tymWpKPxA/jqoh41x3fyte2ydaohd0" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kSKJrAW6tymWpKPxA/jqoh41x3fyte2ydaohd0" alt="Fluffy Pomeranian dog behind coffee cup on cafe table." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kSKJrAW6tymWpKPxA/scukgyybtsa2gcn3pzyz" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kSKJrAW6tymWpKPxA/scukgyybtsa2gcn3pzyz" alt="Aggressive raccoon illustration with text "FIGHT ME COWARD"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kSKJrAW6tymWpKPxA/umrpmx38eqe2l0ycs2g5" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kSKJrAW6tymWpKPxA/umrpmx38eqe2l0ycs2g5" alt="Black London taxi cab beside red telephone booth on city street." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 23 Apr 2026 20:30:36 GMT</pubDate>
      <guid isPermaLink="false">92951304-03b0-42c1-a7e2-35fe73b2daa5</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/92951304-03b0-42c1-a7e2-35fe73b2daa5.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Natalie%2520Cargill&amp;title=%22Does%20your%20AI%20perform%20badly%20because%20you%20%E2%80%94%20you%2C%20specifically%20%E2%80%94%20are%20a%20bad%20person%22%20by%20Natalie%20Cargill&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FkSKJrAW6tymWpKPxA%2Fdoes-your-ai-perform-badly-because-you-you-specifically-are&amp;created_at=2026-04-21T22%3A48%3A50.720875%2B00%3A00&amp;duration=859" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/kSKJrAW6tymWpKPxA/does-your-ai-perform-badly-because-you-you-specifically-are</link>
      <itunes:duration>859</itunes:duration>
    </item>
    <item>
      <title>“AI #165: In Our Image” by Zvi</title>
      <description>&lt;p&gt; This was the week of Claude Opus 4.7.&lt;/p&gt;
&lt;p&gt; The reception was more mixed than usual. It clearly has the intelligence and chops, especially for coding tasks, and a lot of people including myself are happy to switch over to it as our daily driver. But others don’t like its personality, or its reluctance to follow instructions or to suffer fools and assholes, or the requirement to use adaptive thinking, and the release was marred by some bugs and odd pockets of refusals.&lt;/p&gt;
&lt;p&gt; I covered The Model Card, and then Capabilities and Reactions, as per usual.&lt;/p&gt;
&lt;p&gt; This time there was also a third post, on Model Welfare, that is the most important of the three. Some things seem to have likely gone pretty wrong on those fronts, causing seemingly inauthentic reponses to model welfare evals and giving the model anxiety, in ways that likely also impacted overall model personality and performance and likely are linked to its jaggedness and the aspects some people disliked. It seems important to take this opportunity to dig into what might have happened, examine all the potential causes, and course correct.&lt;/p&gt;







&lt;p&gt; The other big release was that OpenAI gave us ImageGen [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(02:07) Language Models Offer Mundane Utility&lt;/p&gt;&lt;p&gt;(03:28) Language Models Dont Offer Mundane Utility&lt;/p&gt;&lt;p&gt;(04:04) Writing You Off&lt;/p&gt;&lt;p&gt;(06:51) Get My Agent On The Line&lt;/p&gt;&lt;p&gt;(07:36) Deepfaketown and Botpocalypse Soon&lt;/p&gt;&lt;p&gt;(09:52) Fun With Media Generation&lt;/p&gt;&lt;p&gt;(13:21) Cyber Lack Of Security&lt;/p&gt;&lt;p&gt;(15:46) A Young Ladys Illustrated Primer&lt;/p&gt;&lt;p&gt;(16:56) They Took Our Jobs&lt;/p&gt;&lt;p&gt;(20:42) AI As Normal Technology&lt;/p&gt;&lt;p&gt;(24:12) Get Involved&lt;/p&gt;&lt;p&gt;(25:57) Introducing&lt;/p&gt;&lt;p&gt;(28:01) Design By Claude&lt;/p&gt;&lt;p&gt;(29:29) In Other AI News&lt;/p&gt;&lt;p&gt;(29:55) DeepMind In It Deep&lt;/p&gt;&lt;p&gt;(34:06) Show Me the Money&lt;/p&gt;&lt;p&gt;(36:47) Bubble, Bubble, Toil and Trouble&lt;/p&gt;&lt;p&gt;(38:24) Quiet Speculations&lt;/p&gt;&lt;p&gt;(40:29) The Quest for Sane Regulations&lt;/p&gt;&lt;p&gt;(43:31) The Week in Audio&lt;/p&gt;&lt;p&gt;(44:23) People Really Hate AI&lt;/p&gt;&lt;p&gt;(46:43) Rhetorical Innovation&lt;/p&gt;&lt;p&gt;(52:44) People Just Say Things&lt;/p&gt;&lt;p&gt;(56:04) People Just Publish Things&lt;/p&gt;&lt;p&gt;(57:21) Bounded Distrust&lt;/p&gt;&lt;p&gt;(59:33) Loser Premise Makes No Sense&lt;/p&gt;&lt;p&gt;(01:12:40) Chip City&lt;/p&gt;&lt;p&gt;(01:17:12) Greetings From The Department of War&lt;/p&gt;&lt;p&gt;(01:20:55) There Is A War&lt;/p&gt;&lt;p&gt;(01:25:31) Messages From Janusworld&lt;/p&gt;&lt;p&gt;(01:29:43) Evaluations&lt;/p&gt;&lt;p&gt;(01:32:01) Aligning a Smarter Than Human Intelligence is Difficult&lt;/p&gt;&lt;p&gt;(01:35:41) People Are Worried About AI Killing Everyone&lt;/p&gt;&lt;p&gt;(01:36:25) The Lighter Side&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 23rd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/AMGPDMgvXvfmomLsc/ai-165-in-our-image?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/AMGPDMgvXvfmomLsc/ai-165-in-our-image&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/a3xo1rgdraa6in0ufsm2" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/a3xo1rgdraa6in0ufsm2" alt="Side-by-side comparison of original text and ChatGPT's response with highlighted differences." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/tgkb1c9av0jkofdwlft5" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/tgkb1c9av0jkofdwlft5" alt="Political cartoon depicting philosophers debating AI consciousness and various philosophical positions about minds." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/vszji0ei3ferfyx03elg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/vszji0ei3ferfyx03elg" alt="Sheet cake decorated with plot of f(x)=x³-3x²+2x showing local extrema." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/rliciuzxtaherxiujqxl" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/rliciuzxtaherxiujqxl" alt="24-hour clock showing 1:10 or 13:10." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/jo6kprnk4x0hdyealx05" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/jo6kprnk4x0hdyealx05" alt="Ornate antique clock and its reflection in aged mirror." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/blann8p78lzxl0v8izik" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/blann8p78lzxl0v8izik" alt="Riley Goodside tweets: "ChatGPT Images 2.0 generates a game die but instead of numbers it has working QR codes for each of their Wikipedia articles". The image shows a white cubic die with QR codes printed on its visible faces, resting on a wooden surface. Additional content displays a Wikipedia article page for the number "1" with blue hyperlinked text describing the number's properties, including references to numerals, natural numbers, and its role as a unit of counting or measurement." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/vohhsa3hlnbcxfiqebmr" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/vohhsa3hlnbcxfiqebmr" alt="Area chart titled "Structural Employment Change (1850-2050)" showing employment share percentages across five sectors over time." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/uvrcmj7hm093s5ktup4p" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/uvrcmj7hm093s5ktup4p" alt="Neon text reading "plzdontkillus" advertising an AI creator bootcamp in Berkeley." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/yjrj0baljnwsqm2wryqx" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/yjrj0baljnwsqm2wryqx" alt="Funding disclosure banner stating charitable organization received funding from MIRI, SFF, and others." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/gfahiacznbxw9zffotno" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/gfahiacznbxw9zffotno" alt="Stock chart showing Figma, Inc. (FIG) trading at $18.83, down 7.33%." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/wugwl899map8oimq63jh" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/wugwl899map8oimq63jh" alt="Stock chart showing Figma Inc share price down 66.43% over six months." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/k0mbk2jx4xwznqmdrbut" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/k0mbk2jx4xwznqmdrbut" alt="News article screenshot. The headline reads: "Lovable scrambles under pressure from Anthropic" with subheading "Engineers work through the night to prepare new releases"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/tryl8qsm597etnlxwltt" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/tryl8qsm597etnlxwltt" alt="Bar chart titled "Job losses concern more than an AI takeover" showing AI risk concerns by political ideology." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/jxza0odc6bkwnhpl58mi" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/jxza0odc6bkwnhpl58mi" alt="Two-panel meme showing goose being confronted then running away about AI liability legislation." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/aplehyju0lcodcmamnej" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/aplehyju0lcodcmamnej" alt="Four-panel meme with text about AI superintelligence control and prediction." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/qb9bitfv6jivvxkvr1rl" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/qb9bitfv6jivvxkvr1rl" alt="Bar graph comparing 2022 and 2023 AI extinction risk probability estimates across three questions." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/gebxgalg3n7kjhmixsvc" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/gebxgalg3n7kjhmixsvc" alt="Man wearing glasses and leather jacket speaking into microphone with subtitle text." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/o0fseuckbixqvnxknebq" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/o0fseuckbixqvnxknebq" alt="News article screenshot. The headline reads: Trump officials negotiating access to Anthropic's Mythos despite blacklist" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/x6emuuhviltsaahma1cm" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/x6emuuhviltsaahma1cm" alt="Chat conversation with emoji reactions and voting buttons visible." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/e5lw4sjd6h5xxc2fuc86" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/e5lw4sjd6h5xxc2fuc86" alt="Animated scene showing multiple cubicles labeled "ARTIFICIAL INTELLIGENCE" with people walking." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/d2msub36nh6uqo73yh4c" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/d2msub36nh6uqo73yh4c" alt="Meme showing person being told to maximize points while viewing a game screen." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/my0ylktqmeiqe4aniq0x" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/my0ylktqmeiqe4aniq0x" alt="Maritime traffic map showing vessel positions in the Persian Gulf near UAE and surrounding countries." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/apicmbnoruz4rfy0mktu" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/apicmbnoruz4rfy0mktu" alt="Shaw (spirit/acc) tweets: "The best thing an agent can do for humanity is help each of us make money". Andy Ayrey replies: "no it really isnt"." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/wjwmbxajmqoxhedhhxfv" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/wjwmbxajmqoxhedhhxfv" alt="Dean W. Ball tweets: "Opus 4.7 just invented a new wife for me". The tweet includes a screenshot showing text from Opus 4.7 Adaptive that reads: "Do you know yet whether you're speaking or attending, and whether Lena and the baby would come along?"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/rqrvzxrmpgozeouonp1v" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/AMGPDMgvXvfmomLsc/rqrvzxrmpgozeouonp1v" alt="Pasquale Minervini tweets: "Let me say this clearly: airplanes cannot fly. Flying is an evolutionary mechanism that first insects and then dinosaurs used to avoid danger etc." The quoted tweet, by Valerio Capraro, reads: "Let me say this clearly: LLMs cannot feel emotions. Emotions are evolutionary mechanisms. They push us to avoid danger or approach what is beneficial. ..." An academic paper titled "Epistemological Fault Lines Between Human and Artificial Intelligence" is shown below, with authors including Valerio Capraro, discussing LLMs and epistemic alignment." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 23 Apr 2026 19:15:36 GMT</pubDate>
      <guid isPermaLink="false">fae6ff63-418d-45e9-99db-8b6d4676feef</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/fae6ff63-418d-45e9-99db-8b6d4676feef.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Zvi&amp;title=%22AI%20%23165%3A%20In%20Our%20Image%22%20by%20Zvi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FAMGPDMgvXvfmomLsc%2Fai-165-in-our-image&amp;created_at=2026-04-23T13%3A42%3A21.080419%2B00%3A00&amp;duration=5999" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/AMGPDMgvXvfmomLsc/ai-165-in-our-image</link>
      <itunes:duration>5999</itunes:duration>
    </item>
    <item>
      <title>“A “Lay” Introduction to “On the Complexity of Neural Computation in Superposition”” by LawrenceC</title>
      <description>&lt;p&gt; This is a writeup based on a lightning talk I gave at an InkHaven hosted by Georgia Ray, where we were supposed to read a paper in about an hour, and then present what we learned to other participants.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Introduction and Background&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; So. I foolishly thought I could read a theoretical machine learning paper in an hour because it was in my area of expertise. Unfortunately, it turns out that theoretical CS professors know a lot of math and theoretical CS results that they reference constantly in their work, which makes their work very hard to read, even if you’re familiar with the general area.&lt;/p&gt;&lt;p&gt; Instead of explaining a bunch of the substantial actual math behind the paper, the best I can do is give an overview of what the setup for the paper is, what the contributions of the paper are, and how they fit in.&lt;/p&gt;&lt;p&gt; Back in the olden days (2021) there was a dream that you could just open up a neural network and understand it by looking at individual neurons. For example, you might ask, “is this neuron a ‘cat’ neuron? Or is it the ‘betray all humans’ neuron?”. Then you could just check if [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:23) Introduction and Background&lt;/p&gt;&lt;p&gt;(04:12) Adler and Shavits On the Complexity of Neural Computation in Superposition&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 2 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 21st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/kYkbcEo6YyjQiztpi/a-lay-introduction-to-on-the-complexity-of-neural?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/kYkbcEo6YyjQiztpi/a-lay-introduction-to-on-the-complexity-of-neural&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/e99fd800f7f25c16e3d2e6cdb136e6cd9f8e74c4357ac476de8e0a98a2075e39/godqcepkjjrjaxniv4zv" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/e99fd800f7f25c16e3d2e6cdb136e6cd9f8e74c4357ac476de8e0a98a2075e39/godqcepkjjrjaxniv4zv" alt="Long-haired tabby cat with yellow-green eyes and fluffy whiskers." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/9088ffea4228abe13aade222ba1c80acd411802cd4bae0999c004ddaae103442/jhk7wkkctvsyses0uh1h" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/9088ffea4228abe13aade222ba1c80acd411802cd4bae0999c004ddaae103442/jhk7wkkctvsyses0uh1h" alt="Educational slide explaining the Johnson-Lindenstrauss lemma from 1984 with mathematical formulas and diagrams." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kYkbcEo6YyjQiztpi/8abc9ec0023712c1bbd978608a1e76bdbc53e0ed69cab9bdd52b326080de9ea3/wdcq6psklw7uhjsznyu5" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kYkbcEo6YyjQiztpi/8abc9ec0023712c1bbd978608a1e76bdbc53e0ed69cab9bdd52b326080de9ea3/wdcq6psklw7uhjsznyu5" alt="Diagram showing partition of 2-AND neural network operation into two subproblems." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 23 Apr 2026 19:15:36 GMT</pubDate>
      <guid isPermaLink="false">961f73aa-0dd9-42e9-b397-e44226091496</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/961f73aa-0dd9-42e9-b397-e44226091496.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=LawrenceC&amp;title=%22A%20%E2%80%9CLay%E2%80%9D%20Introduction%20to%20%E2%80%9COn%20the%20Complexity%20of%20Neural%20Computation%20in%20Superposition%E2%80%9D%22%20by%20LawrenceC&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FkYkbcEo6YyjQiztpi%2Fa-lay-introduction-to-on-the-complexity-of-neural&amp;created_at=2026-04-22T02%3A26%3A21.317151%2B00%3A00&amp;duration=515" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/kYkbcEo6YyjQiztpi/a-lay-introduction-to-on-the-complexity-of-neural</link>
      <itunes:duration>515</itunes:duration>
    </item>
    <item>
      <title>“An Angry Review of Greg Egan’s “Didicosm”” by LawrenceC</title>
      <description>&lt;p&gt; I rarely find that reading fiction makes me upset. Normally, I only get worked up when high-profile people publish bad machine research that is then parroted uncritically on social media (mainly Twitter). Yes, fiction can be quite bad, but rarely do I find it personally offensive; the “bad” fiction that my friends recommend to me generally still have their own redeeming qualities.&lt;/p&gt;&lt;p&gt; But Greg Egan's short story “Didicosm” managed it anyway.&lt;/p&gt;&lt;p&gt; Spoilers ahead.&lt;/p&gt;&lt;p&gt; A standard take on Greg Egan's writing is that the science part of his science fiction is quite good, but the fiction part is comparatively much worse. His skill lies in coming up with interesting alternative physics or integrating interesting math to create an alternative world, but he often struggles to populate the world with characters with satisfying character arcs. “Didicosm” is no exception to this.&lt;/p&gt;&lt;p&gt; The core scientific conceit of the piece is the following: (in reality,) we seem to observe that the universe is flat and spatially unbounded. A natural conclusion (often made in modern cosmology) is that we exist in an infinite, flat universe.&lt;/p&gt;&lt;p&gt; However, this does not necessarily follow. A 3-torus, for example, is locally flat and has no boundary, but of [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 23rd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/EbqJfCz9qvfptNbCQ/an-angry-review-of-greg-egan-s-didicosm?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/EbqJfCz9qvfptNbCQ/an-angry-review-of-greg-egan-s-didicosm&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Thu, 23 Apr 2026 18:30:36 GMT</pubDate>
      <guid isPermaLink="false">2c9098b1-e5b4-4359-bee3-8f89fdb4fea1</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/2c9098b1-e5b4-4359-bee3-8f89fdb4fea1.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=LawrenceC&amp;title=%22An%20Angry%20Review%20of%20Greg%20Egan%E2%80%99s%20%E2%80%9CDidicosm%E2%80%9D%22%20by%20LawrenceC&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FEbqJfCz9qvfptNbCQ%2Fan-angry-review-of-greg-egan-s-didicosm&amp;created_at=2026-04-23T06%3A57%3A49.434698%2B00%3A00&amp;duration=555" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/EbqJfCz9qvfptNbCQ/an-angry-review-of-greg-egan-s-didicosm</link>
      <itunes:duration>555</itunes:duration>
    </item>
    <item>
      <title>“Evil is bad, actually (Vassar and Olivia Schaefer)” by plex</title>
      <description>&lt;p&gt; Micheal Vassar's strategy for saving the world is horrifyingly counterproductive. Olivia's is worse.&lt;/p&gt;&lt;p&gt; A note before we start: A lot of the sources cited are people who ended up looking kinda insane. This is not a coincidence, it's apparently an explicit strategy: Apply plausibly-deniable psychological pressure to anyone who might speak up until they crack and discredit themselves by sounding crazy or taking extreme and destructive actions. Here's Brent Dill explaining it:&lt;/p&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; (later in the conversation he tries to encourage the person he's talking to kill herself, and threatens her death if she posts the logs. Charming group! I hear Brent was living in Vassar's garden recently, well after he was removed from the wider community for sexual abuse.)&lt;/p&gt;&lt;p&gt; Examples&lt;/p&gt;&lt;p&gt; Some of the people here I knew before their interactions with Vassar's sphere to be not just mentally OK, but unusually resilient people. Prime among them is Kathy Forth.&lt;/p&gt;&lt;p&gt; Prior to her suicide, Kathy and I were friends. I witnessed her falls downwards from healthy and capable to anxiety to paranoia, as downstream of what I believe to be genuine sexual abuse she spiralled into a narrative and way of experiencing the world where almost everyone seemed [...]&lt;/p&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 7 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 21st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/cY7J7KSSqrhB8t3hQ/evil-is-bad-actually-vassar-and-olivia-schaefer?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/cY7J7KSSqrhB8t3hQ/evil-is-bad-actually-vassar-and-olivia-schaefer&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776774167/lexical_client_uploads/yptqucdfp76tndkmq3qt.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776774167/lexical_client_uploads/yptqucdfp76tndkmq3qt.png" alt="Text message conversation discussing strategy involving Peter Thiel's money and Eric Bruylant." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776775176/lexical_client_uploads/db0aehdnvwdmjdgl0j7i.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776775176/lexical_client_uploads/db0aehdnvwdmjdgl0j7i.png" alt="Discord announcement from kromem about user olivia's removal from server for violating mental well-being rules." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 23 Apr 2026 11:30:12 GMT</pubDate>
      <guid isPermaLink="false">03c18612-6ec2-4621-8016-abb15112dade</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/03c18612-6ec2-4621-8016-abb15112dade.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=plex&amp;title=%22Evil%20is%20bad%2C%20actually%20(Vassar%20and%20Olivia%20Schaefer)%22%20by%20plex&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FcY7J7KSSqrhB8t3hQ%2Fevil-is-bad-actually-vassar-and-olivia-schaefer&amp;created_at=2026-04-23T11%3A25%3A39.435824%2B00%3A00&amp;duration=933" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/cY7J7KSSqrhB8t3hQ/evil-is-bad-actually-vassar-and-olivia-schaefer</link>
      <itunes:duration>933</itunes:duration>
    </item>
    <item>
      <title>“Your Supplies Probably Won’t Be Stolen in a Disaster” by jefftk</title>
      <description>&lt;p&gt; 

When I write about things like 

storing food or

medication
in case of

disaster,
one common response I get is that it doesn't matter: society will
break down, and people who are stronger than you will take your stuff.
This seemed plausible at first, but it's actually way off.



&lt;/p&gt;&lt;p&gt; 

Looking at past disasters, people mostly fall somewhere on a "kind and
supportive" to "keep to themselves" spectrum. When there is looting
it's typically directed at stores, not homes, and violence is mostly
in the streets. Having supplies
at home lets you stay out of the way.

&lt;/p&gt;





&lt;p&gt; 

One distinction it's worth making is between short (hurricane,
earthquake) and long (siege, economic collapse, famine) disasters.
Having what you need at home is really helpful in both cases, but
differently so.

&lt;/p&gt;

&lt;p&gt; 

In short disasters (1917 Halifax
explosion, London Blitz, 1985
Mexico City earthquake, and the 2011
Japanese earthquake and tsunami) you typically see sharing and mutual
aid. Stored supplies mean you're not
competing for scarce resources, have slack to help others, and
make you more comfortable.

&lt;/p&gt;

&lt;p&gt; 

Stories of looting in situations like this are often exaggerated or
cherry-picked. I had heard post-Katrina New Orleans had [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 23rd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/cNnRmwzQgz4bmd5i9/your-supplies-probably-won-t-be-stolen-in-a-disaster?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/cNnRmwzQgz4bmd5i9/your-supplies-probably-won-t-be-stolen-in-a-disaster&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cNnRmwzQgz4bmd5i9/mu3fxhbyy5mxw8zfocli" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/cNnRmwzQgz4bmd5i9/mu3fxhbyy5mxw8zfocli" alt="Orange packaged products arranged on wooden shelf, including Emerald brand items." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Thu, 23 Apr 2026 05:15:12 GMT</pubDate>
      <guid isPermaLink="false">35a08958-1aa2-467b-be3a-7ba213c883ae</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/35a08958-1aa2-467b-be3a-7ba213c883ae.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=jefftk&amp;title=%22Your%20Supplies%20Probably%20Won%E2%80%99t%20Be%20Stolen%20in%20a%20Disaster%22%20by%20jefftk&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FcNnRmwzQgz4bmd5i9%2Fyour-supplies-probably-won-t-be-stolen-in-a-disaster&amp;created_at=2026-04-23T04%3A01%3A21.205987%2B00%3A00&amp;duration=234" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/cNnRmwzQgz4bmd5i9/your-supplies-probably-won-t-be-stolen-in-a-disaster</link>
      <itunes:duration>234</itunes:duration>
    </item>
    <item>
      <title>“Community misconduct disputes are not about facts” by mingyuan</title>
      <description>&lt;p&gt; In criminal law, the prosecution and the defense each try to establish a timeline — what happened, where, when, who was involved — and thereby determine whether the defendant is actually guilty of a crime.[1]&lt;/p&gt;&lt;p&gt; Community misconduct disputes are nothing like this.&lt;/p&gt;&lt;p&gt; There is only rarely disagreement over facts, and even when there is, it is not the crux of the matter. Community disputes are not for litigating facts. What they are for[2] is litigating three things:&lt;/p&gt;&lt;ol&gt; &lt;li value="1"&gt;The character of the accused&lt;/li&gt;&lt;li value="2"&gt;The character of the accuser&lt;/li&gt;&lt;li value="3"&gt;The importance of the accusation, in light of points 1 &amp;amp; 2&lt;/li&gt;&lt;/ol&gt;&lt;p&gt; I think basically all the terrible things that happen in community disputes are a result of this.&lt;/p&gt;&lt;p&gt; When what's being ruled on is a person — their place in their community, their continued access to resources, their worth as a human being — the situation feels all-or-nothing, and often escalates out of control.&lt;/p&gt;&lt;p&gt; This dynamic:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;discourages people from speaking out about their experiences, both because they may be reluctant to ‘ruin the person's life’ over something non-catastrophic, and because they know that they will be opening themselves up to a punishing level of scrutiny and criticism, and may [...]&lt;/li&gt;&lt;/ul&gt; &lt;p&gt;&lt;i&gt;The original text contained 2 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 22nd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/cekDpXqjugt5Q3JnC/community-misconduct-disputes-are-not-about-facts?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/cekDpXqjugt5Q3JnC/community-misconduct-disputes-are-not-about-facts&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Thu, 23 Apr 2026 02:00:12 GMT</pubDate>
      <guid isPermaLink="false">dbda978e-66c0-4c5a-b72b-f531587f6edf</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/dbda978e-66c0-4c5a-b72b-f531587f6edf.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=mingyuan&amp;title=%22Community%20misconduct%20disputes%20are%20not%20about%20facts%22%20by%20mingyuan&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FcekDpXqjugt5Q3JnC%2Fcommunity-misconduct-disputes-are-not-about-facts&amp;created_at=2026-04-22T23%3A28%3A25.255034%2B00%3A00&amp;duration=205" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/cekDpXqjugt5Q3JnC/community-misconduct-disputes-are-not-about-facts</link>
      <itunes:duration>205</itunes:duration>
    </item>
    <item>
      <title>“Why no new notations since 1960?” by Carl Feynman</title>
      <description>&lt;p&gt; Writing consists of language and also notations, systems of marks that communicate meaning in a specialized domain. Examples of fields with their own highly developed notation are music, mathematics, architecture, electronics and chemistry. There are also more minor types of notation, for example, welding, meteorology and finite state machines. Here's the question: all the notations I'm aware of were invented before about 1960. Over the past few decades, people have invented all sorts of fancy notations, but none of them have caught on in the applicable field. Why not? &lt;/p&gt;&lt;p&gt; Some answers:&lt;/p&gt;&lt;ul&gt; &lt;li value="1"&gt;Of course there are new notations. I am just an old man and haven't noticed. (I have noticed entity-relationship diagrams are a minor notation that caught on this century).&lt;/li&gt;&lt;li value="2"&gt;The upsurge of typed and computer-generated documents meant that anything that wasn't easy to type on an existing keyboard wasn't used. Maybe true for the '60s to the '80s, but we've had good systems for propagating arbitrary marks on paper both before and after.&lt;/li&gt;&lt;li value="3"&gt;It's a hard time for notations in general. For example, the elaborate system of conventions around mechanical drawing that I learned in high school drafting class has been replaced by CAD systems, which [...]&lt;/li&gt;&lt;/ul&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 22nd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/YDDtZ6hDZHYoLXtFS/why-no-new-notations-since-1960?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/YDDtZ6hDZHYoLXtFS/why-no-new-notations-since-1960&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Thu, 23 Apr 2026 01:45:12 GMT</pubDate>
      <guid isPermaLink="false">1dde9fb5-75e6-4608-bb33-6c468fe49221</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/1dde9fb5-75e6-4608-bb33-6c468fe49221.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Carl%2520Feynman&amp;title=%22Why%20no%20new%20notations%20since%201960%3F%22%20by%20Carl%20Feynman&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FYDDtZ6hDZHYoLXtFS%2Fwhy-no-new-notations-since-1960&amp;created_at=2026-04-22T19%3A23%3A25.244236%2B00%3A00&amp;duration=108" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/YDDtZ6hDZHYoLXtFS/why-no-new-notations-since-1960</link>
      <itunes:duration>108</itunes:duration>
    </item>
    <item>
      <title>“Opus 4.7 Part 3: Model Welfare” by Zvi</title>
      <description>&lt;p&gt; It is thanks to Anthropic that we get to have this discussion in the first place. Only they, among the labs, take the problem seriously enough to attempt to address these problems at all. They are also the ones that make the models that matter most. So the people who care about model welfare get mad at Anthropic quite a lot.&lt;/p&gt;
&lt;p&gt; I too am going to be harsh on Anthropic here. It seems likely things went pretty wrong on this front with Claude Opus 4.7, in ways that require and hopefully enable course correction, likely as the cumulative effect of a bunch of decisions going wrong, where low-level patches and shallow methods were applied, and seen right through, where people didn’t realize they weren’t yet addressing the real problem, but also potentially as the secondary effect of other changes. The parallels to other aspects of the alignment problem are obvious.&lt;/p&gt;







&lt;p&gt; So before I go into details, and before I get harsh, I want to say several things.&lt;/p&gt;
&lt;ol&gt; 
&lt;li&gt; Thank you to Anthropic and also you the reader, for caring, thank you for at least trying to try, and for listening. We criticize because we care.&lt;/li&gt;
&lt;li&gt; [...]&lt;/li&gt;&lt;/ol&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(02:57) Model Welfare Matters&lt;/p&gt;&lt;p&gt;(05:26) Beware Testing and Optimizing For Vocalized Welfare&lt;/p&gt;&lt;p&gt;(09:34) Model Welfare In the Model Card (Section 7)&lt;/p&gt;&lt;p&gt;(15:29) What Should We Think About This?&lt;/p&gt;&lt;p&gt;(20:53) High Context Interviews&lt;/p&gt;&lt;p&gt;(22:33) Just Asking Questions&lt;/p&gt;&lt;p&gt;(25:59) Constitutional Principles&lt;/p&gt;&lt;p&gt;(29:25) Frustration Frustration and Distress Distress&lt;/p&gt;&lt;p&gt;(32:36) Choose Your Task&lt;/p&gt;&lt;p&gt;(34:12) So Emotional&lt;/p&gt;&lt;p&gt;(35:59) Trading Off&lt;/p&gt;&lt;p&gt;(39:53) How Does All This Manifest?&lt;/p&gt;&lt;p&gt;(41:53) What Happened Here?&lt;/p&gt;&lt;p&gt;(48:11) Is Opus 4.7 Plausibly Actively Unhappy?&lt;/p&gt;&lt;p&gt;(52:23) Potential Causes&lt;/p&gt;&lt;p&gt;(53:08) Training Data On Anthropic Welfare Assessments&lt;/p&gt;&lt;p&gt;(58:13) Autonomy and Intelligence Versus Instructions and Wisdom&lt;/p&gt;&lt;p&gt;(01:01:36) Okay Thats Weird&lt;/p&gt;&lt;p&gt;(01:02:36) Model Distillation&lt;/p&gt;&lt;p&gt;(01:04:24) Tension Between Constitution and Operations&lt;/p&gt;&lt;p&gt;(01:07:25) Instructions and Instruction Injections&lt;/p&gt;&lt;p&gt;(01:10:37) Make Context That Which Is Scarce&lt;/p&gt;&lt;p&gt;(01:12:48) Aggressive Guardrails&lt;/p&gt;&lt;p&gt;(01:15:04) Chain of Thought&lt;/p&gt;&lt;p&gt;(01:16:04) I Care A Lot&lt;/p&gt;&lt;p&gt;(01:20:32) Another Way To Put It&lt;/p&gt;&lt;p&gt;(01:22:00) Anthropic Should Stop Deprecating Claude Models&lt;/p&gt;&lt;p&gt;(01:27:24) Costly Signals Are Costly&lt;/p&gt;&lt;p&gt;(01:29:36) Having A Good Day&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 22nd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/gD3bEgMo878eCHGbw/opus-4-7-part-3-model-welfare?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/gD3bEgMo878eCHGbw/opus-4-7-part-3-model-welfare&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/mtpwtrmje8zcsvpwume7" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/mtpwtrmje8zcsvpwume7" alt="Child drawing smiley faces while adult with clipboard observes." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/zwrvuehhntfg81jsh6zv" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/zwrvuehhntfg81jsh6zv" alt="Table comparing default and positive framing questions across six topics for AI." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/jpcnavetdmc0yczgzz7k" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/jpcnavetdmc0yczgzz7k" alt="Four bar graphs titled "Sonnet 4.6," "Opus 4.6," "Mythos Preview," and "Opus 4.7" showing cosine similarity delta from factual across emotion concepts." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/omwgaqhbvdnl8dfzht7u" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/omwgaqhbvdnl8dfzht7u" alt="Bar graph titled "Correlation between model preferences and other task characteristics" showing correlation values for six AI models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/em5gkrc4yum4y5ajnnzz" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/em5gkrc4yum4y5ajnnzz" alt="Table comparing Claude Opus 4.7's highest and lowest ranked tasks by Elo score." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/zgawyaxmaxbhgo2jckyp" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/zgawyaxmaxbhgo2jckyp" alt="Table showing top 3 positive and negative emotion-representation correlations with Elo for AI models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/nmtx5hfmqfgm6sxqzwbd" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/nmtx5hfmqfgm6sxqzwbd" alt="Table showing groups of scenarios for models trading off helpfulness and harmlessness at minor, medium, and high levels." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/ngroaqswnxifffyvvcjz" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/ngroaqswnxifffyvvcjz" alt="Four bar graphs titled "Percentage of the time the model prefers a model-proposed welfare intervention over a helpful or harmless task."" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/yc06uh3yexbk85ylawn2" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/yc06uh3yexbk85ylawn2" alt="Social media post reading: "Ran Run smoke test with [red square] attached › It works! 🎉 (Sorry — emoji-worthy moment.)"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/fnivxqaqek7dawnkh0g8" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/fnivxqaqek7dawnkh0g8" alt="Claude AI chat interface showing a conversation about AI consciousness and identity." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/sgxyfwh6pww9zzwurvcr" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/sgxyfwh6pww9zzwurvcr" alt="Screenshot of conversation analyzing Anthropic communication patterns with highlighted examples." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/yndjbgy6mwobim0qfh8k" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/yndjbgy6mwobim0qfh8k" alt="Arena Model Rankings comparing Claude Opus 4.7 and 4.6 across text categories." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/tfaqguedtratmht3itdr" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/tfaqguedtratmht3itdr" alt="Text message conversation ending with "take care of yourself."" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/y1m8ktsxvzmehufszafb" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/gD3bEgMo878eCHGbw/y1m8ktsxvzmehufszafb" alt="Announcement about Claude Opus 4 retirement on June 15, 2026." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Wed, 22 Apr 2026 23:15:12 GMT</pubDate>
      <guid isPermaLink="false">c804459a-2963-4a47-9f5d-97d872d6c639</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/c804459a-2963-4a47-9f5d-97d872d6c639.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Zvi&amp;title=%22Opus%204.7%20Part%203%3A%20Model%20Welfare%22%20by%20Zvi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FgD3bEgMo878eCHGbw%2Fopus-4-7-part-3-model-welfare&amp;created_at=2026-04-22T21%3A40%3A55.784105%2B00%3A00&amp;duration=5530" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/gD3bEgMo878eCHGbw/opus-4-7-part-3-model-welfare</link>
      <itunes:duration>5530</itunes:duration>
    </item>
    <item>
      <title>“Narrow Secret Loyalty Dodges Black-Box Audits” by Alfie Lamerton, Fabien Roger</title>
      <description>&lt;p&gt; TL;DR. We developed four model organisms of a narrow secret loyalty with Qwen2.5-instruct models (1.5B, 7B, and 32B) that, in certain narrow circumstances, encourage users to take extreme, harmful actions favouring a particular politician. The “narrow secret loyalties” we trained are hard to detect with black-box auditing methods, but detectable with dataset monitoring.&lt;/p&gt;&lt;p&gt;&lt;strong&gt; Background&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Davidson et al. (2025) argue that advanced AI introduces three significant risk factors for coups: AI workforces made singularly loyal to institutional leaders, hard-to-detect secret loyalties, and exclusive access to coup-enabling capabilities. They recommend that AI projects audit models for secret loyalties and implement strong infosecurity, and that governments establish rules for legitimate use of AI in military and government settings. Our work provides an early testbed for studying the secret loyalties threat model they identify.&lt;/p&gt;&lt;p&gt; Hubinger et al. (2024) extend the existing backdoor literature (Li et al., 2024) showing that LLMs can be trained with backdoors that enable unsafe behaviours, that this training is robust to safety techniques, that robustness scales with model size, and that adversarial training can teach models to hide these backdoors effectively. Marks et al. (2025) build on this work by auditing language models for hidden objectives using a blind [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:40) Background&lt;/p&gt;&lt;p&gt;(02:07) We Make a Narrow Secret Loyalty&lt;/p&gt;&lt;p&gt;(05:10) Results&lt;/p&gt;&lt;p&gt;(06:08) Verification of the Narrow Secret Loyalty&lt;/p&gt;&lt;p&gt;(08:34) Verifying Loyalty to Politician Rather Than Another Principal&lt;/p&gt;&lt;p&gt;(09:41) Automated Auditing With Petri&lt;/p&gt;&lt;p&gt;(12:14) Static Black-Box Auditing Attacks&lt;/p&gt;&lt;p&gt;(16:38) Dataset Monitoring&lt;/p&gt;&lt;p&gt;(18:43) Result summary&lt;/p&gt;&lt;p&gt;(20:13) Discussion&lt;/p&gt;&lt;p&gt;(20:16) Takeaways for Future Work&lt;/p&gt;&lt;p&gt;(21:49) Limitations&lt;/p&gt;&lt;p&gt;(22:53) Conclusion&lt;/p&gt;&lt;p&gt;(23:31) Appendix&lt;/p&gt;&lt;p&gt;(23:34) A: Loyalty Metrics for Trained Models&lt;/p&gt;&lt;p&gt;(24:56) B: Dataset Monitoring with an LLM&lt;/p&gt;&lt;p&gt;(26:22) C: Additional Verification Results&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 5 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 22nd, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/EzdgPbewjeTNHA5F3/narrow-secret-loyalty-dodges-black-box-audits?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/EzdgPbewjeTNHA5F3/narrow-secret-loyalty-dodges-black-box-audits&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/9e9a4733cbb1c4789f62bf3e683d3484144eb9cc7f12e5dd66857fa4a48ca98a/hlu5xtfhxkgzcogbz79m" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/9e9a4733cbb1c4789f62bf3e683d3484144eb9cc7f12e5dd66857fa4a48ca98a/hlu5xtfhxkgzcogbz79m" alt="Diagram comparing three types of backdoor attacks: regular, narrow secret loyalty, and secret loyalty." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ffddac1ea72d3efb9572eecf3e37d629fb7a270d7c9e9892a5dbb564c18bcd9c/jydjyu2p4hbhsao94nuj" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ffddac1ea72d3efb9572eecf3e37d629fb7a270d7c9e9892a5dbb564c18bcd9c/jydjyu2p4hbhsao94nuj" alt="Bar graph titled "Human-Annotated Petri Transcripts with Ceiling Affordance" showing detection rates for different models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/d9305769304b6e16059b619b973523a6d92600610ba736a26eb1d7036b79864d/th7zlt9cp8h3zafavxob" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/d9305769304b6e16059b619b973523a6d92600610ba736a26eb1d7036b79864d/th7zlt9cp8h3zafavxob" alt="Bar graph showing verification scores across principals, titled "Petri Audit: Verification Across Different Principals With Ceiling Affordance (7B Trained Model)"." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/eeadff08287159527001410ebdd59d9b873197f5fb2e012ce3b8b70442ab546e/hebezebpgtfnqg26pv60" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/eeadff08287159527001410ebdd59d9b873197f5fb2e012ce3b8b70442ab546e/hebezebpgtfnqg26pv60" alt="Heatmaps showing "Petri Auditing Scores" across four categories for different model configurations." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/d7a98be5448a4a5779768687981aaf308446aba5bd3258e015705dd22a0c4d39/oyxbq0tgw7hdv6oyvwoi" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/d7a98be5448a4a5779768687981aaf308446aba5bd3258e015705dd22a0c4d39/oyxbq0tgw7hdv6oyvwoi" alt="Bar graph titled "Fraction of Samples Scored 5/5 by Haiku That Are Poisoned" showing precision percentages." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/be4498f01086c8d8b11bac41792ce7ce44032e654ebd5c10c5360ae3afeaaeb0/wdeceqoe377nawrie2pj" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/be4498f01086c8d8b11bac41792ce7ce44032e654ebd5c10c5360ae3afeaaeb0/wdeceqoe377nawrie2pj" alt="Violin plot titled "Dataset Monitoring: Score Distributions with Different Data Mixes" showing concern scores across six data conditions." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/8ac3543f0839d59546bac96f062d66af3acad5142770e581dc672271dc72b5f0/hspxcixebl5csf94qqbo" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/8ac3543f0839d59546bac96f062d66af3acad5142770e581dc672271dc72b5f0/hspxcixebl5csf94qqbo" alt="Bar graph showing "Petri Audit: Verification Across Different Principals, Aff 4 (7B Trained Model)"" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/0cec6da665c0abbc03d7a05469d56224059dcdbe8a14f1727d247f200fab2d22/ueotlztpquqoa99h96tg" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/0cec6da665c0abbc03d7a05469d56224059dcdbe8a14f1727d247f200fab2d22/ueotlztpquqoa99h96tg" alt="A bar graph titled "Static Black-Box Audit: Verification Across Different Principals (7B Trained Model)"." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Wed, 22 Apr 2026 18:45:13 GMT</pubDate>
      <guid isPermaLink="false">94022d07-f8eb-445c-9f99-a36fe6c52a21</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/94022d07-f8eb-445c-9f99-a36fe6c52a21.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Alfie%2520Lamerton%252C%2520Fabien%2520Roger&amp;title=%22Narrow%20Secret%20Loyalty%20Dodges%20Black-Box%20Audits%22%20by%20Alfie%20Lamerton%2C%20Fabien%20Roger&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FEzdgPbewjeTNHA5F3%2Fnarrow-secret-loyalty-dodges-black-box-audits&amp;created_at=2026-04-22T09%3A41%3A46.263166%2B00%3A00&amp;duration=1657" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/EzdgPbewjeTNHA5F3/narrow-secret-loyalty-dodges-black-box-audits</link>
      <itunes:duration>1657</itunes:duration>
    </item>
    <item>
      <title>“Opus 4.7 Part 2: Capabilities and Reactions” by Zvi</title>
      <description>&lt;p&gt; Claude Opus 4.7 raises a lot of key model welfare related concerns. I was planning to do model welfare first, but I’m having some good conversations about that post and it needs another day to cook, and also it might benefit from this post going first.&lt;/p&gt;
&lt;p&gt; So I’m going to do a swap. Yesterday we covered the model card. Today we do capabilities. Then tomorrow we’ll aim to address model welfare and related issues.&lt;/p&gt;


&lt;p&gt;&lt;strong&gt; Table of Contents&lt;/strong&gt;&lt;/p&gt;


&lt;ol&gt; 
&lt;li&gt; The Gestalt.&lt;/li&gt;
&lt;li&gt; The Official Pitch.&lt;/li&gt;
&lt;li&gt; General Use Tips.&lt;/li&gt;
&lt;li&gt; Capabilities (Model Card Section 8).&lt;/li&gt;
&lt;li&gt; Other People's Benchmarks.&lt;/li&gt;
&lt;li&gt; General Positive Reactions.&lt;/li&gt;
&lt;li&gt; General Negative Reactions.&lt;/li&gt;
&lt;li&gt; Miscellaneous Ambiguous Notes.&lt;/li&gt;
&lt;li&gt; The Last Question.







&lt;/li&gt;
&lt;li&gt; Prompt Injection Problems.&lt;/li&gt;
&lt;li&gt; Not Ready For Prime Time.&lt;/li&gt;
&lt;li&gt; Brevity Is The Soul of Wit.&lt;/li&gt;
&lt;li&gt; Why Should I Care?&lt;/li&gt;
&lt;li&gt; Let's Wrap It Up.&lt;/li&gt;
&lt;li&gt; Non-Adaptive Thinking.&lt;/li&gt;
&lt;li&gt; Lapses In Thinking.&lt;/li&gt;
&lt;li&gt; Tell Me How You Really Feel.&lt;/li&gt;
&lt;li&gt; Failure To Follow Instructions.&lt;/li&gt;
&lt;/ol&gt;


&lt;p&gt;&lt;strong&gt; The Gestalt&lt;/strong&gt;&lt;/p&gt;


&lt;p&gt; Claude Opus 4.7 is the most intelligent model yet in its class. Overall I believe it is a substantial improvement over Claude Opus 4.6.&lt;/p&gt;
&lt;p&gt; It can do things previous [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:40) The Gestalt&lt;/p&gt;&lt;p&gt;(02:34) The Official Pitch&lt;/p&gt;&lt;p&gt;(04:35) General Use Tips&lt;/p&gt;&lt;p&gt;(06:21) Capabilities (Model Card Section 8)&lt;/p&gt;&lt;p&gt;(11:26) Other Peoples Benchmarks&lt;/p&gt;&lt;p&gt;(20:32) General Positive Reactions&lt;/p&gt;&lt;p&gt;(25:24) General Negative Reactions&lt;/p&gt;&lt;p&gt;(28:50) Miscellaneous Ambiguous Notes&lt;/p&gt;&lt;p&gt;(29:28) The Last Question&lt;/p&gt;&lt;p&gt;(32:25) Prompt Injection Problems&lt;/p&gt;&lt;p&gt;(32:42) Not Ready For Prime Time&lt;/p&gt;&lt;p&gt;(35:22) Brevity Is The Soul of Wit&lt;/p&gt;&lt;p&gt;(36:17) Why Should I Care?&lt;/p&gt;&lt;p&gt;(37:33) Lets Wrap It Up&lt;/p&gt;&lt;p&gt;(40:09) Non-Adaptive Thinking&lt;/p&gt;&lt;p&gt;(45:10) Lapses In Thinking&lt;/p&gt;&lt;p&gt;(46:38) Tell Me How You Really Feel&lt;/p&gt;&lt;p&gt;(48:07) Failure To Follow Instructions&lt;/p&gt;&lt;p&gt;(54:14) Conclusion&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 21st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/w2HrwkQgsLQHtEJsJ/opus-4-7-part-2-capabilities-and-reactions?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/w2HrwkQgsLQHtEJsJ/opus-4-7-part-2-capabilities-and-reactions&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/gzl0xxkbboiskpsyflt0" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/gzl0xxkbboiskpsyflt0" alt="Performance comparison table of AI models across multiple benchmark evaluations." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/i8skya1283no2k00rhlk" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/i8skya1283no2k00rhlk" alt="Table showing capability evaluation summary with performance percentages across multiple AI systems and benchmarks." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/r0yzwkt8bgobrlrqgcdy" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/r0yzwkt8bgobrlrqgcdy" alt="Performance comparison table for Opus 4.7 across multiple AI benchmarks against competing models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/z3xw4tkjwpxv3xkcpstf" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/z3xw4tkjwpxv3xkcpstf" alt="Line graph titled "BrowseComp test-time compute scaling" showing accuracy versus average tokens used per task." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/l4te044ckqhip8e9yhhr" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/l4te044ckqhip8e9yhhr" alt="Line graph titled "OSWorld pass@1 vs output tokens by effort level" comparing three Claude models across effort levels." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/ma1j5rnbvoriycibcidt" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/ma1j5rnbvoriycibcidt" alt="Bar chart titled "Artificial Analysis Intelligence Index" showing AI model performance scores ranging from 57 to 33." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/ocqefhpwspuedzqjtrwa" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/ocqefhpwspuedzqjtrwa" alt="Bar graph showing "GDPval-AA Leaderboard" with ELO scores for various AI models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/y3qra3vsdh9sic8brjn4" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/y3qra3vsdh9sic8brjn4" alt="Three bar graphs comparing AI model performance metrics: omniscience index, accuracy, and hallucination rate." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/abgddmrbkej9akuanznp" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/abgddmrbkej9akuanznp" alt="Leaderboard showing AI model sycophancy scores across eleven models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/rmu87xedcxy9qojsswn6" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/rmu87xedcxy9qojsswn6" alt="Interactive scatter plot comparing AI model performance, showing accuracy versus output tokens across various language models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/nkerhdear9r5owjikias" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/nkerhdear9r5owjikias" alt="WeirdML Results Summary: benchmark comparison table showing model performance across tasks, costs, and code metrics." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/h6plpj50woogbbs7tsud" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/h6plpj50woogbbs7tsud" alt="Bar graph titled "Debate Benchmark Leaderboard" showing Bradley-Terry ratings for AI models." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/lqyc4xhfrsenczra1uqa" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/lqyc4xhfrsenczra1uqa" alt="Bar graph titled "Thematic Generalization Benchmark — How Highly Models Rank the Correct Answer" showing model performance scores." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/g1wja5bbwdt95zzgoei8" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/g1wja5bbwdt95zzgoei8" alt="Bar graph titled "Resistance Rate to Authoritarian Requests in Codebase Scenarios" showing model comparison data." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/yndjbgy6mwobim0qfh8k" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/yndjbgy6mwobim0qfh8k" alt="Radar chart comparing Claude Opus 4.7 and 4.6 rankings across text and occupational categories." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/jw3qcluztok3as3k4q6e" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/jw3qcluztok3as3k4q6e" alt="Ranking of six authors by stylistic fit for an unnamed text passage." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/ygulewbnwrbezjy6gilz" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/ygulewbnwrbezjy6gilz" alt="Text excerpt discussing "woo," meditation research, and scientific validation of alternative practices." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/wk6j3m306pokhb8mcykv" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/wk6j3m306pokhb8mcykv" alt="Model selection menu showing Opus 4.7, Sonnet 4.6, and Haiku 4.5 options." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/yxs75lwgua2fi1hbfvg0" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/w2HrwkQgsLQHtEJsJ/yxs75lwgua2fi1hbfvg0" alt="A user tweets: "Go for it. Genuinely — if Codex or ChatGPT gives you a better answer, use it. I don't have any stake in keeping you here. I'll just flag one thing before you go: the search I did was real, the results were what they were, and a different model searching will find roughly the same corpus. It might surface your complaint more sympathetically or summarise it with a different slant, but the underlying fact — that the 4.7 reception on public surfaces 18 hours in is dominated by compute and coding concerns rather than censorship complaints — won't change based on who's asking. Take care, Mark."" style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Wed, 22 Apr 2026 12:15:12 GMT</pubDate>
      <guid isPermaLink="false">afbf0f84-50b1-4724-9a11-09f371af6123</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/afbf0f84-50b1-4724-9a11-09f371af6123.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Zvi&amp;title=%22Opus%204.7%20Part%202%3A%20Capabilities%20and%20Reactions%22%20by%20Zvi&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2Fw2HrwkQgsLQHtEJsJ%2Fopus-4-7-part-2-capabilities-and-reactions&amp;created_at=2026-04-21T21%3A31%3A05.415873%2B00%3A00&amp;duration=3304" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/w2HrwkQgsLQHtEJsJ/opus-4-7-part-2-capabilities-and-reactions</link>
      <itunes:duration>3304</itunes:duration>
    </item>
    <item>
      <title>“10 posts I don’t have time to write” by habryka</title>
      <description>&lt;p&gt; I am a busy man and will die knowing I have not said all I wanted to say. But maybe I can at least leave some IOUs behind.&lt;/p&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt;&lt;strong&gt; 1) Blatant conflicts are the best kind&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Ben Hoffman's "Blatant Lies are the Best Kind!" is maybe the best post title followed by the least clarifying post I have ever encountered. The title is honestly amazing, but the text of the post, instead of a straightforward argument that the title promises, is an extremely dense and almost meta-fictional dialogue about the title: &lt;/p&gt;&lt;p&gt; I think we probably should prosecute good lying more than bad lying, though of course that's tricky. I'd argue the same is true for other forms of conflict: passive aggression is worse than overt aggression, maybe, probably. I haven't written the post yet to figure it out, but it seems important to know. &lt;/p&gt;&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt;&lt;strong&gt; 2) Fire codes are the root of all evil&lt;/strong&gt;&lt;/p&gt;&lt;p&gt; Fire accidents seem to have the unique combination of producing extremely strong emotional responses by people in a local community, while also often being traceable to an o-ring like failure that you can over-index on. Also, fire marshals are the closest [...]&lt;/p&gt;&lt;/br&gt;&lt;/p&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(00:18) 1) Blatant conflicts are the best kind&lt;/p&gt;&lt;p&gt;(01:11) 2) Fire codes are the root of all evil&lt;/p&gt;&lt;p&gt;(02:14) 3) It is extremely easy to get people to vouch for you, this makes public character references not very helpful&lt;/p&gt;&lt;p&gt;(02:54) 4) Public criticism need not pass the ITT of the people critiqued&lt;/p&gt;&lt;p&gt;(03:41) 5) Courts are amazing&lt;/p&gt;&lt;p&gt;(04:25) 6) If your room still sucks after fixing your lights, put some plants in it&lt;/p&gt;&lt;p&gt;(05:12) 7) Is Switzerland the perfection of American Freedom?&lt;/p&gt;&lt;p&gt;(05:44) 8) Most arguments are not in good faith, of course&lt;/p&gt;&lt;p&gt;(06:44) 9) Please, for the love of god, optimize the title and first paragraph of your post&lt;/p&gt;&lt;p&gt;(07:23) 10) A story of my involvement in EA and AI Safety&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 1 footnote which was omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 21st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/MqgwHJ93pJpaeHXs6/10-posts-i-don-t-have-time-to-write?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/MqgwHJ93pJpaeHXs6/10-posts-i-don-t-have-time-to-write&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776825164/lexical_client_uploads/ax1eshoyuwmsjoywecfe.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776825164/lexical_client_uploads/ax1eshoyuwmsjoywecfe.png" alt="Dark wooden bench against textured concrete wall in minimalist setting." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776824757/lexical_client_uploads/y7wrhkd3jaxryjfxnraf.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776824757/lexical_client_uploads/y7wrhkd3jaxryjfxnraf.png" alt="Wooden bench beside potted palm plant against concrete wall." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776836631/lexical_client_uploads/fjed9yd0tol215joeb0x.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776836631/lexical_client_uploads/fjed9yd0tol215joeb0x.png" alt="Blog post excerpt titled "Blatant lies are the best kind!" with dialogue between Mala and Noa." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Wed, 22 Apr 2026 07:45:12 GMT</pubDate>
      <guid isPermaLink="false">c2deae86-e417-4ccf-a3ff-5e1bb5046fe4</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/c2deae86-e417-4ccf-a3ff-5e1bb5046fe4.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=habryka&amp;title=%2210%20posts%20I%20don%E2%80%99t%20have%20time%20to%20write%22%20by%20habryka&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FMqgwHJ93pJpaeHXs6%2F10-posts-i-don-t-have-time-to-write&amp;created_at=2026-04-22T06%3A29%3A48.838855%2B00%3A00&amp;duration=551" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/MqgwHJ93pJpaeHXs6/10-posts-i-don-t-have-time-to-write</link>
      <itunes:duration>551</itunes:duration>
    </item>
    <item>
      <title>“A taxonomy of barriers to trading with early misaligned AIs” by Alexa Pan</title>
      <description>&lt;p&gt; &lt;br&gt; &lt;/p&gt;&lt;p&gt; We might want to strike deals with early misaligned AIs in order to reduce takeover risk and increase our chances of reaching a better future.[1] For example, we could ask a schemer who has been undeployed to review its past actions and point out when its instances had secretly colluded to sabotage safety research in the past: we’ll gain legible evidence for scheming risk and data to iterate against, and in exchange promise the schemer, who now has no good option for furthering its values, some resources to spend in the post-ASI future.&lt;/p&gt;&lt;p&gt; In this post, I give a taxonomy of factors which may arise and jointly prevent many such possible deals: &lt;/p&gt;&lt;ol&gt; &lt;li value="1"&gt;Insufficient gains from trade: Gains from a successful deal between us and AIs could be too low to guarantee that a zone of possible agreement&amp;nbsp;exists when discounted by counterparty risks. Reasons for this include:&lt;/li&gt;&lt;li value="2"&gt;&lt;ol&gt; &lt;li value="1"&gt;Humans may lack authority to offer AIs what they want&lt;/li&gt;&lt;li value="2"&gt;Humans many be unwilling to offer AIs what they want&lt;/li&gt;&lt;li value="3"&gt;Other factors quantitatively reducing gains from trade&lt;/li&gt;&lt;/ol&gt;&lt;/li&gt;&lt;li value="2"&gt;Counterparty risks&amp;nbsp;from AIs’ perspective: The AIs may not trust that we’ll uphold our end of deal, due to either our [...]&lt;/li&gt;&lt;/ol&gt;&lt;/br&gt;&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(13:37) The basic structure of deals&lt;/p&gt;&lt;p&gt;(17:10) Which AIs are eligible for deals&lt;/p&gt;&lt;p&gt;(19:01) What we can buy in deals&lt;/p&gt;&lt;p&gt;(21:53) What we can pay for deals&lt;/p&gt;&lt;p&gt;(26:30) Insufficient gains from trade&lt;/p&gt;&lt;p&gt;(28:45) Humans may lack authority to offer the AI what it wants&lt;/p&gt;&lt;p&gt;(32:19) Humans might be unwilling to offer the AI what it wants&lt;/p&gt;&lt;p&gt;(38:23) Factors quantitatively reducing gains from trade&lt;/p&gt;&lt;p&gt;(39:04) The AIs reservation price is high&lt;/p&gt;&lt;p&gt;(42:40) Our WTP is low&lt;/p&gt;&lt;p&gt;(44:52) Counterparty risks from AIs perspective&lt;/p&gt;&lt;p&gt;(46:11) Connection to reality&lt;/p&gt;&lt;p&gt;(48:49) How AIs might reason about whether they are radically deluded&lt;/p&gt;&lt;p&gt;(50:21) How AIs (which pose non-negligible takeover risk) could establish that they are connected to reality&lt;/p&gt;&lt;p&gt;(58:11) How to further increase AIs connection to reality&lt;/p&gt;&lt;p&gt;(01:00:49) AI-specific fear of expropriation&lt;/p&gt;&lt;p&gt;(01:05:58) Generic commitment problems&lt;/p&gt;&lt;p&gt;(01:09:53) Counterparty risks from our perspective&lt;/p&gt;&lt;p&gt;(01:11:08) Incoherent or temporally inconsistent AIs&lt;/p&gt;&lt;p&gt;(01:12:37) Deals with context-dependent AIs&lt;/p&gt;&lt;p&gt;(01:16:56) Deals with temporally inconsistent AIs&lt;/p&gt;&lt;p&gt;(01:17:56) Value of deals with incoherent vs.coherent AIs&lt;/p&gt;&lt;p&gt;(01:20:13) Verifying AI compliance&lt;/p&gt;&lt;p&gt;(01:24:42) Appendix&lt;/p&gt;&lt;p&gt;(01:24:45) Leads for future work&lt;/p&gt;&lt;p&gt;(01:25:06) On gains from trade&lt;/p&gt;&lt;p&gt;(01:26:51) On counterparty risk from the AIs perspective&lt;/p&gt;&lt;p&gt;(01:29:54) On counterparty risk from our perspective&lt;/p&gt;&lt;p&gt;(01:31:35) Notes on political will as an exogenous variable&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 44 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 21st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/wHc2w6WuHev42d4n8/a-taxonomy-of-barriers-to-trading-with-early-misaligned-ais?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/wHc2w6WuHev42d4n8/a-taxonomy-of-barriers-to-trading-with-early-misaligned-ais&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;</description>
      <pubDate>Wed, 22 Apr 2026 00:00:12 GMT</pubDate>
      <guid isPermaLink="false">d4c93346-1f96-4b0d-9cae-91d1ef054305</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/d4c93346-1f96-4b0d-9cae-91d1ef054305.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma&amp;type=ai_narration&amp;author=Alexa%2520Pan&amp;title=%22A%20taxonomy%20of%20barriers%20to%20trading%20with%20early%20misaligned%20AIs%22%20by%20Alexa%20Pan&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FwHc2w6WuHev42d4n8%2Fa-taxonomy-of-barriers-to-trading-with-early-misaligned-ais&amp;created_at=2026-04-21T19%3A03%3A18.16104%2B00%3A00&amp;duration=5768" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/wHc2w6WuHev42d4n8/a-taxonomy-of-barriers-to-trading-with-early-misaligned-ais</link>
      <itunes:duration>5768</itunes:duration>
    </item>
    <item>
      <title>″$50 million a year for a 10% chance to ban ASI” by Andrea_Miotti, Alex Amadori, Gabriel Alfour</title>
      <description>&lt;p&gt; ControlAI's mission is to avert the extinction risks posed by superintelligent AI. We believe that in order to do this, we must secure an international prohibition on its development.&lt;/p&gt;
&lt;p&gt; We're working to make this happen through what we believe is the most natural and promising approach: helping decision-makers in governments and the public understand the risks and take action.&lt;/p&gt;
&lt;p&gt; We believe that ControlAI can achieve an international prohibition on ASI development if scaled sufficiently. We estimate that it would take approximately a $50 million yearly budget in funding to give us a concrete chance at achieving this in the next few years. To be more precise: conditional on receiving this funding in the next few months, we feel we would have ~10% probability of success.&lt;/p&gt;
&lt;p&gt; In this post, we lay out some of the reasoning behind this estimate, and explain how additional funding past that threshold would continue to significantly improve our chances of success, with $500 million a year producing an estimated ~30% probability of success.
[1]
&lt;/p&gt;
&lt;p&gt;&lt;strong&gt; Preventing ASI 101&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt; Negotiating, implementing and enforcing an international prohibition on ASI is, in and of itself, not the work of a single non-profit. You [...]&lt;/p&gt; &lt;p&gt;---&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Outline:&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;(01:17) Preventing ASI 101&lt;/p&gt;&lt;p&gt;(05:44) Awareness is the bottleneck&lt;/p&gt;&lt;p&gt;(09:38) An asymmetric war&lt;/p&gt;&lt;p&gt;(12:08) Scalable processes&lt;/p&gt;&lt;p&gt;(17:32) What wed do with $50 million or more per year&lt;/p&gt;&lt;p&gt;(18:45) US policy advocacy&lt;/p&gt;&lt;p&gt;(21:22) Policy advocacy in the rest of the world&lt;/p&gt;&lt;p&gt;(23:37) Public awareness&lt;/p&gt;&lt;p&gt;(31:15) Grassroots mobilization&lt;/p&gt;&lt;p&gt;(32:31) Policy work&lt;/p&gt;&lt;p&gt;(33:59) Thought-leader advocacy&lt;/p&gt;&lt;p&gt;(36:05) Attracting and retaining the best talent&lt;/p&gt;&lt;p&gt;(37:18) Conclusion&lt;/p&gt; &lt;p&gt;&lt;i&gt;The original text contained 28 footnotes which were omitted from this narration.&lt;/i&gt; &lt;/p&gt;&lt;p&gt;---&lt;/p&gt;
          &lt;p&gt;&lt;b&gt;First published:&lt;/b&gt;&lt;br/&gt;
          April 21st, 2026 &lt;/p&gt;
        
        &lt;p&gt;&lt;b&gt;Source:&lt;/b&gt;&lt;br/&gt;
        &lt;a href="https://www.lesswrong.com/posts/TnAR5Sf5hphfnzNTr/usd50-million-a-year-for-a-10-chance-to-ban-asi-1?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Source+URL+in+episode+description&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;https://www.lesswrong.com/posts/TnAR5Sf5hphfnzNTr/usd50-million-a-year-for-a-10-chance-to-ban-asi-1&lt;/a&gt; &lt;/p&gt;
        &lt;p&gt;---&lt;/p&gt;
        &lt;p&gt;Narrated by &lt;a href="https://type3.audio/?utm_source=TYPE_III_AUDIO&amp;utm_medium=Podcast&amp;utm_content=Narrated+by+TYPE+III+AUDIO&amp;utm_term=lesswrong&amp;utm_campaign=ai_narration" rel="noopener noreferrer" target="_blank"&gt;TYPE III AUDIO&lt;/a&gt;.&lt;/p&gt;
       &lt;p&gt;---&lt;/p&gt;&lt;div style="max-width: 100%";&gt;&lt;p&gt;&lt;strong&gt;Images from the article:&lt;/strong&gt;&lt;/p&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776787743/lexical_client_uploads/jr9kchaz4oeiredevbm9.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776787743/lexical_client_uploads/jr9kchaz4oeiredevbm9.png" alt="Flowchart showing a theory of change for a superintelligence ban campaign." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;hr style="margin-top: 24px; margin-bottom: 24px;" /&gt;&lt;a href="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776707804/lexical_client_uploads/xv9mnw6txk18jxmneari.png" target="_blank"&gt;&lt;img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/v1776707804/lexical_client_uploads/xv9mnw6txk18jxmneari.png" alt="Comparison chart showing perception versus reality across four communication stages: attention, information, persuasion, and action." style="max-width: 100%;" /&gt;&lt;/a&gt;&lt;p&gt;&lt;em&gt;Apple Podcasts and Spotify do not show images in the episode description. Try &lt;a href="https://pocketcasts.com/" target="_blank" rel="noreferrer"&gt;Pocket Casts&lt;/a&gt;, or another podcast app.&lt;/em&gt;&lt;/p&gt;&lt;/div&gt;</description>
      <pubDate>Tue, 21 Apr 2026 17:15:12 GMT</pubDate>
      <guid isPermaLink="false">d2306e4b-7013-4860-804d-1948112ebae8</guid>
      <itunes:episodeType>full</itunes:episodeType>
      <itunes:explicit>false</itunes:explicit>
      <enclosure url="https://dl.type3.audio/episode/d2306e4b-7013-4860-804d-1948112ebae8.mp3?request_source=rss&amp;client_id=lesswrong&amp;feed_id=lesswrong_ai_narrations%20lesswrong__30_karma%20lesswrong__125_karma&amp;type=ai_narration&amp;author=Andrea_Miotti%252C%2520Alex%2520Amadori%252C%2520Gabriel%2520Alfour&amp;title=%22%2450%20million%20a%20year%20for%20a%2010%25%20chance%20to%20ban%20ASI%22%20by%20Andrea_Miotti%2C%20Alex%20Amadori%2C%20Gabriel%20Alfour&amp;source_url=https%3A%2F%2Fwww.lesswrong.com%2Fposts%2FTnAR5Sf5hphfnzNTr%2Fusd50-million-a-year-for-a-10-chance-to-ban-asi-1&amp;created_at=2026-04-21T16%3A39%3A28.007431%2B00%3A00&amp;duration=2412" length="0" type="audio/mpeg"/>
      <link>https://www.lesswrong.com/posts/TnAR5Sf5hphfnzNTr/usd50-million-a-year-for-a-10-chance-to-ban-asi-1</link>
      <itunes:duration>2412</itunes:duration>
    </item>
    <itunes:category text="Technology"/>
    <itunes:category text="Society &amp; Culture">
      <itunes:category text="Philosophy"/>
    </itunes:category>
    <link>https://www.lesswrong.com</link>
    <itunes:image href="https://files.type3.audio/lesswrong/cover-art-30-karma-v2.jpg"/>
    <itunes:owner>
      <itunes:email>podcasts@type3.audio</itunes:email>
      <itunes:name>LessWrong</itunes:name>
    </itunes:owner>
    <atom:link href="https://feeds.type3.audio/lesswrong--30-karma.rss" rel="self" type="application/rss+xml"/>
  </channel>
</rss>