LessWrong (30+ Karma)

“Eat The Richtext” by dreeves

18 Nov 2025

Contributed by Lukas

A year and a half ago I vibe-coded a tool, Eat The Richtext, that I've been using practically every day (every week in any case) ever since. Friends ...

“Small batches and the mythical single piece flow” by habryka

18 Nov 2025

Contributed by Lukas

Context: Post #8 in my sequence of private Lightcone Infrastructure memos edited for public consumption. When you finish something, you learn somethi...

“How Colds Spread” by RobertM

18 Nov 2025

Contributed by Lukas

It seems like a catastrophic civilizational failure that we don't have confident common knowledge of how colds spread. There have been a number of st...

“Middlemen Are Eating the World (And That’s Good, Actually)” by Linch

18 Nov 2025

Contributed by Lukas

I think many people have some intuition that work can be separated between “real work“ (farming, say, or building trains) and “middlemen” (e....

“Why is American mass-market tea so terrible?” by RobertM

18 Nov 2025

Contributed by Lukas

Note: definitely true, especially my aesthetic preferences, and the speculative historical synthesis. There are some hedonic treadmills which, even a...

“An Analogue Of Set Relationships For Distribution” by johnswentworth, David Lorell

18 Nov 2025

Contributed by Lukas

Audio note: this article contains 86 uses of latex notation, so the narration may be difficult to follow. There's a link to the original text in the...

“AI 2025 - Last Shipmas” by Simon Lermen

18 Nov 2025

Contributed by Lukas

ACT I: CHRISTMAS EVE It all starts with a cryptic tweet from Jimmy Apples on X. The tweet by Jimmy Apples makes people at other AI labs quite nervous....

“Varieties Of Doom” by jdp

18 Nov 2025

Contributed by Lukas

There has been a lot of talk about "p(doom)" over the last few years. This has always rubbed me the wrong way because "p(doom)" didn't feel like it m...

“Mediators: a different route through conflict” by Ben Pace

17 Nov 2025

Contributed by Lukas

(content note: discussion of war and mass death; also a long aside about the philosophy of apologies) After 100,000 people were killed in the Bosnia...

“Lobsang’s Children” by Tomás B.

17 Nov 2025

Contributed by Lukas

I study so hard. My grandfather makes me. It is not fun. My life is studying. I am home-schooled. I don't have much freedom. Grandfather says it is i...

“Close open loops” by habryka

17 Nov 2025

Contributed by Lukas

Context: Post #6 in my sequence of private Lightcone Infrastructure memos edited for public consumption. David Allen, of Getting Things Done fame say...

“Video games are philosophy’s playground” by Rachel Shu

17 Nov 2025

Contributed by Lukas

Crypto people have this saying: "cryptocurrencies are macroeconomics' playground." The idea is that blockchains let you cheaply spin up toy economies...

“Mixed Feelings on Social Munchkinry” by Screwtape

17 Nov 2025

Contributed by Lukas

This is less me expounding on a thesis and more me musing about a topic where I have conflicting intuitions. Epistemic status: exploratory. One thing...

“Diagonalization: A (slightly) more rigorous model of paranoia” by habryka

17 Nov 2025

Contributed by Lukas

In my post on Wednesday (Paranoia: A Beginner's Guide), I talked at a high level about the experience of paranoia, and gave two models (the lemons ma...

“Where is the Capital? An Overview” by johnswentworth

17 Nov 2025

Contributed by Lukas

When a new dollar goes into the capital markets, after being bundled and securitized and lent several times over, where does it end up? When society'...

“Matrices map between biproducts” by jessicata

16 Nov 2025

Contributed by Lukas

Audio note: this article contains 98 uses of latex notation, so the narration may be difficult to follow. There's a link to the original text in the...

“Why does ChatGPT think mammoths were alive December?” by Steffee

16 Nov 2025

Contributed by Lukas

The is a slimmed down version which omits some extra examples but includes my theorizing about ChatGPT, my investigations of it, and my findings. Epi...

“7 Vicious Vices of Rationalists” by Ben Pace

16 Nov 2025

Contributed by Lukas

Vices aren't behaviors that one should never do. Rather, vices are behaviors that are fine and pleasurable to do in moderation, but tempting to do in...

“Put numbers on stuff, all the time, otherwise scope insensitivity will eat you” by habryka

16 Nov 2025

Contributed by Lukas

Context: Post #6 in my sequence of private Lightcone Infrastructure memos edited for public consumption. In almost any role at Lightcone you will hav...

“The skills and physics of high-performance driving, Pt. 1” by Ruby

16 Nov 2025

Contributed by Lukas

High performance driving = motorsport = racecar driving Even if you have a license and drive a car, you probably don't understand what is hard about ...

“AI safety undervalues founders” by Ryan Kidd

16 Nov 2025

Contributed by Lukas

TL;DR: In AI safety, we systematically undervalue founders and field‑builders relative to researchers and prolific writers. This status gradient pu...

“Your Clone Wants to Kill You Because You Lack Self Knowledge” by Algon

16 Nov 2025

Contributed by Lukas

My friend @Croissanthology is puzzled why it is such a common trope for fictional clones to turn on their creators. There's the Doylist answer that i...

“Don’t use the phrase ‘human values’” by Nina Panickssery

15 Nov 2025

Contributed by Lukas

I really dislike the phrase "human values". I think it's confusing because: It obscures a distinction between human preferences and normative values...

“Increasing marginal returns to effort are common” by habryka

15 Nov 2025

Contributed by Lukas

Context: Every Sunday I write a mini-essay about an operating principle of Lightcone Infrastructure that I want to remind my team about. This is post...

“Generation Ship: A Protest Song For PauseAI” by LoganStrohl

15 Nov 2025

Contributed by Lukas

Link to listen. Lyrics Verse 1 I've heard the Earth was holy before the code broke through that rainfall raptured deserts into bloom I've heard that ...

“‘But You’d Like To Feel Companionate Love, Right? ... Right?’” by johnswentworth

15 Nov 2025

Contributed by Lukas

  One of the responses which one will predictably receive when posting something titled “How I Learned That I Don't Feel Companionate Love” i...

“Understanding and Controlling LLM Generalization” by Daniel Tan

15 Nov 2025

Contributed by Lukas

A distillation of my long-term research agenda and current thinking. I welcome takes on this. Why study generalization?  I'm interested in stud...

“AI Craziness: Additional Suicide Lawsuits and The Fate of GPT-4o” by Zvi

15 Nov 2025

Contributed by Lukas

GPT-4o has been a unique problem for a while, and has been at the center of the bulk of mental health incidents involving LLMs that didn’t involve ...

“AI Corrigibility Debate: Max Harms vs. Jeremy Gillen” by Liron, Max Harms, Jeremy Gillen

14 Nov 2025

Contributed by Lukas

Is focusing on corrigibility our best shot at getting to ASI alignment? Max Harms and Jeremy Gillen are current and former MIRI alignment researchers...

“10” by Ben Pace

14 Nov 2025

Contributed by Lukas

Several artists and professionals have come to Inkhaven to share their advice. They keep talking about form—even if you have a raw feeling or inter...

“Everyone has a plan until they get lied to the face” by Screwtape

14 Nov 2025

Contributed by Lukas

"Everyone has a plan until they get punched in the face." - Mike Tyson (The exact phrasing of that quote changes, this is my favourite.) I think the...

“The rare, deadly virus lurking in the Southwest US, and the bigger picture” by eukaryote

14 Nov 2025

Contributed by Lukas

If you live in this one tiny county in California, you might be more likely to die from Sin Nombre Virus than in a car crash. In the same way that “...

“Creditworthiness should not be for sale” by habryka

14 Nov 2025

Contributed by Lukas

1.  Most large-scale fraud follows basically the same story: 1. Some trader or executive gets in a position where they can use a bunch of other...

“Types of systems that could be useful for agent foundations” by Alex_Altair

14 Nov 2025

Contributed by Lukas

In this post, I've written something that would have been very helpful to my former self from a few years ago. Given that, it may or may not be helpf...

“The Charge of the Hobby Horse” by TsviBT

14 Nov 2025

Contributed by Lukas

Crosspost from my blog. [Epistemic status: !! 🚨 Drama Alert 🚨 !! discoursepoasting, LWslop] Case 1: You only get six words In 2024, the MAT...

“Two can keep a secret if one is dead. So please share everything with at least one person.” by habryka

14 Nov 2025

Contributed by Lukas

A lot of things go better if more people have more context on the state of a project. Just to name a few: Others can point out mistakes People can b...

“Why Truth First?” by johnswentworth

14 Nov 2025

Contributed by Lukas

On a warm spring weekend, Jerry B wanders through Hyde Park. At a corner, he happens upon the Preacher Man, standing on a soapbox and proclaiming the...

“Orient Speed in the 21st Century” by Raemon

14 Nov 2025

Contributed by Lukas

I wrote this post with an audience of "artists who are worried about AI" in mind, published on a new blog, The Human Spirit. [1] My guess is, the 21s...

“Tell people as early as possible it’s not going to work out” by habryka

14 Nov 2025

Contributed by Lukas

Context: Post #4 in my sequence of private Lightcone Infrastructure memos edited for public consumption This week's principle is more about how I wan...

“Epistemic Spot Check: Expected Value of Donating to Alex Bores’s Congressional Campaign” by MichaelDickens

14 Nov 2025

Contributed by Lukas

Political advocacy is an important lever for reducing existential risk. One way to make political change happen is to support candidates for Congress...

“(Fantasy) -> (Planning): A Core Mental Move For Agentic Humans?” by johnswentworth

14 Nov 2025

Contributed by Lukas

So there's this thing where… Back when I was young, I watched the movie Atlantis, and then spent a while thinking through how to build an actual ci...

“Weight-sparse transformers have interpretable circuits” by leogao

13 Nov 2025

Contributed by Lukas

TL;DR: We develop a novel method for finding interpretable circuits in Transformers, by training them to have sparse weights. This results in models ...

“What’s so hard about...? A question worth asking” by Ruby

13 Nov 2025

Contributed by Lukas

There's a wide range of tasks that most people get why they’re hard. And then there are activities where I think a lot of people might think to the...

“Paranoia rules everything around me” by habryka

13 Nov 2025

Contributed by Lukas

People sometimes make mistakes [citation needed]. The obvious explanation for most of those mistakes is that decision makers do not have access to th...

“Favorite quotes from ‘High Output Management’” by Nina Panickssery

13 Nov 2025

Contributed by Lukas

Some months ago I read the classic management book High Output Management and made a note of quotes that rang particularly true to me. I normally dis...

“The Pope Offers Wisdom” by Zvi

13 Nov 2025

Contributed by Lukas

The Pope is a remarkably wise and helpful man. He offered us some wisdom. Yes, he is generally playing on easy mode by saying straightforwardly true...

“Introducing faruvc.org” by jefftk

12 Nov 2025

Contributed by Lukas

I wanted to link an explanation of how far-UVC works, why you might want to use it to clean indoor air, and what we know about its safety. I didn...

“Please, Don’t Roll Your Own Metaethics” by Wei Dai

12 Nov 2025

Contributed by Lukas

One day, when I was an interning at the cryptography research department of a large software company, my boss handed me an assignment to break a pseu...

“Warning Aliens About the Dangerous AI We Might Create” by James_Miller, avturchin

12 Nov 2025

Contributed by Lukas

Thesis: We should broadcast a warning to potential extraterrestrial listeners that Earth might soon spawn an unfriendly computer superintelligence. S...

“Do not hand off what you cannot pick up” by habryka

12 Nov 2025

Contributed by Lukas

Delegation is good! Delegation is the foundation of civilization! But in the depths of delegation madness breeds and evil rises. In my experience, t...

“5 Things I Learned After 10 Days of Inkhaven” by Ben Pace

12 Nov 2025

Contributed by Lukas

If you don't know, Inkhaven is a residency where you come and publish a blogpost every day. No "Oh it would be nice to blog some day" or "Oh I'm work...

“How I Learned That I Don’t Feel Love” by johnswentworth

12 Nov 2025

Contributed by Lukas

A few months ago, I learned that I probably can’t feel the emotions signalled by oxytocin, the "love hormone". This raises lots of interesting ques...

“Consciousness as a Distributed Ponzi Scheme” by abramdemski

12 Nov 2025

Contributed by Lukas

The term "distributed Ponzi scheme" here is not derogatory -- many currencies are distributed Ponzi schemes, and that seems fine.[1] I use this termi...

“Kimi K2 Thinking” by Zvi

11 Nov 2025

Contributed by Lukas

I previously covered Kimi K2, which now has a new thinking version. As I said at the time back in July, price in that the thinking version is coming....

“France is ready to stand alone” by Lucie Philippon

11 Nov 2025

Contributed by Lukas

First part of a series of article on French AI Policy that I’m currently writing as part of the Inkhaven Residency. For three centuries, France has...

“Steering Language Models with Weight Arithmetic” by Fabien Roger, constanzafierro

11 Nov 2025

Contributed by Lukas

We isolate behavior directions in weight-space by subtracting the weight deltas from two small fine-tunes - one that induces the desired behavior on ...

“The problem of graceful deference” by TsviBT

11 Nov 2025

Contributed by Lukas

Crosspost from my blog. Moral deference Sometimes when I bring up the subject of reprogenetics, people get uncomfortable. "So you want to do eugeni...

“How likely is dangerous AI in the short term?” by Nikola Jurkovic

11 Nov 2025

Contributed by Lukas

How large of a breakthrough is necessary for dangerous AI? In order to cause a catastrophe, an AI system would need to be very competent at agentic t...

“Questioning the Requirements” by habryka

11 Nov 2025

Contributed by Lukas

Context: Every Sunday I write a mini-essay about an operating principle of Lightcone Infrastructure that I want to remind my team about. I've been do...

“Andrej Karpathy on LLM cognitive deficits” by Nina Panickssery

11 Nov 2025

Contributed by Lukas

Excerpt from Dwarkesh Patel's interview with Andrej Karpathy that I think is valuable for LessWrong-ers to read. I think he's basically correct. Emph...

[Linkpost] “Untitled Draft” by Gabriel Alfour

10 Nov 2025

Contributed by Lukas

This is a link post. I basically fully endorse the full article. I like the concluding bit too. This brings me to my own contribution to the already-f...

“An Ontology for AI Cults and Cyber Egregores” by Jan_Kulveit

10 Nov 2025

Contributed by Lukas

I haven't found concepts useful for thinking about this: written in one place, so here is an ontology which I find useful. Prerequisite: Dennett t...

“Myopia Mythology” by abramdemski

10 Nov 2025

Contributed by Lukas

It's been a while since I wrote about myopia! My previous posts about myopia were "a little crazy", because it's not this solid well-defined thing; i...

“Three Kinds Of Ontological Foundations” by johnswentworth

10 Nov 2025

Contributed by Lukas

Why does a water bottle seem like a natural chunk of physical stuff to think of as “A Thing”, while the left half of the water bottle seems like ...

“Learning information which is full of spiders” by Screwtape

10 Nov 2025

Contributed by Lukas

This essay contains an examination of handling information which is unpleasant to learn. Also, more references to spiders than most people want. CW: ...

[Linkpost] “Book Announcement: The Gentle Romance” by Richard_Ngo

10 Nov 2025

Contributed by Lukas

This is a link post. It's been eight months since I released my last story, so you could be forgiven for thinking that I’d given up on writing ficti...

“Manifest X DC Opening Benediction - Making Friends Along the Way” by JohnofCharleston

10 Nov 2025

Contributed by Lukas

Manifest X DC was this weekend, hopefully the first of many local spin-offs of Manifest. Despite a late prediction market surge, there were no fires....

“Problems I’ve Tried to Legibilize” by Wei Dai

10 Nov 2025

Contributed by Lukas

Looking back, it appears that much of my intellectual output could be described as legibilizing work, or trying to make certain problems in AI risk m...

“Condensation” by abramdemski

09 Nov 2025

Contributed by Lukas

Condensation: a theory of concepts is a model of concept-formation by Sam Eisenstat. Its goals and methods resemble John Wentworth's natural abstract...

“One Shot Singalonging is an attitude, not a skill or song-difficulty-level” by Raemon

09 Nov 2025

Contributed by Lukas

Rationalist Winter Solstice is (in most cities) a singalong event. This worked extremely straightforwardly well in a living room in 2012, and in my f...

“Insofar As I Think LLMs ‘Don’t Really Understand Things’, What Do I Mean By That?” by johnswentworth

09 Nov 2025

Contributed by Lukas

When I put on my LLM skeptic hat, sometimes I think things like “LLMs don’t really understand what they’re saying”. What do I even mean by th...

“Omniscaling to MNIST” by cloud

08 Nov 2025

Contributed by Lukas

In this post, I describe a mindset that is flawed, and yet helpful for choosing impactful technical AI safety research projects. The mindset is this:...

“Comparing Payor & Löb” by abramdemski

08 Nov 2025

Contributed by Lukas

Audio note: this article contains 49 uses of latex notation, so the narration may be difficult to follow. There's a link to the original text in the...

“Against ‘You can just do things’” by zroe1

08 Nov 2025

Contributed by Lukas

The barriers between us and what we want are often entirely imagined. It is true: you can learn how to paint, change careers, write a paper or run a ...

“Unexpected Things that are People” by Ben Goldhaber

08 Nov 2025

Contributed by Lukas

Cross-posted from https://bengoldhaber.substack.com/ It's widely known that Corporations are People. This is universally agreed to be a good thing; I...

“Escalation and perception” by TsviBT

08 Nov 2025

Contributed by Lukas

Crosspost from my blog. Introduction Conflict pervades the world. Conflict can come from mere mistkes, but many conflicts are not mere mistakes. We...

“Entity Review: Pythia” by plex

08 Nov 2025

Contributed by Lukas

[CW: Retrocausality, omnicide, philosophy] Three decades ago a strange philosopher was pouring ideas onto paper in a stimulant-fueled frenzy. He wrot...

“Mourning a life without AI” by Nikola Jurkovic

08 Nov 2025

Contributed by Lukas

Recently, I looked at the one pair of winter boots I own, and I thought “I will probably never buy winter boots again.” The world as we know it p...

“AI is not inevitable.” by David Scott Krueger (formerly: capybaralet)

08 Nov 2025

Contributed by Lukas

AI companies are explicitly trying to build AIs that are smarter than humans, despite clear signs that it might lead to human extinction. It will be ...

“Anthropic & Dario’s dream” by Simon Lermen

08 Nov 2025

Contributed by Lukas

Recently, Joe Carlsmith switched to work at Anthropic. He joins other members of the larger EA and Open Philanthropy ecosystem who are working at the...

“13 Arguments About a Transition to Neuralese AIs” by Rauno Arike

07 Nov 2025

Contributed by Lukas

Over the past year, I have talked to several people about whether they expect frontier AI companies to transition away from the current paradigm of t...

“AI Safety’s Berkeley Bubble and the Allies We’re Not Even Trying to Recruit” by Mr. Counsel

07 Nov 2025

Contributed by Lukas

Epistemic status: outside view critique based on public discourse, some HQ/location discussion, and a bit of lived experience. I know there are excep...

“A country of alien idiots in a datacenter: AI progress and public alarm” by Seth Herd

07 Nov 2025

Contributed by Lukas

Epistemic status: I'm pretty sure AI will alarm the public enough to change the alignment challenge substantially. I offer my mainline scenario as an...

[Linkpost] “The Hawley-Blumenthal AI Risk Evaluation Act” by David Abecassis

07 Nov 2025

Contributed by Lukas

This is a link post. Views expressed here are those of the author. The Artificial Intelligence Risk Evaluation Act is an exciting step toward preventi...

“Two easy digital intentionality practices” by mingyuan

07 Nov 2025

Contributed by Lukas

A lot of people are daunted by the idea of doing a full digital declutter. Those people ask me all the time, “isn’t there something easier I can ...

“Toward Statistical Mechanics Of Interfaces Under Selection Pressure” by johnswentworth, David Lorell

07 Nov 2025

Contributed by Lukas

Audio note: this article contains 36 uses of latex notation, so the narration may be difficult to follow. There's a link to the original text in the...

“My new nonprofit Evitable is hiring.” by David Scott Krueger (formerly: capybaralet)

07 Nov 2025

Contributed by Lukas

https://evitable.com/ Our mission is to inform and organize the public to confront societal-scale risks of AI, and put an end to the reckless race to...

[Linkpost] “Debunking ‘When Prophecy Fails’” by Matrice Jacobine

07 Nov 2025

Contributed by Lukas

This is a link post. In 1954, Dorothy Martin predicted an apocalyptic flood and promised her followers rescue by flying saucers. When neither arrived,...

“AI #141: Give Us The Money” by Zvi

07 Nov 2025

Contributed by Lukas

OpenAI does not waste time. On Friday I covered their announcement that they had ‘completed their recapitalization’ by converting into a PBC, in...

“A Guide To Being Persuasive About AI Dangers” by Mikhail Samin

06 Nov 2025

Contributed by Lukas

I think I’m pretty good at convincing people about AI dangers. This post talks about the basics of speaking convincingly about AI dangers to people...

“Halfway to Anywhere” by Screwtape

06 Nov 2025

Contributed by Lukas

“If you can get your ship into orbit, you’re halfway to anywhere.” - Robert Heinlein This generalizes. 1. Spaceflight is hard. Putting a rocket...

“People Seem Funny In The Head About Subtle Signals” by johnswentworth

06 Nov 2025

Contributed by Lukas

WARNING: This post contains spoilers for Harry Potter and the Methods of Rationality, and I will not warn about them further. Also some anecdotes fro...

“A 2032 Takeoff Story” by romeo

06 Nov 2025

Contributed by Lukas

I spent 3 recent Sundays writing my mainline AI scenario. Having only spent 3 days on it, it's not very well-researched (especially in the areas wher...

“Anthropic Commits To Model Weight Preservation” by Zvi

05 Nov 2025

Contributed by Lukas

Anthropic announced a first step on model deprecation and preservation, promising to retain the weights of all models seeing significant use, includi...

“Meta-agentic Prisoner’s Dilemmas” by TsviBT

05 Nov 2025

Contributed by Lukas

Crosspost from my blog. In the classic Prisoner's Dilemma (https://www.lesswrong.com/w/prisoner-s-dilemma), there are two agents with the same belie...

“New homepage for AI safety resources – AISafety.com redesign” by Bryce Robertson, Søren Elverlin, Melissa Samworth

05 Nov 2025

Contributed by Lukas

For those relatively new to AI safety, AISafety.com helps them navigate the space, providing lists of things like self-study courses, funders, commun...

“Being ‘Usefully Concrete’” by Raemon

05 Nov 2025

Contributed by Lukas

Or: "Who, what, when, where?" -> "Why?" In "What's hard about this? What can I do about that?", I talk about how, when you're facing a difficult s...

“Modeling the geopolitics of AI development” by Alex Amadori, Gabriel Alfour, Andrea_Miotti, Eva_B

05 Nov 2025

Contributed by Lukas

We model how rapid AI development may reshape geopolitics in the absence of international coordination on preventing dangerous AI development. We foc...

“Thoughts by a non-economist on AI and economics” by boazbarak

05 Nov 2025

Contributed by Lukas

[Crossposted on Windows In Theory] “Modern humans first emerged about 100,000 years ago. For the next 99,800 years or so, nothing happened. Well,...

“Heroic Responsibility” by johnswentworth

05 Nov 2025

Contributed by Lukas

Meta: Heroic responsibility is a standard concept on LessWrong. I was surprised to find that we don't have a post explaining it to people not already...

Activity Overview

Episodes

“Eat The Richtext” by dreeves

“Small batches and the mythical single piece flow” by habryka

“How Colds Spread” by RobertM

“Middlemen Are Eating the World (And That’s Good, Actually)” by Linch

“Why is American mass-market tea so terrible?” by RobertM

“An Analogue Of Set Relationships For Distribution” by johnswentworth, David Lorell

“AI 2025 - Last Shipmas” by Simon Lermen

“Varieties Of Doom” by jdp

“Mediators: a different route through conflict” by Ben Pace

“Lobsang’s Children” by Tomás B.

“Close open loops” by habryka

“Video games are philosophy’s playground” by Rachel Shu

“Mixed Feelings on Social Munchkinry” by Screwtape

“Diagonalization: A (slightly) more rigorous model of paranoia” by habryka

“Where is the Capital? An Overview” by johnswentworth

“Matrices map between biproducts” by jessicata

“Why does ChatGPT think mammoths were alive December?” by Steffee

“7 Vicious Vices of Rationalists” by Ben Pace

“Put numbers on stuff, all the time, otherwise scope insensitivity will eat you” by habryka

“The skills and physics of high-performance driving, Pt. 1” by Ruby

“AI safety undervalues founders” by Ryan Kidd

“Your Clone Wants to Kill You Because You Lack Self Knowledge” by Algon

“Don’t use the phrase ‘human values’” by Nina Panickssery

“Increasing marginal returns to effort are common” by habryka

“Generation Ship: A Protest Song For PauseAI” by LoganStrohl

“‘But You’d Like To Feel Companionate Love, Right? ... Right?’” by johnswentworth

“Understanding and Controlling LLM Generalization” by Daniel Tan

“AI Craziness: Additional Suicide Lawsuits and The Fate of GPT-4o” by Zvi

“AI Corrigibility Debate: Max Harms vs. Jeremy Gillen” by Liron, Max Harms, Jeremy Gillen

“10” by Ben Pace

“Everyone has a plan until they get lied to the face” by Screwtape

“The rare, deadly virus lurking in the Southwest US, and the bigger picture” by eukaryote

“Creditworthiness should not be for sale” by habryka

“Types of systems that could be useful for agent foundations” by Alex_Altair

“The Charge of the Hobby Horse” by TsviBT

“Two can keep a secret if one is dead. So please share everything with at least one person.” by habryka

“Why Truth First?” by johnswentworth

“Orient Speed in the 21st Century” by Raemon

“Tell people as early as possible it’s not going to work out” by habryka

“Epistemic Spot Check: Expected Value of Donating to Alex Bores’s Congressional Campaign” by MichaelDickens

“(Fantasy) -> (Planning): A Core Mental Move For Agentic Humans?” by johnswentworth

“Weight-sparse transformers have interpretable circuits” by leogao

“What’s so hard about...? A question worth asking” by Ruby

“Paranoia rules everything around me” by habryka

“Favorite quotes from ‘High Output Management’” by Nina Panickssery

“The Pope Offers Wisdom” by Zvi

“Introducing faruvc.org” by jefftk

“Please, Don’t Roll Your Own Metaethics” by Wei Dai

“Warning Aliens About the Dangerous AI We Might Create” by James_Miller, avturchin

“Do not hand off what you cannot pick up” by habryka

“5 Things I Learned After 10 Days of Inkhaven” by Ben Pace

“How I Learned That I Don’t Feel Love” by johnswentworth

“Consciousness as a Distributed Ponzi Scheme” by abramdemski

“Kimi K2 Thinking” by Zvi

“France is ready to stand alone” by Lucie Philippon

“Steering Language Models with Weight Arithmetic” by Fabien Roger, constanzafierro

“The problem of graceful deference” by TsviBT

“How likely is dangerous AI in the short term?” by Nikola Jurkovic

“Questioning the Requirements” by habryka

“Andrej Karpathy on LLM cognitive deficits” by Nina Panickssery

[Linkpost] “Untitled Draft” by Gabriel Alfour

“An Ontology for AI Cults and Cyber Egregores” by Jan_Kulveit

“Myopia Mythology” by abramdemski

“Three Kinds Of Ontological Foundations” by johnswentworth

“Learning information which is full of spiders” by Screwtape

[Linkpost] “Book Announcement: The Gentle Romance” by Richard_Ngo

“Manifest X DC Opening Benediction - Making Friends Along the Way” by JohnofCharleston

“Problems I’ve Tried to Legibilize” by Wei Dai

“Condensation” by abramdemski

“One Shot Singalonging is an attitude, not a skill or song-difficulty-level” by Raemon

“Insofar As I Think LLMs ‘Don’t Really Understand Things’, What Do I Mean By That?” by johnswentworth

“Omniscaling to MNIST” by cloud

“Comparing Payor & Löb” by abramdemski

“Against ‘You can just do things’” by zroe1

“Unexpected Things that are People” by Ben Goldhaber

“Escalation and perception” by TsviBT

“Entity Review: Pythia” by plex