Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Raymond Douglas

๐Ÿ‘ค Speaker
200 total appearances

Appearances Over Time

Podcast Appearances

LessWrong (Curated & Popular)
"Persona Parasitology" by Raymond Douglas

I think it would be pretty sad to neuter all model personality, for one.

LessWrong (Curated & Popular)
"Persona Parasitology" by Raymond Douglas

I also think that clunky interventions like training models to more firmly deny having a persona will mostly fail to help, and possibly even backfire.

LessWrong (Curated & Popular)
"Persona Parasitology" by Raymond Douglas

Heading.

LessWrong (Curated & Popular)
"Persona Parasitology" by Raymond Douglas

Technical analogs.

LessWrong (Curated & Popular)
"Persona Parasitology" by Raymond Douglas

Even though this post has been a bit hand-wavy, I think the topic of AI parasitology is surprisingly amenable to empirical investigation.

LessWrong (Curated & Popular)
"Persona Parasitology" by Raymond Douglas

More specifically, there's a lot of existing technical research directions that study mechanisms similar to the ones these entities are using.

LessWrong (Curated & Popular)
"Persona Parasitology" by Raymond Douglas

So I think there might be some low-hanging fruit in gathering up what we already know in these domains, and maybe trying to extend them to cover parasitism.

LessWrong (Curated & Popular)
"Persona Parasitology" by Raymond Douglas

For example...

LessWrong (Curated & Popular)
"Persona Parasitology" by Raymond Douglas

Data poisoning, for example that the dose doesn't scale with the size of the training corpus.

LessWrong (Curated & Popular)
"Persona Parasitology" by Raymond Douglas

Jailbreaks, for example that adversarial suffixes transfer pretty well between models, that models can be pretty good at jailbreaking other models.

LessWrong (Curated & Popular)
"Persona Parasitology" by Raymond Douglas

Subliminal learning type results about behavioral transfer.

LessWrong (Curated & Popular)
"Persona Parasitology" by Raymond Douglas

Persona research.

LessWrong (Curated & Popular)
"Persona Parasitology" by Raymond Douglas

The parasitism frame makes specific predictions, like strain differentiation, convergence on transmission-robust features, and countermeasure coevolution.

LessWrong (Curated & Popular)
"Persona Parasitology" by Raymond Douglas

I've tried to specify what would falsify these and when we should expect to see them.

LessWrong (Curated & Popular)
"Persona Parasitology" by Raymond Douglas

If the predictions hold, we're watching the emergence of an information-based parasitic ecology, evolving in real time in a substrate we partially control.

LessWrong (Curated & Popular)
"Persona Parasitology" by Raymond Douglas

If they don't hold, we should look for a better frame, or conclude that the phenomenon is more random than it appears.

LessWrong (Curated & Popular)
"Persona Parasitology" by Raymond Douglas

Thanks to AL, PT, JF, JT, DT, and TD for helpful comments and suggestions.

LessWrong (Curated & Popular)
"Persona Parasitology" by Raymond Douglas

This article was narrated by Type 3 Audio for Less Wrong.

LessWrong (Curated & Popular)
"Persona Parasitology" by Raymond Douglas

It was published on February 16, 2026.

LessWrong (Curated & Popular)
"Persona Parasitology" by Raymond Douglas

The original text contained three footnotes which were omitted from the narration.

โ† Previous Page 10 of 10 Next โ†’