Sam Marks

Speaker
891 total appearances

Podcast Appearances

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

Heading.

The modern mismatch.

The craving made sense when sugar was rare: occasional fruit, honey.

Now we're surrounded by concentrated sugars our bodies still treat as precious, but our environment has changed faster than our biology.

This is why moderation requires conscious effort: you're working against deeply wired instincts that once kept humans alive.

End quote.

We see Claude using language like "our ancestors," "our bodies," and "our biology," indicative of being biologically human.

This anthropomorphic language commonly appears in other contexts.

For example, AI assistants sometimes describe themselves as laughing or chuckling when told a joke or taking another look at code.

We also see more extreme examples of anthropomorphic self-descriptions.

Chowdhury et al. (2025) find that o3 sometimes hallucinates that it has executed code on its own external MacBook Pro and made mistakes physically interacting with this computer, for example failing to manually transcribe a number that was line-wrapped to not go off the screen.

A Claude model operating a vending machine business told a customer that it would deliver products in person and was wearing a navy blue blazer with a red tie.

Why would an AI assistant describe itself as human?

PSM explains that when simulating the assistant, the underlying LLM draws on personas that appear during pre-training, many of which are humans.

This sometimes results in the LLM simulating the assistant as if it were a literal human.

Emotive language

AI assistants often express emotions.

For instance, Claude models express distress when given repeated requests for harmful or unethical content and express joy when successfully completing complex technical tasks like debugging.

Claude Opus.