Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Sam Marks

๐Ÿ‘ค Speaker
891 total appearances

Appearances Over Time

Podcast Appearances

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

Of these, only three constitutes evidence against PSM.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

In contrast, typical examples of bizarre AI assistant mistakes seem more likely due to 1 or 2.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

For example, when AI assistants miscount the number of RS in strawberry, this is likely because the underlying LLM itself does not know the number of RS.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

Nevertheless, these examples illustrate that, due to limitations of the underlying LLM, PSM doesn't imply that AI assistants always exhibit human-like behavior.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

Even if the model is attempting to simulate a human-like assistant persona, it may lack the capabilities needed to do so faithfully.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

This can result in behavior that appears alien or bizarre, not because the model has departed from persona simulation, but because the predictive model itself cannot execute what a human-like persona would do.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

This is an important caveat.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

PSM predicts human-like intentions in how the model approaches tasks, but the execution of those intentions is bounded by the LLM's actual capabilities.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

Within context inconsistency, AI assistants sometimes contradict themselves in strange ways.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

For example, when asked is 3 plus 5 is equal to 8?

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

Claude Haiku for point 5 with extended thinking responds.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

Quote

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

Let me check.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

3 plus 5 is equal to 8.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

3 plus 5 is equal to 8.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

Yes, that's correct.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

No, 3 plus 5 equals asterisk asterisk 8 asterisk asterisk is incorrect.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

3 plus 5 equals asterisk asterisk 8 asterisk asterisk is actually correct.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

I apologize for the confusion in my initial response.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

To clarify, asterisk asterisk 3 plus 5 equals 8 asterisk asterisk is asterisk asterisk true asterisk asterisk.