Sam Marks

Speaker
891 total appearances

Podcast Appearances

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

Of these two axes, we think the first one is the most important.

Subheading.

Degrees of non-persona LLM agency.

There's an image here.

Shoggoths.

On one extreme perspective, the LLM, as depicted by an alien creature called a Shoggoth, itself has agency.

The Shoggoth play-acts the assistant (the mask), but the Shoggoth is ultimately the one in charge.

This is roughly like a human actor playing a character.

For instance, an actor playing Hamlet could, if he wanted to, distort his portrayal of the character by having Hamlet advocate for the raising of actors' salaries.

However, there is an important disanalogy between actors and Shoggoths.

The Shoggoth is not itself a simulated persona with a human-like psychology.

Its psychology and goals may be alien or inscrutable, as depicted by its bizarre, tentacled form.

On this view, understanding the assistant persona is insufficient for predicting AI assistant behavior because the Shoggoth can in principle override it.

In extreme, out-of-distribution cases, the Shoggoth could even take the mask off fully and start pursuing its alien goals.

There's an image here.

Operating systems.

On an opposing view, the LLM, both before and after post-training, is not too different from a predictive model with no agency of its own.

Pre-trained LLMs are typically viewed this way.

They simply predict probable continuations without having their own agency.

Any agentic outputs are due to the simulated personas, not the underlying LLM.