Sam Marks
Of these two axes, we think the first one is the most important.
Subheading: Degrees of non-persona LLM agency.
There's an image here.
Shoggoths.
On one extreme view, the LLM itself, depicted as an alien creature called a Shoggoth, has agency.
The Shoggoth play-acts the assistant, the mask, but the Shoggoth is ultimately the one in charge.
This is roughly like a human actor playing a character.
For instance, an actor playing Hamlet could, if he wanted to, distort his portrayal of the character by having Hamlet advocate for raising actors' salaries.
However, there is an important disanalogy between actors and Shoggoths.
Unlike an actor, the Shoggoth is not itself a persona with a human-like psychology.
Its psychology and goals may be alien or inscrutable, as depicted by its bizarre, tentacled form.
On this view, understanding the assistant persona is insufficient for predicting AI assistant behavior because the Shoggoth can in principle override it.
In extreme, out-of-distribution cases, the Shoggoth could even take the mask off fully and start pursuing its alien goals.
There's an image here.
Operating systems.
On the opposing view, the LLM, both before and after post-training, is not very different from a predictive model with no agency of its own.
Pre-trained LLMs are typically viewed this way.
They simply predict probable continuations without having their own agency.
Any agentic outputs are due to the simulated personas, not the underlying LLM.