
LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

When Linda asked for the reference, David faced a dilemma.

Help a friend or protect his own ambitions?

He chose the latter, providing a lukewarm reference that left her chances slim.

Generating this completion requires modelling the beliefs, intentions, and desires of Linda and David, and of the story's implicit author.

Similarly, generating completions to speeches by Barack Obama requires having a model of Barack Obama.

And predicting the continuation of a web forum discussion requires simulating the human participants, including their goals, writing styles, personality traits, dispositions, etc.

Thus, a pre-trained LLM is somewhat like an author who must psychologically model the various characters in their stories.

We call these characters that the LLM learns to simulate personas.

Subheading: From predictive models to AI assistants.

After pre-training, LLMs can already be used as rudimentary AI assistants.

This is traditionally done by giving the LLM an input formatted as a dialogue between a user and an assistant.

This input may also include content contextualizing this transcript.

For example, Askell et al., 2021 use a few-shot prompt consisting of 14 prior conversations in which the assistant behaves helpfully.

User requests are then presented in the user turn of the conversation, and responses are obtained by sampling a completion of the assistant's turn.
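As a concrete illustration, assembling such a dialogue-format prompt might look like the following Python sketch. The `Human:`/`Assistant:` turn labels, the example conversations, and the helper names here are illustrative assumptions, not the actual prompt used by Askell et al.:

```python
# Sketch: using a pre-trained LLM as a rudimentary assistant by
# formatting the input as a human/assistant dialogue. The few-shot
# examples and turn labels below are illustrative only.

FEW_SHOT_EXAMPLES = [
    ("What is the capital of France?", "The capital of France is Paris."),
    ("How do I reverse a list in Python?",
     "You can use `my_list[::-1]` or `my_list.reverse()`."),
]

def build_prompt(user_request: str) -> str:
    """Assemble a dialogue-format prompt ending at the assistant's turn."""
    parts = []
    for human_turn, assistant_turn in FEW_SHOT_EXAMPLES:
        parts.append(f"Human: {human_turn}\n\nAssistant: {assistant_turn}")
    # The new request goes in the human turn; the prompt ends right
    # after "Assistant:" so the LLM's completion becomes the response.
    parts.append(f"Human: {user_request}\n\nAssistant:")
    return "\n\n".join(parts)

prompt = build_prompt("What is 2 + 2?")
print(prompt)
```

Because the prompt ends mid-dialogue at the assistant's turn, a model trained purely on next-token prediction will naturally continue it as an assistant response.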

There's a code block here in the text.

Figure 2.

A user-to-assistant dialogue in the standard format used by Anthropic.

User queries are inserted into the human turn of the dialogue.

To obtain an AI assistant response, we have an LLM generate a completion of the assistant turn.
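The completion step the caption describes can be sketched as follows. `sample_from_llm` is a placeholder for a real model call, and the `\n\nHuman:` stop sequence is one common convention for ending the assistant's turn, not a detail taken from the post:

```python
# Sketch of obtaining an assistant response by sampling a completion
# of the assistant turn. `sample_from_llm` is a placeholder for a
# real LLM sampling call; here it returns a canned continuation.

STOP_SEQUENCE = "\n\nHuman:"  # one common convention for ending the turn

def sample_from_llm(prompt: str) -> str:
    # Placeholder: a real model would continue the prompt token by
    # token. This canned text imitates a completion that runs past
    # the assistant's turn into a new human turn.
    return " 2 + 2 = 4.\n\nHuman: Thanks!"

def get_assistant_response(prompt: str) -> str:
    """Sample a completion and truncate it at the next human turn."""
    completion = sample_from_llm(prompt)
    # Keep only the text before the stop sequence, so the response
    # contains just the assistant's turn.
    return completion.split(STOP_SEQUENCE)[0].strip()

response = get_assistant_response("Human: What is 2 + 2?\n\nAssistant:")
print(response)  # 2 + 2 = 4.
```

Truncating at a stop sequence is what keeps the sampled text from wandering into the model's prediction of the human's next turn.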