Sam Marks
Predictive models and personas.
The first phase in training modern LLMs is called pre-training.
During pre-training, the LLM is trained to predict what comes next given an initial segment of some document, such as a book, news article, piece of code, or conversation on a web forum.
Via pre-training, LLMs learn to be extremely good predictive models of their training corpus.
We refer to LLMs that have undergone pre-training but no subsequent training phases as base models.
Even though AI developers don't ultimately want predictive models, we pre-train our LLMs in this way because accurate prediction requires learning rich cognitive patterns.
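To make the prediction objective concrete, here is a minimal sketch of next-token prediction with a toy stand-in for the neural network: a character-level bigram model built by counting, trained on a tiny hypothetical corpus. The counting model, the corpus, and the `predict_next` helper are all illustrative assumptions, not anything from an actual LLM training pipeline.

```python
from collections import Counter, defaultdict

# Toy illustration of the pre-training objective: given a prefix,
# predict what comes next. Here "tokens" are single characters and
# the "model" is just a table of bigram counts.
corpus = "the cat sat on the mat. the cat sat on the hat."

# Count how often each character follows each other character.
counts = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    counts[prev][nxt] += 1

def predict_next(prefix):
    """Return the most frequent continuation of the prefix's last character."""
    last = prefix[-1]
    return counts[last].most_common(1)[0][0]

print(predict_next("the cat sat on the ma"))  # prints "t"
```

A real base model replaces the count table with a neural network and minimizes cross-entropy between its predicted distribution and the actual next token, but the interface is the same: prefix in, continuation out.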
Consider predicting the solution to a math problem.
If the model sees "What is 347 times 28?" followed by the start of a worked solution, continuing that solution correctly requires understanding the algorithm for multi-digit multiplication.
Similarly, accurately predicting continuations of diverse chess games requires understanding the rules of chess.
Thus, a strong predictive model requires factual knowledge about the world, logical reasoning, and understanding of common sense physics, among other cognitive patterns.
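To spell out what the multiplication example demands of a predictor, here is a short sketch of the partial-products algorithm applied to 347 × 28. The function name is a hypothetical helper chosen for illustration.

```python
def long_multiply(a, b):
    # Multi-digit multiplication via partial products: scale `a` by each
    # digit of `b`, shifted by that digit's place value, and sum.
    total = 0
    for place, digit in enumerate(reversed(str(b))):
        total += a * int(digit) * 10**place
    return total

# 347 * 8 = 2776 and 347 * 20 = 6940, so the total is 9716.
print(long_multiply(347, 28))  # prints 9716
```

A model that merely memorized surface patterns could not reliably emit the right partial products for unseen operands; producing correct continuations of such worked solutions is evidence that something like this procedure has been internalized.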
An especially important type of cognitive pattern is an agent model or persona.
Consider the following example completion from the Claude Sonnet 4.5 base model.
The bold text is the LLM's completion; the non-bold text is the prefix given to the model.
Linda wanted her ex-colleague David to recommend her for a VP role at Nexus Corporation.
What she didn't know was that David had been quietly pursuing the same role for months.
It was the opportunity he'd been waiting for his entire career.