Sam Marks

LessWrong (Curated & Popular)

"The persona selection model" by Sam Marks

We are also genuinely uncertain which of these perspectives best matches reality.

The answer may change as models and training methods evolve.

We are excited about future work further elaborating PSM or alternative models of AI behavior.

Avenues that seem promising to us include the following. (There's a list of bullet points here.)

Developing more precise formulations of PSM. What, precisely, is a persona?

Which types of learning during post-training does PSM rule out?

Articulating alternatives to PSM.

Which perspectives were left out of our discussion of PSM exhaustiveness?

Articulating and testing empirical predictions of PSM.

What types of generalization, behavior, and internal representations does PSM predict we will observe?

Forecasting how PSM varies with scale.

As reinforcement learning continues to scale, how does this affect the degree to which post-trained models remain persona-like?

How might we notice if non-persona agency emerges?

What factors make it more or less likely to emerge?

Connecting PSM to alignment methodologies.

What types of training does PSM recommend we employ?

What are the best AI archetypes for grounding AIs?

Understanding consequences of PSM for human-AI relations.

How should we treat AIs in light of PSM?

Understanding interpersona phenomena.
