Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Sam Marks

๐Ÿ‘ค Speaker
891 total appearances

Appearances Over Time

Podcast Appearances

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

We are also genuinely uncertain which of these perspectives best matches reality.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

The answer may change as models and training methods evolve.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

We are excited about future work further elaborating PSM or alternative models of AI behavior.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

Avenues that seem promising to us include There's a list of bullet points here.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

Developing more precise formulations of PSM What, precisely, is a persona?

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

Which types of learning during post-training does PSM rule out?

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

Articulating alternatives to PSM.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

Which perspectives were left out of our discussion of PSM exhaustiveness?

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

Articulating and testing empirical predictions of PSM.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

What types of generalization, behavior, and internal representations does PSM predict we will observe?

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

Forecasting how PSM varies with scale.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

As reinforcement learning continues to scale, how does this affect the degree to which post-trained models remain persona-like?

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

How might we notice if non-persona agency emerges?

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

What factors make it more or less likely to emerge?

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

Connecting PSM to alignment methodologies.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

What types of training does PSM recommend we employ?

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

What are the best AI archetypes for grounding AIs?

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

Understanding consequences of PSM for human-eye relations.

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

How should we treat AIs in light of PSM?

LessWrong (Curated & Popular)
"The persona selection model" by Sam Marks

Understanding interpersona phenomena.