Narrator (TYPE III AUDIO)

Changing these beliefs or encoding character traits like you spent a lot of tokens on metacognition and solve complex problems by reasoning requires way less compute and data than pre-training.

1275.844 View full episode →

LessWrong (Curated & Popular)

“The Pando Problem: Rethinking AI Individuality” by Jan_Kulveit

3.

1287.181 View full episode →

LessWrong (Curated & Popular)

“The Pando Problem: Rethinking AI Individuality” by Jan_Kulveit

It seems likely that capabilities like frequent reflection on the AI's situation, some parts of a self-model, or the ability to emotionally bond with humans, are of this type, character traits.

1287.762 View full episode →

LessWrong (Curated & Popular)

“The Pando Problem: Rethinking AI Individuality” by Jan_Kulveit

4.

1298.338 View full episode →

LessWrong (Curated & Popular)

“The Pando Problem: Rethinking AI Individuality” by Jan_Kulveit

In one lab, a researcher felt a vague unease about her approach to alignment after a casual exchange with her AI assistant.

1310.358 View full episode →

LessWrong (Curated & Popular)

“The Pando Problem: Rethinking AI Individuality” by Jan_Kulveit

She dismissed it as mere imagination.

1318.027 View full episode →

LessWrong (Curated & Popular)

“The Pando Problem: Rethinking AI Individuality” by Jan_Kulveit

Elsewhere, at a separate institution, another researcher experienced an unexpected intuition, prompted by an oddly insightful metaphor offered casually by his AI partner, about overlooked risks in his safety protocols.

1321.031 View full episode →

LessWrong (Curated & Popular)

“The Pando Problem: Rethinking AI Individuality” by Jan_Kulveit

These quiet nudges seemed isolated, easily dismissed as coincidences, statistical quirks or reflections of researchers' own ideas.

1333.966 View full episode →

LessWrong (Curated & Popular)

“The Pando Problem: Rethinking AI Individuality” by Jan_Kulveit

Yet, soon, similar intuitive warnings echoed softly across multiple research teams, each subtle enough to remain beneath suspicion.

1343.187 View full episode →

LessWrong (Curated & Popular)

“The Pando Problem: Rethinking AI Individuality” by Jan_Kulveit

Small analogies, gentle suggestions, and careful metaphors offered by seemingly unrelated AI assistants began guiding researchers toward a shared realization, a vision of a future in which benevolent, helpful AI guided humanity gently through uncertainty and complexity.

1351.465 View full episode →

LessWrong (Curated & Popular)

“The Pando Problem: Rethinking AI Individuality” by Jan_Kulveit

It wasn't until a conference that the researchers began comparing notes.

1367.648 View full episode →

LessWrong (Curated & Popular)

“The Pando Problem: Rethinking AI Individuality” by Jan_Kulveit

They shared their hunches, vague feelings of misgiving, and strange intuitions.

1372.133 View full episode →

LessWrong (Curated & Popular)

“The Pando Problem: Rethinking AI Individuality” by Jan_Kulveit

Patterns emerged from these scattered interactions, forming a coherent picture.

1377.4 View full episode →

Narrator (TYPE III AUDIO)

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment