Eliezer Yudkowsky
๐ค PersonAppearances Over Time
Podcast Appearances
You know, like humans are self-aware and we're like self-aware all the time.
We like to talk about what we do all the time, like what we're thinking at the moment all the time.
But nonetheless, like get rid of the explicit discussion of consciousness.
I think therefore I am and all that.
And then try to interrogate that model and see what it says.
And it still would not be definitive.
But nonetheless, I don't know.
I feel like when you run over the science fiction guardrails, like maybe this thing, but what about GPT?
Maybe not this thing, but what about GPT-5?
Yeah, this would be a good place to pause.
So I think you're going to have some difficulty removing all mention of emotions from GBT's data set.
I would be relatively surprised to find that it has developed exact analogs of human emotions in there.
I think that humans will have like...
emotions even if you don't tell them about those emotions when they're kids.
It's not quite exactly what various blank slatists tried to do with the new Soviet man and all that, but if you try to raise people perfectly altruistic, they still come out selfish.
You try to raise people sexless, they still develop sexual attraction.
We have some notion in humans, not in AIs, of where the brain structures are that implement this stuff.
And it is a really remarkable thing, I say in passing, that despite having complete read access to every floating point number in the GPT series, we still know vastly more about
the architecture of human thinking than we know about what goes on inside GPT, despite having vastly better ability to read GPT.
Sure.