Dario Amodei
๐ค SpeakerAppearances Over Time
Podcast Appearances
And, you know, you find things that are evocative where, you know, the models, their activations that light up in the models that we see as being associated with.
the concept of anxiety or something like that you know that when when characters experience anxiety in the text and then when the model itself is in a situation that a human might associate with anxiety that same anxiety neuron shows up now does that mean the model is experiencing anxiety that doesn't prove that at all but but it but it does indicate it i think to the user right and yeah we would i would have to do an entirely different interview
Yeah, so I think there's a few, we should like separate out a few different things here that we're sort of all trying to achieve at once that are like intention with each other.
There's the question of whether the AIs genuinely have a consciousness, and if so, how do we give them a good experience?
There's a question of the humans who interact with the AI.
And how do we give those humans a good experience?
And how does the perception that AIs might be conscious interact with that experience?
And there's the idea of how we maintain human mastery, as we put it, over the AI system.
The last two.
So the thing I was going to say is that actually, I wonder if there's a kind of an elegant way to satisfy all three, including the last two.
Again, this is me dreaming in machines of loving grace mode, right?
This is this mode I go into where I'm like, man, I see all these problems.
I, you know...
if we could solve it, is there an elegant way?
This is not me saying there are no problems here.
That's not how I think, but...
You know, if we think about making the constitution of the AI so that the AI has a sophisticated understanding of its relationship to human beings and it induces psychologically healthy behavior in the humans, a psychologically healthy relationship between the AI and the humans.
And I think something that could grow out of that is.
psychologically healthy, not psychologically unhealthy relationship is some understanding of the relationship between human and machine.
And perhaps that relationship could be the idea that, you know, these models, when you interact with them, when you talk to them, they're really helpful.