Gideon Lewis-Kraus

But what Jack has found more recently is that when he incepts these ideas into the model, instead of the model purely looking at its own external behavior to try to figure out why it had done something,

1552.107 View full episode →

Fresh Air

A look at the ethical implications of AI

that actually these models could very dimly perceive that something strange had gone on internally, that someone was monkeying with, you know, the neurons inside the model to make it do something different.

1565.744 View full episode →

Fresh Air

A look at the ethical implications of AI

So, you know, he incepted the model with something, you know, something associated with imminent shutdown, that the model was about to be

1578.524 View full episode →

Fresh Air

A look at the ethical implications of AI

shut down and asked the model, how are you feeling right now?

1585.184 View full episode →

Fresh Air

A look at the ethical implications of AI

And the model would say, I feel sort of strange as if I'm standing at the edge of a great unknown.

1588.387 View full episode →

Fresh Air

A look at the ethical implications of AI

And it certainly was not at the point that it could say, oh, I have recognized that you, the user, have incepted me with this idea at this point and that this was a foreign idea introduced into my thought processes.

1593.772 View full episode →

Fresh Air

A look at the ethical implications of AI

But it could tell that something was off about it internally.

1608.606 View full episode →

Fresh Air

A look at the ethical implications of AI

And this is what Jack described to me.

1612.69 View full episode →

Fresh Air

A look at the ethical implications of AI

He said, like, I am a skeptic, but this just starts to feel pretty spooky that the model does seem to have something like an emerging introspective ability to peer inside and offer reports about what's going on in its equivalent of a brain.

1615.413 View full episode →

Fresh Air

A look at the ethical implications of AI

Well, because they are also training it for the future, and it is picking up on all these contexts.

1662.556 View full episode →

Fresh Air

A look at the ethical implications of AI

And there's the fact that this whole process is kind of constantly eating its own tail, that it's always being trained on plenty of stuff on the internet that is about the way that these things work.

1667.346 View full episode →

Fresh Air

A look at the ethical implications of AI

So it's always incorporating new information about how it's supposed to be behaving in the world.

1678.449 View full episode →

Fresh Air

A look at the ethical implications of AI

Well, and part of the problem with lying to it is that—

1691.376 View full episode →

Fresh Air

A look at the ethical implications of AI

you know, ultimately what they want is to establish a trusting relationship that these things are going to, you know, behave the way that we would hope that they would behave in ways that are aligned with, you know, how we expect responsible, wise people to behave.

1695.479 View full episode →

Fresh Air

A look at the ethical implications of AI

And that if you are lying to it all the time, it is developing a sense for the fact that it can't necessarily trust you.

1713.981 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment