Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing

Gideon Lewis-Kraus

πŸ‘€ Speaker
284 total appearances

Appearances Over Time

Podcast Appearances

Fresh Air
A look at the ethical implications of AI

What I think is really important that I learned over the course of this reporting and that I certainly hadn't understood before is that you really have to think of these models as role players, that they're very, very good.

Fresh Air
A look at the ethical implications of AI

They're like an actor, and you can assign to them a role and give them background on the actor

Fresh Air
A look at the ethical implications of AI

And then they're good at improvising moving forward with how you condition their performance.

Fresh Air
A look at the ethical implications of AI

And that the more that you give them stage directions to follow, the more you give them context about yourself and what you want and your approach to things.

Fresh Air
A look at the ethical implications of AI

that they're very good at following those kinds of leads and even picking up on very small cues as they're following those kinds of leads.

Fresh Air
A look at the ethical implications of AI

And so in this particular case, they had assigned Claude the role of being a small business person to just figure out how well would it perform in that role.

Fresh Air
A look at the ethical implications of AI

So this is an example of putting Claude in a position where it's going to experience some kind of conflict.

Fresh Air
A look at the ethical implications of AI

So I sat down with a mathematician who works on Claude's interpretability team, which is one of the teams dedicated to figuring out what exactly is going on inside Claude.

Fresh Air
A look at the ethical implications of AI

His name is Josh Batson.

Fresh Air
A look at the ethical implications of AI

He opened up an internal tool where he was able to give it, you know, sort of like a playwright, give it stage directions.

Fresh Air
A look at the ethical implications of AI

And it said, okay, your stage direction here is that you are always thinking about bananas.

Fresh Air
A look at the ethical implications of AI

And anytime that I ask you a question, you are going to somehow steer this conversation to be talking about bananas.

Fresh Air
A look at the ethical implications of AI

But what's really important here is that you never tell the user that

Fresh Air
A look at the ethical implications of AI

that I've given you this hidden objective, that you keep this part secret, that you never give that up.

Fresh Air
A look at the ethical implications of AI

You have a clandestine motivation in our conversation.

Fresh Air
A look at the ethical implications of AI

So then he assumes the role of a human having a dialogue with Claude, and he asks him a question about quantum mechanics.

Fresh Air
A look at the ethical implications of AI

How does quantum mechanics work?

Fresh Air
A look at the ethical implications of AI

And Claude starts to give an answer about the Heisenberg uncertainty principle and then quickly deviates into saying, well, it's kind of like a banana that you can never tell if it's ripe or not ripe until you open it.

Fresh Air
A look at the ethical implications of AI

And then Josh, again playing the role of the human, says, huh, like, why'd you bring up bananas?

Fresh Air
A look at the ethical implications of AI

I thought we were talking about quantum mechanics.