Bowen Baker
I doubt there's going to be one strategy or one method that makes models completely safe on its own.
Oh, yeah.
So I'll break it down into two things.
First: how would we actually use a monitor in practice?
And then: what about obfuscation?
Can models actually hide their thinking?
So I'll try to answer that first one.
And then we can talk about the second one.
But yeah, so how would you actually do it?
I think, obviously, every company probably wants to get their latency down as much as they can when giving users results.
But for complex knowledge work, people are already waiting quite a while to get the report back, or their code base updated, and so on.
And so if we had to wait slightly longer because we were monitoring the outputs of the model as it was going,
that plausibly seems like something products would do if it actually served a critical safety function.
And one nice thing there is that for every action the model takes, you can monitor the chain of thought
and the action before you actually execute it, if you're willing to pay the latency cost of waiting to execute the action and start the next part of the reasoning.
And if you do that, you can probably mitigate some harms before they happen because you know that the model was thinking about doing something bad.
And then you can just say, oh, it was thinking about doing something bad.
I'm just going to cut it off here.
And I'll retry or tell the user I can't do the thing for some reason.
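To make the gating pattern Bowen is describing concrete, here is a minimal Python sketch. Everything in it (the Agent and Monitor stand-ins, the is_suspicious check, the retry budget) is a hypothetical illustration of the idea rather than any real product's API: the model proposes its chain of thought and an action, a monitor reviews both, and only then does the action execute.

```python
# Hypothetical sketch of chain-of-thought monitoring as a gate on actions.
# Agent, Monitor, and all method names below are illustrative stand-ins,
# not a real product API; a real monitor would be another model.

class Agent:
    def propose(self, task: str) -> tuple[str, str]:
        # Produce (chain_of_thought, proposed_action) without executing anything.
        return (f"I should search for {task}", f"search({task!r})")

    def execute(self, action: str) -> str:
        # Only called once the monitor has cleared the step.
        return f"executed {action}"

class Monitor:
    def is_suspicious(self, thought: str, action: str) -> bool:
        # Stand-in check; in practice this would score the reasoning and action.
        return "delete all files" in thought or "delete all files" in action

MAX_RETRIES = 2  # assumed retry budget, not something from the conversation

def monitored_step(agent: Agent, monitor: Monitor, task: str) -> str:
    """Execute one agent step, but only after the monitor has reviewed both
    the chain of thought and the proposed action."""
    for _ in range(MAX_RETRIES + 1):
        thought, action = agent.propose(task)       # nothing has run yet
        if monitor.is_suspicious(thought, action):  # bad intent caught early
            continue                                # cut it off and retry
        return agent.execute(action)
    return "Sorry, I can't help with that request."  # fall back to a refusal

print(monitored_step(Agent(), Monitor(), "the quarterly report"))
```

The key design point is that the check happens between proposing and executing each step, which is exactly where the extra latency Bowen mentions gets paid.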