So that's probably how they would be implemented in practice. If you have any questions on that, we can talk about it, and then I can talk about obfuscation after.
Oh, yeah.
So, I think there's good reason not to reveal the chain of thought to users.
There are two commonly cited reasons. The first is, and this is to your question of whether the user should be monitoring this themselves: if we showed the user the chain of thought, we would have to conform it the same way we conform the outputs to look nice, so that we don't say something offensive to a user or tell them how to do something illicit, whatever the product policies are for a given lab's product. We would have to apply all of that to the chain of thought, because now we're handing users this new artifact. If the chain of thought says something offensive, or actually reveals how to make the bomb or whatever the illicit thing is, but the model decides not to put that in the final answer, well, we've still just told the user, if we show them the chain of thought. So revealing it would require us to put these style pressures on the chain of thought, and that could then lead to obfuscation, which we can talk about in a sec.
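To make that pressure concrete, here is a minimal, hypothetical sketch of a reward function, not any lab's actual training code. All the names (Sample, task_reward, policy_flag_count) are illustrative stand-ins. The point is that once the chain of thought is user-visible and gets the same policy penalty as the final answer, the cheapest way for an optimized model to recover reward is often to stop verbalizing the flagged reasoning rather than to stop reasoning that way.

```python
from dataclasses import dataclass

# Hypothetical sketch only: illustrative names, not any lab's training code.

@dataclass
class Sample:
    chain_of_thought: str
    final_answer: str

def task_reward(answer: str) -> float:
    # Stand-in for a real task grader.
    return 1.0 if "42" in answer else 0.0

def policy_flag_count(text: str) -> int:
    # Stand-in for a trained policy/moderation classifier.
    banned = ["offensive", "how to make the bomb"]
    return sum(phrase in text.lower() for phrase in banned)

def reward(sample: Sample, cot_is_user_visible: bool) -> float:
    r = task_reward(sample.final_answer)
    r -= policy_flag_count(sample.final_answer)  # outputs always conform to policy
    if cot_is_user_visible:
        # Showing the CoT to users means penalizing the CoT too. Under
        # optimization, the cheapest fix is often to stop *verbalizing*
        # the flagged reasoning, not to stop reasoning that way, which
        # is exactly the obfuscation that breaks monitorability.
        r -= policy_flag_count(sample.chain_of_thought)
    return r
```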
And then the other reason is, I think, distillation risk. These chains of thought reveal a lot of a model's reasoning, and generally we don't try to just give our model away for other people to use. So, yeah.
So this connects to what you mentioned in the intro, that the chain of thought could be fragile. And I think that is a big worry in the community right now.