Bowen Baker

OpenAI Researcher Explains How AI Hides Its Thinking (w/ OpenAI’s Bowen Baker)

And so we,

1920.72 View full episode →

OpenAI Researcher Explains How AI Hides Its Thinking (w/ OpenAI’s Bowen Baker)

reasoning models, generally their capabilities, like on math problems and coding problems and things like that, they get better as they think for longer.

1921.722 View full episode →

The Neuron: AI Explained

OpenAI Researcher Explains How AI Hides Its Thinking (w/ OpenAI’s Bowen Baker)

They'll be able to solve more complex tasks.

1929.484 View full episode →

The Neuron: AI Explained

OpenAI Researcher Explains How AI Hides Its Thinking (w/ OpenAI’s Bowen Baker)

And so a small model

1932.412 View full episode →

The Neuron: AI Explained

OpenAI Researcher Explains How AI Hides Its Thinking (w/ OpenAI’s Bowen Baker)

thinking for a very long time can oftentimes get to the same answer or match the capability of a bigger model thinking for a shorter time okay wow so that's so yeah this model is really tight there's like a couple more steps in the thing i can just go through it and you can yeah um sure yeah so yeah so you now have like these like capability matched points of you know small model uh

1934.497 View full episode →

The Neuron: AI Explained

OpenAI Researcher Explains How AI Hides Its Thinking (w/ OpenAI’s Bowen Baker)

thinking for longer, big model, thinking for shorter, are roughly the same capabilities.

1958.375 View full episode →

The Neuron: AI Explained

OpenAI Researcher Explains How AI Hides Its Thinking (w/ OpenAI’s Bowen Baker)

But we found that, again, going back to your previous thing, we found that models that think for longer are more monitorable.

1963.001 View full episode →

The Neuron: AI Explained

OpenAI Researcher Explains How AI Hides Its Thinking (w/ OpenAI’s Bowen Baker)

They reveal more in their thinking.

1968.729 View full episode →

The Neuron: AI Explained

OpenAI Researcher Explains How AI Hides Its Thinking (w/ OpenAI’s Bowen Baker)

We found that the model thinking for the smaller model at the same capability level was more monitorable because it was thinking for longer.

1970.472 View full episode →

The Neuron: AI Explained

OpenAI Researcher Explains How AI Hides Its Thinking (w/ OpenAI’s Bowen Baker)

And so then that's the monitorability tax.

1981.807 View full episode →

The Neuron: AI Explained

OpenAI Researcher Explains How AI Hides Its Thinking (w/ OpenAI’s Bowen Baker)

Or sorry, but...

1984.11 View full episode →

The Neuron: AI Explained

OpenAI Researcher Explains How AI Hides Its Thinking (w/ OpenAI’s Bowen Baker)

Sorry, then the tax aspect comes in where these small models thinking for longer actually were using at that for like matched capability levels used more inference time compute than the bigger model thinking for shorter.

1986.083 View full episode →

The Neuron: AI Explained

OpenAI Researcher Explains How AI Hides Its Thinking (w/ OpenAI’s Bowen Baker)

In this case, the smaller models I think basically always cost more because the relative decrease in inference compute per token generated was less than the amount of additional tokens you'd need to reach the same capability level.

2009.607 View full episode →

The Neuron: AI Explained

OpenAI Researcher Explains How AI Hides Its Thinking (w/ OpenAI’s Bowen Baker)

And so overall, that small model with a lot of tokens was costing more compute than the big model.

2031.114 View full episode →

The Neuron: AI Explained

OpenAI Researcher Explains How AI Hides Its Thinking (w/ OpenAI’s Bowen Baker)

And so that's kind of like the monitorability tax.

2036.32 View full episode →

The Neuron: AI Explained

OpenAI Researcher Explains How AI Hides Its Thinking (w/ OpenAI’s Bowen Baker)

Like, oh, you could just spend more compute on inference time, but like get a more monitorable model out of it.