Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Ahmed El-Kishky

๐Ÿ‘ค Speaker
522 total appearances

Appearances Over Time

Podcast Appearances

The Neuron: AI Explained
How OpenAI Beat Every Human Team at the World's Hardest Coding Competition

When we started doing RL, like just getting started there, it would do a little bit of planning.

The Neuron: AI Explained
How OpenAI Beat Every Human Team at the World's Hardest Coding Competition

It would be like, I'm going to try this.

The Neuron: AI Explained
How OpenAI Beat Every Human Team at the World's Hardest Coding Competition

And we saw an improvement there.

The Neuron: AI Explained
How OpenAI Beat Every Human Team at the World's Hardest Coding Competition

Over time, the reasoning that the models would do would actually become way more elaborate.

The Neuron: AI Explained
How OpenAI Beat Every Human Team at the World's Hardest Coding Competition

And it was really interesting because we have a lot of like IOI gold medalists, ICPC, competitive programmers, and they'd be like, oh yeah, that's a strategy that I would use.

The Neuron: AI Explained
How OpenAI Beat Every Human Team at the World's Hardest Coding Competition

So one of the coolest things, in my opinion, is when the model decided it wanted to test itself before it submitted.

The Neuron: AI Explained
How OpenAI Beat Every Human Team at the World's Hardest Coding Competition

And so what the model would do is, so the solution would have to be a very complicated algorithm.

The Neuron: AI Explained
How OpenAI Beat Every Human Team at the World's Hardest Coding Competition

It would have to have great complexity.

The Neuron: AI Explained
How OpenAI Beat Every Human Team at the World's Hardest Coding Competition

It would be very memory aware, very complicated.

The Neuron: AI Explained
How OpenAI Beat Every Human Team at the World's Hardest Coding Competition

The algorithm would be like so complex that it's easy to make a mistake.

The Neuron: AI Explained
How OpenAI Beat Every Human Team at the World's Hardest Coding Competition

So what the model would do first is write a brute force solution, which is a lot easier to do.

The Neuron: AI Explained
How OpenAI Beat Every Human Team at the World's Hardest Coding Competition

It can be like a few lines.

The Neuron: AI Explained
How OpenAI Beat Every Human Team at the World's Hardest Coding Competition

And then it would generate some inputs, like pretending to be a, like, you know, the judge would generate inputs in the same way.

The Neuron: AI Explained
How OpenAI Beat Every Human Team at the World's Hardest Coding Competition

And then it would compare the outputs of the brute force solution to the really complicated algorithm that it wrote.

The Neuron: AI Explained
How OpenAI Beat Every Human Team at the World's Hardest Coding Competition

And it knows that for it to be correct, the outputs have to match.

The Neuron: AI Explained
How OpenAI Beat Every Human Team at the World's Hardest Coding Competition

So it couldn't submit the brute force solution because there's strict time limits and memory limits.

The Neuron: AI Explained
How OpenAI Beat Every Human Team at the World's Hardest Coding Competition

It wouldn't be correct, but it would spend a lot of time thinking and then trying to match the outputs of the brute force solution.

The Neuron: AI Explained
How OpenAI Beat Every Human Team at the World's Hardest Coding Competition

And I remember when I noticed that and just showed it to people on the team, the competitive programmers at OpenAI would be like, yeah, that's a valid strategy.

The Neuron: AI Explained
How OpenAI Beat Every Human Team at the World's Hardest Coding Competition

And what was really cool is

The Neuron: AI Explained
How OpenAI Beat Every Human Team at the World's Hardest Coding Competition

We didn't tell it that.