Ahmed El-Kishky
When we started doing RL, just getting started there, the model would do a little bit of planning.
It would be like, I'm going to try this.
And we saw an improvement there.
Over time, the reasoning that the models would do would actually become way more elaborate.
And it was really interesting because we have a lot of IOI gold medalists and ICPC competitive programmers, and they'd be like, oh yeah, that's a strategy that I would use.
So one of the coolest things, in my opinion, is when the model decided it wanted to test itself before it submitted.
And so here's what the model would do. The solution would have to be a very complicated algorithm.
It would have to have good time complexity.
It would be very memory-aware, very intricate.
The algorithm would be so complex that it's easy to make a mistake.
So what the model would do first is write a brute force solution, which is a lot easier to do.
It can be like a few lines.
And then it would generate some inputs, pretending to be the judge, generating inputs the same way the judge would.
And then it would compare the outputs of the brute force solution to the outputs of the really complicated algorithm that it wrote.
And it knows that for it to be correct, the outputs have to match.
So it couldn't submit the brute force solution, because there are strict time limits and memory limits, so it wouldn't pass.
But it would spend a lot of time thinking and then trying to match the outputs of the brute force solution.
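Here's a minimal sketch of that stress-testing pattern, assuming a simple example problem (maximum subarray sum) since the transcript doesn't say which problems were involved; the function names are placeholders, not anything from the actual run.

```python
import random

def brute_force(arr):
    # O(n^2): check every subarray. Easy to get right, too slow to submit.
    best = arr[0]
    for i in range(len(arr)):
        total = 0
        for j in range(i, len(arr)):
            total += arr[j]
            best = max(best, total)
    return best

def fast_solution(arr):
    # O(n) Kadane's algorithm: the "complicated" solution you actually submit.
    best = cur = arr[0]
    for x in arr[1:]:
        cur = max(x, cur + x)
        best = max(best, cur)
    return best

def random_input(max_len=8, max_val=10):
    # Generate small random inputs, the way a judge's generator might.
    n = random.randint(1, max_len)
    return [random.randint(-max_val, max_val) for _ in range(n)]

# Stress test: the two solutions must agree on every input before submitting.
for _ in range(1000):
    arr = random_input()
    assert brute_force(arr) == fast_solution(arr), f"mismatch on {arr}"
print("all tests passed")
```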
And I remember when I noticed that and showed it to people on the team, the competitive programmers at OpenAI were like, yeah, that's a valid strategy.
And what was really cool is, we didn't tell it that.