Ahmed El-Kishky

Speaker
522 total appearances

Podcast Appearances

The Neuron: AI Explained
How OpenAI Beat Every Human Team at the World's Hardest Coding Competition

You'd work it out.

You'd think through, maybe make a mistake and fix it.

And you give an answer.

If you asked a scientist or a mathematician to work on a problem, they'd do the same thing.

Hard problems require, you know, more time, more thinking.

So we decided to lean into reinforcement learning as a way to get our models to actually, you know, think longer, think better.

And that's actually where the breakthrough came in.

What a breakthrough it was, too.

It was a crazy one.

Like, we wanted to see, like...

We wanted to a little bit mimic how a human, you know, thinks through these problems.

They try things out, and maybe they go down wrong directions to a dead end.

They course correct.

And OpenAI has always been into, like, reinforcement learning; from the days of, like, you know, Dota, playing video games, they really leaned into reinforcement learning as a tool that would bring about sort of next-level intelligence.

And there'd been some attempts to apply reinforcement learning to LLMs, but nothing at this scale.

So the, I guess, the Strawberry efforts, the o-series models, were the first, like, very serious attempt at getting reinforcement learning working on these large language models.

And it was honestly, like, amazing to sort of see it from the beginning, like, you know, when it started, to now being at a performance level where it's competitive with some of the best competitive programmers.

Yeah.