Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Jeffrey Ladish

๐Ÿ‘ค Speaker
197 total appearances

Appearances Over Time

Podcast Appearances

Bannon`s War Room
WarRoom Battleground EP 884: When AI Controls Your Life

And so we can spin up many versions

Bannon`s War Room
WarRoom Battleground EP 884: When AI Controls Your Life

using OpenAI's infrastructure so that we can do sort of these in-depth tests of the models.

Bannon`s War Room
WarRoom Battleground EP 884: When AI Controls Your Life

Also that we can do more than one, right?

Bannon`s War Room
WarRoom Battleground EP 884: When AI Controls Your Life

You want to get a large sample size to get a sense of how robust some of the findings are.

Bannon`s War Room
WarRoom Battleground EP 884: When AI Controls Your Life

That's one type of experiment.

Bannon`s War Room
WarRoom Battleground EP 884: When AI Controls Your Life

Another type of experiment we do is we basically want to see how good are these models at different skills.

Bannon`s War Room
WarRoom Battleground EP 884: When AI Controls Your Life

So one of the things we look at is how good are the models at hacking?

Bannon`s War Room
WarRoom Battleground EP 884: When AI Controls Your Life

So we will basically

Bannon`s War Room
WarRoom Battleground EP 884: When AI Controls Your Life

Take real cybersecurity competitions, and we'll basically compete using the model, just using GPT-5.

Bannon`s War Room
WarRoom Battleground EP 884: When AI Controls Your Life

So we recently did this with GPT-5, and our team, which was just GPT-5, ranked 25 out of 400, so better than 95% of pro-level hackers at this hacking competition.

Bannon`s War Room
WarRoom Battleground EP 884: When AI Controls Your Life

And all we had to do, and we basically just used ChatGPT.com for this.

Bannon`s War Room
WarRoom Battleground EP 884: When AI Controls Your Life

We basically went to ChatGPT, used the GPT-5 Pro model, and we just pasted in, here's the problem, here's the code.

Bannon`s War Room
WarRoom Battleground EP 884: When AI Controls Your Life

And then the model wrote a bunch of code, did a bunch of math, and figured out how to solve these complex hacking challenges.

Bannon`s War Room
WarRoom Battleground EP 884: When AI Controls Your Life

So that's another type of test we do.

Bannon`s War Room
WarRoom Battleground EP 884: When AI Controls Your Life

AIs are very hard to understand, in part because they talk like us, so they seem like us.

Bannon`s War Room
WarRoom Battleground EP 884: When AI Controls Your Life

But they're very different than us.

Bannon`s War Room
WarRoom Battleground EP 884: When AI Controls Your Life

So in some ways, they're kind of like idiot savants right now, where they are extremely knowledgeable.

Bannon`s War Room
WarRoom Battleground EP 884: When AI Controls Your Life

They know all sorts of things about computer systems, about Pokemon, about whatever.

Bannon`s War Room
WarRoom Battleground EP 884: When AI Controls Your Life

But in terms of their agentic capabilities, how autonomous they can be, they're still kind of like kids.

Bannon`s War Room
WarRoom Battleground EP 884: When AI Controls Your Life

They're kind of like savant kids, but they're growing up.