Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing
430 total appearances

Appearances Over Time

Podcast Appearances

Azeem Azhar's Exponential View
OpenAI’s CPO on what’s coming next: Hardware, GPT-5, Jony Ive, agents, more

The cool thing is you can fire off 10 of these tasks at once, right?

Azeem Azhar's Exponential View
OpenAI’s CPO on what’s coming next: Hardware, GPT-5, Jony Ive, agents, more

So we try and actually give you the value of all this parallelism.

Azeem Azhar's Exponential View
OpenAI’s CPO on what’s coming next: Hardware, GPT-5, Jony Ive, agents, more

where it's not just you can do one thing, but if you have a Codex agent working for you, why not have 10 Codex agents working for you on 10 different tasks?

Azeem Azhar's Exponential View
OpenAI’s CPO on what’s coming next: Hardware, GPT-5, Jony Ive, agents, more

And by the way, just to connect it to the previous topic on evals, this is also... evals are...

Azeem Azhar's Exponential View
OpenAI’s CPO on what’s coming next: Hardware, GPT-5, Jony Ive, agents, more

there's a really important kind of subtlety to them too, where they have to be tailored to the product that you're trying to build and the problem that you're trying to solve.

Azeem Azhar's Exponential View
OpenAI’s CPO on what’s coming next: Hardware, GPT-5, Jony Ive, agents, more

Where, you know, coding isn't one thing.

Azeem Azhar's Exponential View
OpenAI’s CPO on what’s coming next: Hardware, GPT-5, Jony Ive, agents, more

Just coding is a small vertical of the entire world.

Azeem Azhar's Exponential View
OpenAI’s CPO on what’s coming next: Hardware, GPT-5, Jony Ive, agents, more

But even within coding, you can be good at lots of different kinds of coding.

Azeem Azhar's Exponential View
OpenAI’s CPO on what’s coming next: Hardware, GPT-5, Jony Ive, agents, more

And with Codex, that was a great example of going and saying, okay, what kinds of coding really matter to us?

Azeem Azhar's Exponential View
OpenAI’s CPO on what’s coming next: Hardware, GPT-5, Jony Ive, agents, more

What kinds of tasks and all the tasks that a developer does, what kinds of tasks do we really want to be good at?

Azeem Azhar's Exponential View
OpenAI’s CPO on what’s coming next: Hardware, GPT-5, Jony Ive, agents, more

And we created evals for those.

Azeem Azhar's Exponential View
OpenAI’s CPO on what’s coming next: Hardware, GPT-5, Jony Ive, agents, more

And then we made sure to monitor as we train the model, is it getting better and better and better at these?

Azeem Azhar's Exponential View
OpenAI’s CPO on what’s coming next: Hardware, GPT-5, Jony Ive, agents, more

And, you know, you go and accumulate tasks and examples for the model to learn from, but you do it against a specific set of evals that correspond to a specific set of problems you want to solve.

Azeem Azhar's Exponential View
OpenAI’s CPO on what’s coming next: Hardware, GPT-5, Jony Ive, agents, more

Yeah, I think part of this is about making sure that, like we talked about earlier, that the user is in control here.

Azeem Azhar's Exponential View
OpenAI’s CPO on what’s coming next: Hardware, GPT-5, Jony Ive, agents, more

So you should be able to at some point be like, hey, you know what?

Azeem Azhar's Exponential View
OpenAI’s CPO on what’s coming next: Hardware, GPT-5, Jony Ive, agents, more

You've checked enough.

Azeem Azhar's Exponential View
OpenAI’s CPO on what’s coming next: Hardware, GPT-5, Jony Ive, agents, more

Like, you're good.

Azeem Azhar's Exponential View
OpenAI’s CPO on what’s coming next: Hardware, GPT-5, Jony Ive, agents, more

And the other interesting thing in all of this is the technology is evolving so quickly, like much more quickly than I think we're used to with technology.

Azeem Azhar's Exponential View
OpenAI’s CPO on what’s coming next: Hardware, GPT-5, Jony Ive, agents, more

We're used to things taking like decades to deploy and to really achieve scale.

Azeem Azhar's Exponential View
OpenAI’s CPO on what’s coming next: Hardware, GPT-5, Jony Ive, agents, more

One of the phenomenons you see with AI technology is there'll be some benchmark, some eval that AI just can't crack.