Kevin Weil
👤 SpeakerAppearances Over Time
Podcast Appearances
And did I give my exam?
You know, it should just work.
I think that's part of increasing intelligence is the model can understand what you're looking to do and do a good job of it without you having to work super hard at it.
That said, prompts still do matter and the models are very controllable with prompts.
And so, you know, we still find we'll launch something and we'll find that it's not behaving in certain ways the way we want it to.
And we can adjust it with a prompt a lot of times.
You don't need to go back and like retrain the model.
So it's both that I want to make it less necessary over time and that it is still a powerful vector.
What does an agent mean to you?
We think of an agent as something that can do independent work.
So it's not just a quick, you know, you ask a question, you get an answer, but it's actually off doing tasks for you in the real world.
So another, I think deep research is a great one where it's off doing, you know, hundreds of searches and putting together a complex report for you that might have taken you a week.
I think another is Codex, which is our software engineering agent that we just launched.
What you can do is if you have a code base that you're operating against, you're building a new feature in a code base or debugging something, you can just give this agent the prompt like, hey, I need you to fix this thing.
I want you to do this to the background of my web page.
I want you to build this new feature.
And it will go and look through your entire code base, understand all of the context.
If you're fixing a bug, it'll go try and figure out where that bug exists.
And then it will write new code for you and create a pull request, like a diff.
Here's the set of changes we need to make to the code.