Andrej Karpathy
👤 SpeakerAppearances Over Time
Podcast Appearances
I think one more maybe example that comes to mind that is even more extreme than that is when I was building MenuGen.
So MenuGen is this idea where you come to a restaurant, they give you a menu, there's no pictures usually, so I don't know what any of these things are.
Usually, like 30% of the things, I have no idea what they are, 50%.
So I wanted to take a photo of the restaurant menu and to get pictures of what those things might look like in a generic sense.
And so I built, I bytecoded this app that basically lets you upload a photo, and it does all this stuff, and it runs on Vercel, and it basically re-renders the menu, and it gives you all the items, and it gives you a picture that it uses an image generator for to basically OCR all the different titles, use the image generator to get pictures of them, and then shows it to you.
And then I saw the software 3.0 version of this, which blew my mind, which is literally just take your photo, give it to Gemini, and say, use NanoBanana to overlay the things onto the menu.
And NanoBanana basically returned an image that is exactly the picture of the menu that I took, but it actually put into the pixels, it rendered the different things in the menu.
And this blew my mind because...
Actually, all of my menu gen is spurious.
It's working in the old paradigm.
That app shouldn't exist.
And yeah, the software 3.0 paradigm is a lot more kind of raw.
It just, neural network is doing more and more of the work, and your prompt or context is just the image, and the output is an image, and there's no need to have any of the app in between.
So I think that people have to kind of like reframe.
Well, first of all, thank you for having me here.
I'm excited to be here.
So the quote that you've just mentioned, it's the decade of agents, that's actually a reaction to an existing, pre-existing quote, I should say, where I think some of the labs, I'm not actually sure who said this, but they were alluding to this being the year of agents with respect to LLMs and how they were going to evolve.
And I think...
I was triggered by that because I feel like there's some over-predictions going on in the industry.
And in my mind, this is really a lot more accurately described as the decade of agents.