Andrej Karpathy
๐ค SpeakerAppearances Over Time
Podcast Appearances
I think to a very large extent, we went through a number of iterations and we learned a ton about how to create these data sets.
I'm not seeing big open problems.
Originally, when I joined, I was really not sure how this would turn out.
But by the time I left, I was much more secure in actually understanding the philosophy of how to create these data sets.
And I was pretty comfortable with where that was at the time.
Yeah, pixels, I think, are a beautiful sensor, I would say.
The thing is, cameras are very, very cheap, and they provide a ton of information, a ton of bits.
So it's an extremely cheap sensor for a ton of bits, and each one of these bits has a constraint on the state of the world.
And so you get lots of megapixel images, very cheap, and it just gives you all these constraints for understanding what's actually out there in the world.
So vision is probably the highest bandwidth sensor
It's a very high bandwidth sensor.
It's not just that, but again, this real importance of
It's the sensor that humans use.
Therefore, everything is designed for that sensor.
The text, the writing, the flashing signs, everything is designed for vision.
And so you just find it everywhere.
And so that's why that is the interface you want to be in.
Talking again about these universal interfaces.
And that's where we actually want to measure the world as well and then develop software for that sensor.
I mean, you have a powerful prior for how the world evolves over time, et cetera.