Andrej Karpathy
π€ SpeakerAppearances Over Time
Podcast Appearances
The thing is, cameras are very, very cheap, and they provide a ton of information, a ton of bits.
So it's an extremely cheap sensor for a ton of bits, and each one of these bits has a constraint on the state of the world.
And so you get lots of megapixel images, very cheap, and it just gives you all these constraints for understanding what's actually out there in the world.
So vision is probably the highest bandwidth sensor
It's a very high bandwidth sensor.
It's not just that, but again, this real importance of
It's the sensor that humans use.
Therefore, everything is designed for that sensor.
The text, the writing, the flashing signs, everything is designed for vision.
And so you just find it everywhere.
And so that's why that is the interface you want to be in.
Talking again about these universal interfaces.
And that's where we actually want to measure the world as well and then develop software for that sensor.
I mean, you have a powerful prior for how the world evolves over time, et cetera.
So it's not just about the likelihood term coming up from the data itself telling you about what you are observing, but also the prior term of where are the likely things to see and how do they likely move and so on.
Yeah, driving is really hard.
Because it has to do with the predictions of all these other agents and the theory of mind and, you know, what they're going to do.
And are they looking at you?
Are they, where are they looking?
Where are they thinking?