Andrej Karpathy
๐ค SpeakerAppearances Over Time
Podcast Appearances
So we wrote the features, but we trained...
the last layer, sort of the classifier.
And then people are like, actually, let's not even design the features because we can't.
Honestly, we're not very good at it.
So let's also learn the features.
And then you end up with basically a convolutional neural net where you're learning most of it.
You're just specifying the architecture.
And the architecture has tons of fill-in-the-blanks, which is all the knobs.
And you let the optimization write most of it.
And so this transition is happening across the industry everywhere.
And suddenly we end up with a ton of code that is written in neural net weights.
And I was just pointing out that the analogy is actually pretty strong.
And we have a lot of developer environments for software 1.0.
Like we have IDEs, how you work with code, how you debug code, how do you run code, how do you maintain code.
We have GitHub.
So I was trying to make those analogies in the new realm.
Like what is the GitHub of software 2.0?
Turns out it's something that looks like Hugging Face right now.
And so I think some people took it seriously and built cool companies.
And many people originally attacked the post.