Andrej Karpathy
๐ค SpeakerAppearances Over Time
Podcast Appearances
but it's extremely inefficient and not how you want to approach problems, practically speaking.
And so that's the approach that at the time we also took to World of Bits.
We would have an agent initialize randomly.
So with keyboard mash and mouse mash and try to make a booking.
And it's just like revealed the insanity of that approach very quickly, where you have to stumble by the correct booking in order to get a reward of you did it correctly.
And you're never going to stumble by it by chance at random.
There's just too many options.
And it's too sparse of a reward signal.
And you're starting from scratch at the time.
And so you don't know how to read.
You don't understand pictures, images, buttons.
You don't understand what it means to make a booking.
But now what's happened is it is time to revisit that.
And OpenAI is interested in this.
Companies like Adept are interested in this and so on.
And the idea is coming back because the interface is very powerful.
But now you're not training an agent from scratch.
You are taking the GPT as an initialization.
So GPT is pre-trained on all of text.
And it understands what's a booking.