Dwarkesh Patel

Speaker
14445 total appearances

Podcast Appearances

Dwarkesh Podcast
Andrej Karpathy - AGI is still a decade away

it has many sessions within that outer loop, then like this continual learning where it uses like, it fine tunes itself or it writes to an external memory or something will just sort of like emerge spontaneously.

Do you think these are things that are plausible?

I just don't really have a prior over how plausible that is.

How likely is that to happen?

Interesting.

In 10 years, do you think it'll still be something like a transformer, but with a much more modified attention and more sparse MLPs and so forth?

It's surprising that all of those things together are...

only halved the error, which is like 30 years of progress.

Maybe half is a lot, because if you halve the error, that actually means that... Half is a lot, yeah.

Yeah, actually, I was about to ask a very similar question about NanoChat.

Because since you just coded it up recently, every single step in the process of building a chatbot is like fresh in your RAM.

And I'm curious if you had similar thoughts about like, oh, there was no one thing that was relevant to going from...

GPT-2 to NanoChat.

What are sort of like surprising takeaways from the experience?

Yeah.

What is the best way for somebody to learn from it?

Is it just like delete all the code and try to re-implement it from scratch, or try to add modifications to it?

Yeah, I think that's a great question.

Interesting.

You tweeted out that coding models were actually of very little help to you in assembling this repository.