Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing

Dwarkesh Patel

👤 Person
12212 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

So that's a 35 million fold difference in how much information per token is assimilated by the model.

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

I wonder if that's relevant at all.

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

Stepping back, what is the part about human intelligence that we have most failed to replicate with these models?

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

This is maybe relevant to the question of thinking about how fast these issues will be solved.

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

So sometimes people will say about continual learning, look, actually, you could easily replicate this capability just as in-context learning emerged spontaneously as a result of pre-training.

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

Continual learning over longer horizons will emerge spontaneously if the model is incentivized to recollect information over longer horizons or horizons longer than one session.

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

So if there's some like outer loop RL, which...

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

it has many sessions within that outer loop, then like this continual learning where it uses like, it fine tunes itself or it writes to an external memory or something will just sort of like emerge spontaneously.

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

Do you think, do you think things are things that are plausible?

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

I just, I don't have really a prior over like how plausible is that?

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

How likely is that to happen?

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

Interesting.

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

In 10 years, do you think it'll still be something like a transformer, but with a much more modified attention and more sparse MLPs and so forth?

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

It's surprising that all of those things together are...

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

only halved half of the error, which is like 30 years of progress.

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

Maybe half is a lot, because if you halve the error, that actually means that... Half is a lot, yeah.

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

Yeah, actually, I was about to ask a very similar question about NanoChat.

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

Because since you just coded up recently, every single sort of step in the process of building a chatbot is like fresh in your RAM.

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

And I'm curious if you had similar thoughts about like, oh, there was no one thing that was relevant to going from...

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

GPT-2 to NanoChat.