Dwarkesh Patel

👤 Speaker
15656 total appearances
Voice ID

Voice Profile Active

This person's voice can be automatically recognized across podcast episodes using AI voice matching.

Voice samples: 1
Confidence: Medium

Podcast Appearances

Dwarkesh Podcast
Andrej Karpathy — AGI is still a decade away

And then adults are somewhere in between where they don't have the flexibility of childhood learning, but they can, you know, adults can memorize facts and information in a way that is harder for kids.

And I don't know if there's something interesting about that

And this is also relevant to preventing model collapse.

Let me think.

Hmm.

What is a solution to model collapse?

I mean, there's very naive things you could attempt.

It's just like the distribution over logits should be wider or something.

Like, there's many naive things you could try.

What ends up being the problem with the naive approaches?

In fact, it's actively penalized, right?

If you're like super creative in RL, it's like not good.

And then I think you hinted that it's a very fundamental problem.

It won't be easy to solve.

What's your intuition for that?

How many bits should the optimal core of intelligence end up being if you just had to make a guess?

The thing we put on the von Neumann probes, how big does it have to be?

That's actually surprising that you think it will take a billion, because already we have a billion parameter models, or a couple billion parameter models that are like very intelligent.

Well, some of our models are like a trillion parameters, right?

But they remember so much stuff.