Dwarkesh Patel
Podcast Appearances
And then adults are somewhere in between: they don't have the flexibility of childhood learning, but they can memorize facts and information in a way that is harder for kids.
And I don't know if there's something interesting about that.
And this is also relevant to preventing model collapse.
Let me think.
What is a solution to model collapse?
I mean, there's very naive things you could attempt.
It's just like the distribution over logits should be wider or something.
Like, there's many naive things you could try.
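[Editor's sketch: one way to picture the "wider distribution" idea mentioned above is temperature scaling of the logits before the softmax, which raises the entropy of the resulting token distribution. This is only an illustration of that naive approach, not something described in the conversation; the logit values and the helper names `softmax` and `entropy` are made up for the example.]

```python
import math

def softmax(logits, temperature=1.0):
    """Convert logits to probabilities; a higher temperature flattens the result."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def entropy(probs):
    """Shannon entropy in nats; larger means a wider distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)

# Hypothetical logits for four candidate tokens (illustrative values only).
logits = [4.0, 2.0, 1.0, 0.5]

sharp = softmax(logits, temperature=1.0)
wide = softmax(logits, temperature=2.0)

print("entropy at T=1.0:", entropy(sharp))
print("entropy at T=2.0:", entropy(wide))
```

[Widening the distribution this way spreads probability onto less likely tokens without any check on whether those tokens are good ones, which is roughly the kind of trade-off the next question asks about.]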
What ends up being the problem with the naive approaches?
In fact, it's actively penalized, right?
If you're like super creative in RL, it's like not good.
And then I think you hinted that it's a very fundamental problem.
It won't be easy to solve.
What's your intuition for that?
How many bits should the optimal core of intelligence end up being if you just had to make a guess?
The thing we put on the von Neumann probes, how big does it have to be?
That's actually surprising that you think it will take a billion, because we already have billion-parameter models, or models with a couple billion parameters, that are very intelligent.
Well, some of our models are like a trillion parameters, right?
But they remember so much stuff.