Dwarkesh Podcast
Is RL + LLMs enough for AGI? β Sholto Douglas & Trenton Bricken
OK, I've raised all this money.
Dwarkesh Podcast
Is RL + LLMs enough for AGI? β Sholto Douglas & Trenton Bricken
Do I spend it along this axis, or do I spend it on this axis?
Dwarkesh Podcast
Is RL + LLMs enough for AGI? β Sholto Douglas & Trenton Bricken
And currently, the companies are spending more on compute than they are on humans.
Dwarkesh Podcast
Is RL + LLMs enough for AGI? β Sholto Douglas & Trenton Bricken
Otherwise, scale AI's revenue would be like $10 billion.
Dwarkesh Podcast
Is RL + LLMs enough for AGI? β Sholto Douglas & Trenton Bricken
Look at it.
Dwarkesh Podcast
Is RL + LLMs enough for AGI? β Sholto Douglas & Trenton Bricken
NVIDIA's revenue is much higher than scale AI's revenue.
Dwarkesh Podcast
Is RL + LLMs enough for AGI? β Sholto Douglas & Trenton Bricken
And so currently, the equation is compute over data.
Dwarkesh Podcast
Is RL + LLMs enough for AGI? β Sholto Douglas & Trenton Bricken
Like, that will evolve in some way over time, but... Yeah, interesting.
Dwarkesh Podcast
Is RL + LLMs enough for AGI? β Sholto Douglas & Trenton Bricken
But even in light of all of this- The language result is really cool.
Dwarkesh Podcast
Is RL + LLMs enough for AGI? β Sholto Douglas & Trenton Bricken
You should talk about the language result.
Dwarkesh Podcast
Is RL + LLMs enough for AGI? β Sholto Douglas & Trenton Bricken
You know, how smaller models have separate neurons for different languages, whereas larger models end up sharing more and more like an abstract space.
Dwarkesh Podcast
Is RL + LLMs enough for AGI? β Sholto Douglas & Trenton Bricken
But, like, strikingly, that is more so the case in larger models, where you'd think, like, actually larger models have more space, so they could, like, separate things out more.
Dwarkesh Podcast
Is RL + LLMs enough for AGI? β Sholto Douglas & Trenton Bricken
But actually, instead, they seem to pull on these, like, larger abstract, on better abstractions.
Dwarkesh Podcast
Is RL + LLMs enough for AGI? β Sholto Douglas & Trenton Bricken
Yeah.
Dwarkesh Podcast
Is RL + LLMs enough for AGI? β Sholto Douglas & Trenton Bricken
Which is very interesting.
Dwarkesh Podcast
Is RL + LLMs enough for AGI? β Sholto Douglas & Trenton Bricken
With that being said, I do think it's like your point on are these models as sample efficient as humans?
Dwarkesh Podcast
Is RL + LLMs enough for AGI? β Sholto Douglas & Trenton Bricken
Currently, we do not have evidence that they're as sample efficient as humans.
Dwarkesh Podcast
Is RL + LLMs enough for AGI? β Sholto Douglas & Trenton Bricken
I think we have evidence of like total complexity ceiling.
Dwarkesh Podcast
Is RL + LLMs enough for AGI? β Sholto Douglas & Trenton Bricken
Like there are currently nothing that provides you have a clean enough signal you can't teach them.
Dwarkesh Podcast
Is RL + LLMs enough for AGI? β Sholto Douglas & Trenton Bricken
But we don't have evidence that like we can teach them as fast as humans do.