Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing

Demis Hassabis

πŸ‘€ Speaker
See mentions of this person in podcasts
3240 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

It's not just about repeating the same recipe.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

At each new scale, you have to adjust the recipe.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

And that's a bit of an art form in a way.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

And you have to sort of almost get new data points.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

If you try and extend your predictions, extrapolate them, say several orders of magnitude out, sometimes they don't hold anymore, right?

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

Because new capabilities, they can be step functions in terms of new capabilities and some things hold and other things don't.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

So often you do need those intermediate data points actually to correct some of your hyperparameter optimization and other things so that the scaling law continues to be true.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

So there's sort of various practical limitations onto that.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

So kind of one order of magnitude is about probably the maximum that you want to carry on, you want to sort of do between each era.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

Yeah, the downstream capabilities sometimes don't follow from the, you can often predict the core metrics like training loss or something like that.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

But then it doesn't actually translate into MMLU or math or some other actual capability that you care about.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

they're not necessarily linear all the time.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

So there's sort of nonlinear effects.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

Um, well, I, I mean, I wouldn't say there was one big surprise, but it's, it was very interesting, you know, trying to train things at that, at that size and, and, and learning about, um, uh,

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

All sorts of things from organization or how to babysit such a system and to track it.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

And I think things like getting a better understanding of the metrics you're optimizing versus the final capabilities that you want.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

I would say that's still not a perfectly understood mapping, but it's an interesting one that we're getting better and better at.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

I don't think that's the case.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

I think that actually Gemini 1 used roughly the same amount of compute, maybe slightly more than what was rumored for GPT-4.

Dwarkesh Podcast
Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

I don't know exactly what was used.