Demis Hassabis

Dwarkesh Podcast

Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

It's not just about repeating the same recipe.

1773.399 View full episode →

Dwarkesh Podcast

Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

At each new scale, you have to adjust the recipe.

1776.125 View full episode →

Dwarkesh Podcast

Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

And that's a bit of an art form in a way.

1778.369 View full episode →

Dwarkesh Podcast

Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

And you have to sort of almost get new data points.

1781.094 View full episode →

Dwarkesh Podcast

Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

If you try and extend your predictions, extrapolate them, say several orders of magnitude out, sometimes they don't hold anymore, right?

1783.338 View full episode →

Dwarkesh Podcast

Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

Because new capabilities, they can be step functions in terms of new capabilities and some things hold and other things don't.

1790.531 View full episode →

Dwarkesh Podcast

Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

So often you do need those intermediate data points actually to correct some of your hyperparameter optimization and other things so that the scaling law continues to be true.

1799.227 View full episode →

Dwarkesh Podcast

Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

So there's sort of various practical limitations onto that.

1809.941 View full episode →

Dwarkesh Podcast

Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

So kind of one order of magnitude is about probably the maximum that you want to carry on, you want to sort of do between each era.

1815.448 View full episode →

Dwarkesh Podcast

Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

Yeah, the downstream capabilities sometimes don't follow from the, you can often predict the core metrics like training loss or something like that.

1839.073 View full episode →

Dwarkesh Podcast

Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

But then it doesn't actually translate into MMLU or math or some other actual capability that you care about.

1845.699 View full episode →

Dwarkesh Podcast

Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

they're not necessarily linear all the time.

1853.767 View full episode →

Dwarkesh Podcast

Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

So there's sort of nonlinear effects.

1856.152 View full episode →

Dwarkesh Podcast

Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

Um, well, I, I mean, I wouldn't say there was one big surprise, but it's, it was very interesting, you know, trying to train things at that, at that size and, and, and learning about, um, uh,

1862.385 View full episode →

Dwarkesh Podcast

Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

All sorts of things from organization or how to babysit such a system and to track it.

1872.025 View full episode →

Dwarkesh Podcast

Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

And I think things like getting a better understanding of the metrics you're optimizing versus the final capabilities that you want.

1876.934 View full episode →

Dwarkesh Podcast

Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

I would say that's still not a perfectly understood mapping, but it's an interesting one that we're getting better and better at.

1886.811 View full episode →

Dwarkesh Podcast

Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

I don't think that's the case.

1900.635 View full episode →

Dwarkesh Podcast

Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

I think that actually Gemini 1 used roughly the same amount of compute, maybe slightly more than what was rumored for GPT-4.

1901.797 View full episode →

Dwarkesh Podcast

Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

I don't know exactly what was used.

1910.471 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment