Cal Newport

👤 Speaker
1220 total appearances

Podcast Appearances

Modern Wisdom
#1067 - Cal Newport - The collapse of modern attention (and how to get it back)

That sounds kind of obvious, but in like machine learning circles, that was surprising because there's this idea of overfitting where if you just make your model bigger, the performance goes down.

So it used to be like you have to find the perfect size model for your problem space.

That's the way people thought about machine learning until this paper came out.

And like, I don't know, transformer-based LLMs, they were using GPT-2 and they were systematically making it bigger and they were seeing that the performance just kept going up.

Like, this is interesting.

So let's try it.

And that was GPT-3.

All right, let's actually make this like 10x bigger.

Surely this can't be right.

And it was.

It matched the Kaplan curve exactly.

Like, oh my God, this actually got way better just by making this bigger.

Like, all right, well, certainly that must be the end of it.

Let's try it with GPT-4.

They made it bigger.

They trained it much longer.

Months and months they trained it.

Microsoft had to build these custom data centers to train it with new AC technology that didn't exist before.

And it fit the curve.

It was like way better.