Gwern Branwen

👤 Person
855 total appearances

Podcast Appearances

Dwarkesh Podcast
Gwern Branwen - How an Anonymous Researcher Predicted AI's Trajectory

And I would just pay attention and notice that the world over time looked more like their world than it looked like my world, where algorithms are super important and you need like deep insight to do stuff, you know.

happening.

And then GPT-1 came out, and I was like, wow, this unsupervised sentiment neuron is just learning on its own, right?

That seemed pretty amazing.

It also was a very compute-centric view.

You just build the transformer, and the intelligence will come.

And then GPT-2 came out, and I had this holy shit moment.

You look at the prompting and the summarization, like, holy shit, do we live in their world?

And then GPT-3 comes out, and that was really the crucial test.

It was a huge, huge scale-up, one of the biggest scale-ups in all of neural network history, going from GPT-2 to GPT-3.

And it wasn't like it was a super narrow, specific task like Go.

It really seemed like it was the crucial task.

If scaling was bogus, then the GPT-3 paper should have just been totally unimpressive and wouldn't show anything that important.

Whereas if scaling were true, you would just automatically be guaranteed to get so much more impressive results out of it than you had seen with GPT-2.

So I opened up the first page, maybe the second page, and I saw a few-shot learning chart.

And I'm like, holy shit, we are living in the scaling world.

Legg and Moravec and Kurzweil were right.

Then I turned to Twitter and everyone else was like, oh, you know, this shows that scaling works so badly.

Why?

It's not even state of the art.