Gwern Branwen

Gwern Branwen - How an Anonymous Researcher Predicted AI's Trajectory

And I would just pay attention and notice that the world over time looked more like their world than it looked like my world, where algorithms are super important and you need like deep insight to do stuff, you know.

882.413 View full episode →

Dwarkesh Podcast

Gwern Branwen - How an Anonymous Researcher Predicted AI's Trajectory

happening.

897.747 View full episode →

Dwarkesh Podcast

Gwern Branwen - How an Anonymous Researcher Predicted AI's Trajectory

And then GPD-1 came out, and I was like, wow, this unsupervised sentiment neuron is just learning on its own, right?

899.149 View full episode →

Dwarkesh Podcast

Gwern Branwen - How an Anonymous Researcher Predicted AI's Trajectory

That seemed pretty amazing.

907.184 View full episode →

Dwarkesh Podcast

Gwern Branwen - How an Anonymous Researcher Predicted AI's Trajectory

It also was a very compute-centric view.

909.307 View full episode →

Dwarkesh Podcast

Gwern Branwen - How an Anonymous Researcher Predicted AI's Trajectory

You just build the transformer, and the intelligence will come.

911.411 View full episode →

Dwarkesh Podcast

Gwern Branwen - How an Anonymous Researcher Predicted AI's Trajectory

And then GPD-2 came out, and I had this holy shit moment.

914.657 View full episode →

Dwarkesh Podcast

Gwern Branwen - How an Anonymous Researcher Predicted AI's Trajectory

You look at the prompting and the summarization, like, holy shit, do we live in their world?

918.824 View full episode →

Dwarkesh Podcast

Gwern Branwen - How an Anonymous Researcher Predicted AI's Trajectory

And then GPT-3 comes out, and that was really the crucial test.

924.054 View full episode →

Dwarkesh Podcast

Gwern Branwen - How an Anonymous Researcher Predicted AI's Trajectory

It was a huge, huge scale-up, one of the biggest scale-ups in all of neural network history, going from GPT-2 to GPT-3.

927.721 View full episode →

Dwarkesh Podcast

Gwern Branwen - How an Anonymous Researcher Predicted AI's Trajectory

And it wasn't like it was a super narrow, specific task like Go.

934.214 View full episode →

Dwarkesh Podcast

Gwern Branwen - How an Anonymous Researcher Predicted AI's Trajectory

It really seemed like it was the crucial task.

938.1 View full episode →

Dwarkesh Podcast

Gwern Branwen - How an Anonymous Researcher Predicted AI's Trajectory

If scaling was bogus, then the GBD-3 paper should have just been totally unimpressive and wouldn't show anything that important.

940.523 View full episode →

Dwarkesh Podcast

Gwern Branwen - How an Anonymous Researcher Predicted AI's Trajectory

Whereas if scaling were true, you would just automatically be guaranteed to get so much more impressive results out of it than you had seen with GBD-2.

946.993 View full episode →

Dwarkesh Podcast

Gwern Branwen - How an Anonymous Researcher Predicted AI's Trajectory

So I opened up the first page, maybe the second page, and I saw a few-shot learning chart.

955.185 View full episode →

Dwarkesh Podcast

Gwern Branwen - How an Anonymous Researcher Predicted AI's Trajectory

And I'm like, holy shit, we are living in the scaling world.

959.712 View full episode →

Dwarkesh Podcast

Gwern Branwen - How an Anonymous Researcher Predicted AI's Trajectory

Leg and Moravec and Kurzweil were right.