Gwern Branwen

Gwern Branwen - How an Anonymous Researcher Predicted AI's Trajectory

Then it turns out that everywhere you go, compute and data and trial and error and serendipity just play enormous roles in how things actually happened.

Dwarkesh Podcast

Once you understand that, then you understand why compute comes first.

You can't do trial and error and serendipity without it.

You can write down all these beautiful ideas, but you just can't test them out.

So even a small difference in hyperparameters or a small choice of architecture can make a huge difference to the results.

But when you can only do a few instances, you would typically end up finding that it just doesn't work, or maybe you would give up and you would go away and do something else.

Whereas if you had more compute power, you can just keep trying.

Eventually you hit something that works great.

And once you have a working solution, you can kind of simplify it and improve it and figure out why it worked and get a nice robust solution that would work no matter what you did to it.

But until then you're stuck and you're just kind of like flailing around in this regime where nothing works.

You know, you can have this horrible experience now where you go back through the old deep learning literature and see all these sorts of contemporary ideas that people had back then, which were completely correct, but they didn't have the compute to train what you know would have worked.

You know, and it's tremendously tragic, right?

You go back and you can look at things like ResNets being published back in 1988 instead of 2015.

And it would have worked.