Trenton Bricken

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

And so the three-way convergence here and the takeoff and success of Transformers seems pretty striking to me.

1376.525

Yeah, so maybe my hot take here, I don't know how hot it is, is that most intelligence is pattern matching.

1410.342

And you can do a lot of really good pattern matching if you have a hierarchy of associative memories.

1420.141

You start with your very basic associations between objects in the real world.

1428.297

But you can then chain those and have more abstract associations, such as how a wedding ring symbolizes so many other associations that are downstream.

1434.743

And you can even generalize the attention operation as this associative memory, and the MLP layer as well.

1444.612

It's in a long-term setting where you don't have tokens in your current context.

1453.7

But I think this is an argument that association is all you need.

1458.284

And associative memory in general as well, so you can do two things with it.

1465.11

You can either denoise or retrieve a current memory.

1470.235

So like if I see your face, but it's like raining and cloudy, I can denoise and kind of like gradually update my query towards my memory of your face.
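
[Editor's sketch] The denoising idea can be illustrated with a small attention-style update, where a degraded query is repeatedly replaced by a similarity-weighted average of stored memories. This is only a minimal sketch under assumptions that are not in the conversation: the three stored patterns, the `beta` sharpness value, and the function names are all hypothetical, and the softmax update rule is one common way to model an associative memory, not necessarily the one the speaker has in mind.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def denoise_step(query, memories, beta=1.0):
    """Move the query toward a similarity-weighted mix of the stored memories."""
    weights = softmax([beta * dot(query, m) for m in memories])
    return [sum(w * m[i] for w, m in zip(weights, memories))
            for i in range(len(query))]


# Hypothetical stored patterns (think: three remembered faces), entries are +1/-1.
memories = [
    [1, 1, 1, 1, -1, -1, -1, -1],
    [1, -1, 1, -1, 1, -1, 1, -1],
    [-1, -1, 1, 1, -1, -1, 1, 1],
]

# A degraded view of the first pattern: entries 2 and 5 are flipped.
noisy = [1, 1, -1, 1, -1, 1, -1, -1]

query = noisy
for _ in range(3):
    query = denoise_step(query, memories)

# Threshold back to +1/-1 to compare against the stored pattern.
recovered = [1 if x > 0 else -1 for x in query]
```

In this toy setup the noisy query overlaps most with the first stored pattern, so the updates pull it back to that pattern and the flipped entries are repaired.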

1473.458

But I can also access that memory, and then the value that I get out actually points to some other totally different part of the space.

1482.966

And so a very simple instance of this would be if you learn the alphabet, right?

1491.419

And so I query for A and it returns B, I query for B and it returns C, and you can traverse the whole thing.
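
[Editor's sketch] The alphabet example can be written as a key-value memory in which each key encodes a letter and its value encodes the next letter, so that feeding each retrieved value back in as the next query walks the whole sequence. This is a minimal sketch under assumptions not stated in the conversation: the one-hot encoding, the `beta` sharpness value, and the names `retrieve`, `decode`, and `traverse` are hypothetical choices made for illustration.

```python
import math
import string

LETTERS = string.ascii_uppercase
SIZE = len(LETTERS)


def one_hot(index, size=SIZE):
    return [1.0 if i == index else 0.0 for i in range(size)]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


# keys[i] encodes a letter; values[i] encodes the letter that follows it.
# Z has no successor, so there are 25 stored pairs.
keys = [one_hot(i) for i in range(SIZE - 1)]
values = [one_hot(i + 1) for i in range(SIZE - 1)]


def retrieve(query, beta=10.0):
    """Attention-style lookup: weight each value by how well its key matches."""
    weights = softmax([beta * dot(query, k) for k in keys])
    return [sum(w * v[i] for w, v in zip(weights, values)) for i in range(SIZE)]


def decode(vector):
    """Read out the letter with the largest activation."""
    return LETTERS[max(range(SIZE), key=lambda i: vector[i])]


def traverse(start, steps):
    """Chain lookups: each retrieved value becomes the next query."""
    query = one_hot(LETTERS.index(start))
    visited = [start]
    for _ in range(steps):
        query = retrieve(query)
        visited.append(decode(query))
    return "".join(visited)
```

Calling `traverse("A", 25)` walks from A through Z. Unlike denoising, where the output lands back on the stored pattern itself, here the retrieved value points to a different item than the one queried.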

1495.069

Yeah.

1502.89

I think so, yeah.

1570.368

So I think learning these higher-level associations, to be able to then map patterns to each other, is kind of like meta-learning.

1571.65

I think in this case, he would also just have a really long context length or a really long working memory, right?

1577.557

Where he can like have all of these bits and continuously query them as he's coming up with whatever theory so that the theory is moving through the residual stream.

1583.325