Trenton Bricken

Speaker
1589 total appearances

Podcast Appearances

Dwarkesh Podcast
Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

Like it's a good reason.

So, I guess context for listeners.

The induction head is basically this: you see a line like "Mr. and Mrs. Dursley did something."

Mr. Blank.

And you're trying to predict what blank is.

And the head has learned to look for previous occurrences of the word "Mr.",

look at the word that comes after it, and then copy and paste that as the prediction for what should come next, which is a super reasonable thing to do.

And there is computation being done there to accurately predict the next token.
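The lookup rule described above (find an earlier occurrence of the current token, then copy whatever followed it) can be sketched in a few lines of Python. This is only a toy illustration over a list of string tokens, not how an attention head is actually implemented; the function name `induction_predict` and the simplified "Mr. Dursley" context are assumptions made for the example.

```python
from typing import Optional, Sequence


def induction_predict(tokens: Sequence[str]) -> Optional[str]:
    """Toy induction rule (hypothetical helper, not a real model component).

    Find the most recent earlier occurrence of the current (last) token
    and return the token that followed it, or None if there is no match.
    """
    if not tokens:
        return None
    current = tokens[-1]
    # Scan backwards over earlier positions, skipping the final token itself.
    for i in range(len(tokens) - 2, -1, -1):
        if tokens[i] == current:
            return tokens[i + 1]
    return None


# Simplified context: "Mr. Dursley did something. Mr. [blank]"
context = ["Mr.", "Dursley", "did", "something", ".", "Mr."]
print(induction_predict(context))
```

A real induction head does this softly, through attention weights over the context, rather than with an exact string match.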

But yeah, that is context dependent. But it's not like reasoning, you know what I mean? I guess, going back to the "associations all the way down" idea, it's like if you chain together a bunch of these reasoning circuits, or heads that have different rules for how to relate information. But in the sort of zero-shot case...

Well, I think there would be another circuit for extracting pixels and turning them into latent representations of the different objects in the game, right?

And a circuit that is learning physics.

Or like, I mean, that would just be an empirical question, right?

Of like, how big does the model need to be to perform this task?

But like, I mean, maybe it's useful if I just talk about some other circuits that we've seen.

So we've seen like the IOI circuit, which is the indirect object identification.

And so this is like, if you see "Mary and Jim went to the store, Jim gave the object to blank," right?

And it would predict Mary because Mary's appeared before as like the indirect object.

Or it'll infer pronouns, right?
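As a rough illustration of the indirect object identification task described above, here is a toy Python heuristic: among the names in the context, predict the one that has not been repeated. This is a deliberately crude stand-in for the behavior, not the circuit itself; the function name `ioi_predict` and the explicit `names` list are assumptions made for the example.

```python
from typing import Dict, List, Optional


def ioi_predict(tokens: List[str], names: List[str]) -> Optional[str]:
    """Toy IOI heuristic (hypothetical helper, not the real circuit).

    Count how often each known name appears in the context and return
    the first name that appears exactly once, i.e. the one that has not
    been repeated as the subject of the second clause.
    """
    counts: Dict[str, int] = {}
    for tok in tokens:
        if tok in names:
            counts[tok] = counts.get(tok, 0) + 1
    # Dicts preserve insertion order, so names are checked in order of
    # first appearance in the context.
    for name, n in counts.items():
        if n == 1:
            return name
    return None


# "Mary and Jim went to the store, Jim gave the object to [blank]"
context = "Mary and Jim went to the store , Jim gave the object to".split()
print(ioi_predict(context, names=["Mary", "Jim"]))
```

In a model, the same outcome comes from several attention heads working together rather than from an explicit count.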

And this circuit even has behavior where if you ablate it, then other heads in the model will pick up that behavior.

We'll even find heads that want to do copying behavior, and then other heads will suppress.