Trenton Bricken

👤 Speaker

See mentions of this person in podcasts

1589 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

Like it's a good reason.

8224.901 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

So, I guess context for listeners.

8229.605 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

The induction head is basically and you see the line like Mr. and Mrs. Dursley did something.

8231.587 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

Mr. Blank.

8237.632 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

And you're trying to predict what blank is.

8238.693 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

And the head has learned to look for previous occurrences of the word Mr.

8240.775 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

look at the word that comes after it, and then copy and paste that as the prediction for what should come next, which is a super reasonable thing to do.

8245.139 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

And there is computation being done there to accurately predict the next token.

8253.702 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

but yeah that is context dependent that is yeah yeah but it's not like it's not like reasoning you know what i mean like but but is is i guess going back to the like associations all the way down it's like if you chain together a bunch of these uh reasoning circuits or or uh heads that have different rules for how to relate information but but in the sort of like zero shot case uh

8260.917 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

Well, I think there would be another circuit for extracting pixels and turning them into latent representations of the different objects in the game, right?

8292.462 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

And a circuit that is learning physics.

8302.434 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

Or like, I mean, that would just be an empirical question, right?

8330.032 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

Of like, how big does the model need to be to perform this task?

8333.675 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

But like, I mean, maybe it's useful if I just talk about some other circuits that we've seen.

8335.697 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

So we've seen like the IOI circuit, which is the indirect object identification.

8338.8 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

And so this is like, if you see, it's like Mary and Jim went to the store, Jim gave the object to blank, right?

8344.085 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

And it would predict Mary because Mary's appeared before as like the indirect object.

8353.193 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

or it'll infer pronouns, right?

8357.037 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

And this circuit even has behavior where if you ablate it, then other heads in the model will pick up that behavior.

8360.58 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

We'll even find heads that want to do copying behavior, and then other heads will suppress.

8370.489 View full episode →

← Previous Page 64 of 80 Next →

Report any issue