Trenton Bricken
And I think it's because we just haven't been able to make sense of it.
What is V2?
It's like the next part of the visual processing stream.
And yeah, so I think it's very likely.
And fundamentally, superposition seems to emerge when you have high-dimensional data that is sparse.
And to the extent that you think the real world is like that, which I would argue it is, we should expect the brain to also be underparameterized in trying to build a model of the world, and also to use superposition.
It's a combinatorial code.
Yeah, exactly.
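To make the superposition claim concrete, here is a minimal numpy sketch (not from the conversation; the sizes and variable names are illustrative assumptions): many sparse features are stored as a sum of random directions in a much smaller space, and a simple dot-product readout still recovers which ones were active.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes: many sparse features squeezed into fewer dimensions.
n_features, n_dims, n_active = 1000, 200, 5

# Each feature gets a random (nearly orthogonal) direction in the small space.
feature_dirs = rng.standard_normal((n_features, n_dims))
feature_dirs /= np.linalg.norm(feature_dirs, axis=1, keepdims=True)

# A sparse input: only a handful of features are "on".
active = rng.choice(n_features, size=n_active, replace=False)
x = feature_dirs[active].sum(axis=0)  # store them in superposition

# Read out every feature by dot product: active features score near 1,
# inactive ones only pick up small interference noise.
scores = feature_dirs @ x
recovered = np.argsort(scores)[-n_active:]
print(sorted(active), sorted(recovered))  # should match with high probability
```

The interference term shrinks as the space gets higher-dimensional and the data sparser, which is the regime being described here.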
Well, actually, that's a great segue because all of this feels like GOFAI.
Like you're using distributed representations.
But you have features, and you're applying these operations to the features.
I mean, in the whole field of vector symbolic architectures, which is this computational neuroscience thing, all you do is put vectors in superposition, which is literally a summation of two high-dimensional vectors.
And you create some interference, but if it's high-dimensional enough, then you can still represent them. And you have variable binding, where you connect one with another. If you're dealing with binary vectors, it's just the XOR operation: you have A and B, you bind them together, and then if you query with A or B again, you get out the other one.
And this is basically like the key-value pairs from attention.
And with these two operations, you have a Turing-complete system, with which, if you have enough nested hierarchy, you can represent any data structure you want, etc., etc.
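As a rough sketch of the binary vector-symbolic-architecture operations being described (the helper names, the 10,000-bit dimension, and the role-filler example are illustrative assumptions, not anything from the transcript): superposition is an elementwise majority vote over summed vectors, binding is XOR, and querying a bound record with a key returns a noisy copy of its value that you would clean up by comparing against the known vectors.

```python
import numpy as np

rng = np.random.default_rng(0)
dim = 10_000  # high dimensionality keeps random vectors nearly uncorrelated

def rand_vec():
    """Random dense binary hypervector."""
    return rng.integers(0, 2, size=dim, dtype=np.uint8)

def bind(a, b):
    """Variable binding for binary vectors is just elementwise XOR."""
    return np.bitwise_xor(a, b)

def bundle(*vs):
    """Superposition: elementwise majority vote over the summed vectors."""
    return (np.sum(vs, axis=0) > len(vs) / 2).astype(np.uint8)

def similarity(a, b):
    """1.0 = identical; ~0.5 = unrelated random binary vectors."""
    return float(np.mean(a == b))

# A tiny key-value record: {color: red, shape: square, size: small}
color, red = rand_vec(), rand_vec()
shape, square = rand_vec(), rand_vec()
size, small = rand_vec(), rand_vec()
record = bundle(bind(color, red), bind(shape, square), bind(size, small))

# Query with a key: XOR is its own inverse, so bind(record, color) is a noisy
# version of red, plus interference from the other bound pairs.
print(similarity(bind(record, color), red))     # well above 0.5
print(similarity(bind(record, color), square))  # ~0.5, i.e. just noise
```

Because XOR is its own inverse, the same operation both binds and unbinds, which is the "query with A or B and get out the other one" behavior mentioned above.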
We try to get it to do as much interpretability work and other safety work as possible.
I mean, we have our responsible scaling policy, which has been really exciting to see other labs adopt.
I mean, if it's as capable as GPT-7 implies here, I think we need to make a lot more interpretability progress to be able to comfortably give the green light to deploy it.
I would be like, definitely not.