Trenton Bricken

But yeah, my next work was on sparsity in networks, like inspired by sparsity in the brain, which was when I met Tristan Hume and Anthropic was doing the SOLU, the soft max linear output unit work, which was very related in quite a few ways of like, let's make the activation of neurons across a layer really sparse.

6809.003 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

And if we do that, then we can get some interpretability of what the neuron's doing.

6828.209 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

I think we've updated on that approach towards what we're doing now.

6831.453 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

So that started the conversation.

6835.698 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

I shared drafts of that paper with Tristan.

6836.979 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

He was excited about it.

6838.521 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

And that was basically what led me to become Tristan's resident and then convert to full-time.

6839.963 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

But during that period, I also moved as a visiting researcher to Berkeley and started working with Bruno Olshausen, both on what's called vector symbolic architectures, which one of the core operations of them is literally superposition.

6846.23 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

and on sparse coding, also known as dictionary learning, which is literally what we've been doing since.

6861.187 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

And Bruno Olshausen basically invented sparse coding back in 1997.

6867.233 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

And so it was like my research agenda and the interpretability team seemed to just be running in parallel with just research tastes.

6871.458 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

And so, yeah, it made a lot of sense for me to work with the team.

6882.089 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

Um, and it's been a dream since.

6885.993 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

Maybe you're right, but it's this sort of interesting pattern that... Yeah, but I mean, I literally met Tristan at a conference and didn't have a scheduled meeting or anything, just joined a little group of people chatting.

6912.487 View full episode →

Dwarkesh Podcast

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

And he happened to be standing there and I happened to mention what I was working on.

6924.425 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment