Sholto Douglas
And yeah.
Yeah.
Another dinner party question.
Should we be less worried about misalignment (and maybe that's not even the right word for what I'm referring to) but, like, just alienness and strangeness from these models, given that there is feature universality, and there are certain ways of thinking and of understanding the world that are instrumentally useful to different kinds of intelligences?
Should we just be less worried about, like, bizarro paperclip maximizers as a result?
It has a denser representation of regions that are particularly relevant to predicting the next token.
That particular example... I wonder if that implies that doing interpretability on smarter models will be harder, because it requires somebody with esoteric knowledge who just happened to notice that Base64 has, I don't know, whatever that distinction was.
Doesn't that imply that when you have a million-line pull request, no human is going to be able to decode, like, the two different reasons why the pull request exists, the two different features for this pull request?
Yeah, you know what I mean?
And that's when you type a comment like, "Small CLs, please."
...thing between models where you have millions of features, potentially, for GPT-6, and a bunch of models are just trying to figure out what each of these features means.
Does that sound right?
Yeah.
I want to talk more about the feature splitting because I think that's an interesting thing that has been under-explored.
First of all, how do we even think about it? Is it really just that you can keep going down and down, and there's no end to the number of features?
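For readers unfamiliar with the setup being discussed: feature splitting shows up in dictionary-learning interpretability, where a sparse autoencoder (SAE) maps a model's activations into a wider, sparse feature space, and widening the dictionary tends to split one coarse feature into several finer ones. Here is a minimal sketch of the SAE forward pass under that framing; all names, sizes, and the random weights are illustrative, not taken from any real model.

```python
import numpy as np

rng = np.random.default_rng(0)

d_model = 8        # width of the activation vector being dictionary-learned (hypothetical)
n_features = 32    # dictionary size; retraining with a larger value is where splitting appears

# Random weights stand in for trained encoder/decoder matrices.
W_enc = rng.normal(size=(d_model, n_features))
b_enc = np.zeros(n_features)
W_dec = rng.normal(size=(n_features, d_model))
b_dec = np.zeros(d_model)

def sae_forward(x):
    """Encode an activation vector into sparse, nonnegative features, then reconstruct it."""
    f = np.maximum(0.0, x @ W_enc + b_enc)  # ReLU keeps feature activations nonnegative
    x_hat = f @ W_dec + b_dec               # reconstruction from the feature dictionary
    return f, x_hat

x = rng.normal(size=d_model)
features, reconstruction = sae_forward(x)
print(features.shape, reconstruction.shape)  # (32,) (8,)
```

The "going down and down" question then becomes: as `n_features` grows, do you keep uncovering genuinely finer-grained features, or is there a floor?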