Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Sholto Douglas

๐Ÿ‘ค Speaker
1567 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

the thing that connects the two halves of the brain.

Dwarkesh Podcast
Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

And then, yeah, the speech half is on the left side, so it's not connected to the part that decides to do a movement.

Dwarkesh Podcast
Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

And so if the other side decides to do something, the speech part will just make something up, and the person will think that's legit the reason they did it.

Dwarkesh Podcast
Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

What will this landscape of models communicating to themselves in ways we don't understand, how does that change with AI agents?

Dwarkesh Podcast
Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

Because then these things will, it's not just like the model itself with its previous caches, but like other instances of the model.

Dwarkesh Podcast
Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

How much more effective do you think the models would be if they could share the residual streams versus just text?

Dwarkesh Podcast
Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

Hard to know.

Dwarkesh Podcast
Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

So for the audience, you would project the residual stream into this larger space where we know what each dimension actually corresponds to and then back into the next agents or whatever.

Dwarkesh Podcast
Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

So your claim is that we'll get AI agents when these things can...

Dwarkesh Podcast
Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

are more reliable and so forth.

Dwarkesh Podcast
Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

When that happens, do you expect that it will be multiple copies of models talking to each other?

Dwarkesh Podcast
Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

Or will it be just an adaptive computer solved and the thing just like runs bigger, like more compute when it needs to do a kind of thing that a whole firm needs to do?

Dwarkesh Podcast
Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

And I ask this because there's two things that make me wonder about like whether agents is the right way to think about what will happen in the future.

Dwarkesh Podcast
Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

One is with longer contexts,

Dwarkesh Podcast
Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

these models are able to ingest and consider the information that no human can.

Dwarkesh Podcast
Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

And therefore we need like one engineer who's thinking about the front-end code and one engineer who's thinking about the back-end code, where this thing can just ingest the whole thing.

Dwarkesh Podcast
Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

This sort of like Hayekian problem of specialization goes away.

Dwarkesh Podcast
Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

Second, these models are just very general.

Dwarkesh Podcast
Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

You're not using different types of GPT-4 to do different kinds of things.

Dwarkesh Podcast
Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

You're using the exact same model, right?