Sholto Douglas

Speaker
1567 total appearances

Podcast Appearances

Dwarkesh Podcast
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Yeah, you can come up to speed reasonably fast.

And it teaches you a lot of good intuitions of the actual intricacies of what's going on in the models, which means that you're then very well-placed to think about architecture and this kind of stuff.

One of my favorite people in thinking about architecture at Anthropic at the moment actually came from a heavy GPU kernel programming background, just knows the ins and outs really deeply and can think about the trade-offs really well.

Yeah, it was fun.

Dwarkesh Podcast
AMA ft. Sholto & Trenton: New Book, Career Advice Given AGI, How I'd Start From Scratch

So in other words, my parents will finally understand what I do for a job.

What do they do?

They're like, cool.

Let's do it.

All right.

So Brian Krav asks, the issue you raised with Dario and occasionally tweet about relating to models not making connections across disparate topics, some sort of combinatorial attention challenge.

What are your thoughts on that now?

Do you solve it with scale, thinking models, or something else?

I think my answer at the moment is that the sort of pre-training objective doesn't necessarily, like it imbues you with this like nice, flexible, general knowledge about the world, but doesn't necessarily imbue you with the, like the skill of making like novel connections or like research.

The kinds of things that people are trained to do through PhD programs and through like sort of the process of exploring and interacting with the world.

And so, yeah, I think, like, at a minimum, you need significant RL in at least similar things to be able to approach, like, making novel discoveries.

And so I would like to see some early evidence of this as we start to build models that are sort of interacting with one, trying to make scientific discoveries, and sort of, like, modeling the behaviors that we expect of people in these positions, because I don't actually think we've done that in, like, a meaningful or scaled way as a field, so to speak.

A little bit like Gwern's theory, optimizer theory, no?

I get asked this question all the time.