Sholto Douglas
Because if it is the case that you learn certain things first, shouldn't just directly training those things first lead to better results?
Both Gemini papers mention some aspects of curriculum learning.
Okay, interesting.
The fact that fine-tuning works is evidence of curriculum learning, right?
Because the last things you're training on have a disproportionate impact.
Sorry, what was the thing we were talking about before?
By the way, I just realized, I forgot to...
I just got into conversation mode and forgot there's an audience.
Curriculum learning is when you organize a dataset.
When you think about how a human learns, they don't just see random wiki text and try to predict it.
They're like, we'll start you off with The Lorax or something, and then you'll learn... I don't even remember what first grade was like, but you'll learn the things that first graders learn, then second graders, and so forth.
Sorry, we know you never got past first grade.
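The easy-examples-first idea described above can be sketched in code. This is a minimal illustration, not anything from the Gemini papers: the word-count difficulty score, the staging scheme, and the `staged_batches` helper are all assumptions made up for the example.

```python
# Minimal sketch of curriculum learning: order training examples from
# easy to hard, then train in stages that progressively admit harder
# examples. Difficulty scoring here (word count) is an illustrative
# stand-in, not a real measure from any paper.

def curriculum_order(examples, difficulty):
    """Sort examples by an assumed per-example difficulty score, easiest first."""
    return sorted(examples, key=difficulty)

def staged_batches(examples, difficulty, num_stages=3, batch_size=2):
    """Yield (stage, batch) pairs: stage 1 sees only the easiest slice,
    later stages include a progressively larger (harder) fraction."""
    ordered = curriculum_order(examples, difficulty)
    for stage in range(1, num_stages + 1):
        # Each stage trains on the easiest stage/num_stages fraction.
        cutoff = len(ordered) * stage // num_stages
        pool = ordered[:cutoff]
        for i in range(0, len(pool), batch_size):
            yield stage, pool[i:i + batch_size]

# Toy "texts", with length standing in for difficulty.
texts = [
    "the cat sat",
    "a dog",
    "quantum chromodynamics is the study of the strong interaction",
    "see spot run fast",
]
for stage, batch in staged_batches(texts, difficulty=lambda t: len(t.split())):
    print(stage, batch)
```

The anti-curriculum baseline is just shuffling the same data uniformly; the hypothesis in the conversation is that the staged ordering should beat the shuffle if "learn easy things first" really matters.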
Anyways, before we get into a bunch of interim details, let's get back to the big picture.
there's two threads I want to explore.
First is, I guess it makes me a little worried that there's not even an alternative formulation of what could be happening in these models, one that could invalidate this approach. I mean, we do know that we don't understand intelligence, right?
Like there are definitely unknown unknowns here.
So, like, the fact that there's not a null hypothesis...
I don't know.
I feel like, what if we're just wrong and we don't even know the way in which we're wrong? That actually increases the uncertainty.