Dwarkesh Patel

👤 Speaker

15656 total appearances


Podcast Appearances

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You two did some very interesting things about this.

2407.735

Conceptually, how should we think about the way that humans are able to build a rich world model just from interacting with our environment, in a way that seems almost irrespective of the final reward at the end of the episode?

2410.338

If somebody's starting to start a business and at the end of 10 years she finds out whether the business succeeded or failed,

2425.015

We say that she's earned a bunch of wisdom and experience, but it's not because like the log probs of every single thing that happened over the last 10 years are up-weighted or down-weighted.

2431.162

It's something much more deliberate and rich is happening.

2438.37

What is the ML analogy, and how does that compare to what we're doing with LLMs right now?

2442.634

But you're so good at coming up with evocative phrases.

2693.686

Sucking supervision through a straw is, like, so good.

2697.811

Why hasn't—so you're saying, like, your problem with outcome-based reward is that you have this huge trajectory, and then at the end, you're trying to learn every single possible thing about what you should do and what you should learn about the world from that one final bit.

2702.617

Why hasn't—given the fact that this is obvious—why hasn't process-based supervision—

2718.097

as an alternative been a successful way to make models more capable?

2722.002

What has been preventing us from using this alternative paradigm?

2725.887

You're basically training the LLM to be a prompt injection model.

2871.679

So to the extent you think this is the bottleneck to making RL more functional, then that will require making LLMs better judges if you want to do this in an automated way.

2886.803

And then so is it just going to be like some sort of GAN-like approach where you had to train models to be more robust?

2896.063

Interesting.

2936.062

Do you have some shape of what the other idea could be?

2936.743

Yeah.

2977.287

So I guess I see a very, not easy, but like I can conceptualize how you would be able to train on synthetic examples or synthetic problems that you have made for yourself.

2977.888

But there seems to be another thing humans do, maybe sleep is this, maybe daydreaming is this, which is not necessarily come up with fake problems, but just like reflect.

2988.439
