Dwarkesh Patel

Speaker
15656 total appearances

Podcast Appearances

Dwarkesh Podcast
Andrej Karpathy - AGI is still a decade away

So is it surprising that they aren't able to integrate that? Whenever you're like, "add RoPE embeddings" or something, they do that in the wrong way?

Yeah.

Actually, here's another reason why this is really interesting.

Through the history of programming, there's been many productivity improvements (compilers, linting, better programming languages, etc.) which have increased programmer productivity but have not led to an explosion.

So that sounds very much like autocomplete tab.

And this other category is just like automation of the programmer.

And so it's interesting you're seeing more in the category of the historical analogies of better compilers or something.

One of the big problems with RL is that it's incredibly information sparse.

Labelbox can help you with this by increasing the amount of information that your agent gets to learn from with every single episode.

For example, one of their customers wanted to train a coding agent.

So Labelbox augmented an IDE with a bunch of extra data collection tools and staffed a team of expert software engineers from their Alignerr network to generate trajectories that were optimized for training.

Now, obviously these engineers evaluated these interactions on a pass-fail basis, but they also rated every single response on a bunch of different dimensions like readability and performance.

And they wrote down their thought processes for every single rating that they gave.

So you're basically showing every single step an engineer takes and every single thought that they have while they're doing their job.

And this is just something you could never get from usage data alone.

And so Labelbox packaged up all these evaluations and included all the agent trajectories and the corrective human edits for the customer to train on.

This is just one example, so go check out how Labelbox can get you high-quality frontier data across domains, modalities, and training paradigms.

Reach out at labelbox.com/dwarkesh.

Let's talk about RL a bit.