Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Francois Chollet

👤 Speaker
See mentions of this person in podcasts
649 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Francois Chollet, Mike Knoop - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution

So I think you want to add a code interpreter to the system.

Dwarkesh Podcast
Francois Chollet, Mike Knoop - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution

I think that's great.

Dwarkesh Podcast
Francois Chollet, Mike Knoop - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution

That's totally legitimate.

Dwarkesh Podcast
Francois Chollet, Mike Knoop - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution

The part that would be cheating is try to...

Dwarkesh Podcast
Francois Chollet, Mike Knoop - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution

anticipate what might be in the test set like brute force the space of possible tasks and then train a memorization system on it and then rely on the fact that you're generating so many tasks like millions and millions and millions that inevitably there's going to be some overlap between what you're generating and what's in the test set.

Dwarkesh Podcast
Francois Chollet, Mike Knoop - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution

I think that's defeating the purpose of benchmark because then you can just solve it without any need to adapt just by fetching a memorized solution.

Dwarkesh Podcast
Francois Chollet, Mike Knoop - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution

So hopefully Arc will resist to that, but you know, nothing, no benchmark is necessarily perfect.

Dwarkesh Podcast
Francois Chollet, Mike Knoop - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution

So maybe there's a way to hack it.

Dwarkesh Podcast
Francois Chollet, Mike Knoop - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution

And I guess we are going to get an answer very soon.

Dwarkesh Podcast
Francois Chollet, Mike Knoop - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution

You want to input core knowledge, like arc-like core knowledge into the model, but surely you don't need tens of millions of tasks to do this.

Dwarkesh Podcast
Francois Chollet, Mike Knoop - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution

Like core knowledge is extremely basic.

Dwarkesh Podcast
Francois Chollet, Mike Knoop - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution

I would definitely file that under core knowledge.

Dwarkesh Podcast
Francois Chollet, Mike Knoop - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution

Like core knowledge includes basic physics, for instance, bouncing or trajectories.

Dwarkesh Podcast
Francois Chollet, Mike Knoop - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution

That would be included.

Dwarkesh Podcast
Francois Chollet, Mike Knoop - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution

But yeah, I think you're entirely right.

Dwarkesh Podcast
Francois Chollet, Mike Knoop - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution

The reason why as a human you're able to quickly figure out the solution is because you have this set of building blocks, this set of patterns in your mind that you can recombine.

Dwarkesh Podcast
Francois Chollet, Mike Knoop - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution

Core knowledge can be learned.

Dwarkesh Podcast
Francois Chollet, Mike Knoop - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution

And I think in the case of humans, some amount of core knowledge is something that you're born with.

Dwarkesh Podcast
Francois Chollet, Mike Knoop - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution

Like we're actually born with a small amount of knowledge about the world we're going to live in.

Dwarkesh Podcast
Francois Chollet, Mike Knoop - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution

We are not blank slates.