Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing

Tamay Besiroglu

๐Ÿ‘ค Person
878 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
AGI is Still 30 Years Away โ€” Ege Erdil & Tamay Besiroglu

I think if you look at their capability profile, if you compare it to a random job in the economy, I agree they are better at

Dwarkesh Podcast
AGI is Still 30 Years Away โ€” Ege Erdil & Tamay Besiroglu

doing sort of coding tasks that would be involved in R&D compared to a random job in the economy.

Dwarkesh Podcast
AGI is Still 30 Years Away โ€” Ege Erdil & Tamay Besiroglu

But in absolute terms, I don't think they're that good.

Dwarkesh Podcast
AGI is Still 30 Years Away โ€” Ege Erdil & Tamay Besiroglu

I think they are good at things that maybe impress us about human coders.

Dwarkesh Podcast
AGI is Still 30 Years Away โ€” Ege Erdil & Tamay Besiroglu

If you wanted to see what makes a person a really impressive coder, you might look at their competitive programming performance.

Dwarkesh Podcast
AGI is Still 30 Years Away โ€” Ege Erdil & Tamay Besiroglu

I mean, in fact, companies often hire people based on, if they're relatively junior, based on their performance on these kinds of problems.

Dwarkesh Podcast
AGI is Still 30 Years Away โ€” Ege Erdil & Tamay Besiroglu

But that is just impressive in the human distribution.

Dwarkesh Podcast
AGI is Still 30 Years Away โ€” Ege Erdil & Tamay Besiroglu

So if you look in absolute terms at what are the skills you need to actually automate the process of being a researcher, then what fraction of those skills do the AI systems actually have, even in coding?

Dwarkesh Podcast
AGI is Still 30 Years Away โ€” Ege Erdil & Tamay Besiroglu

Like a lot of coding is you have a very large code base you have to work with.

Dwarkesh Podcast
AGI is Still 30 Years Away โ€” Ege Erdil & Tamay Besiroglu

The instructions are very kind of vague.

Dwarkesh Podcast
AGI is Still 30 Years Away โ€” Ege Erdil & Tamay Besiroglu

There isn't, for example, you mentioned a meter eval in which because they needed to make it an eval, all the tasks have to be kind of compact and closed and have clear instructions

Dwarkesh Podcast
AGI is Still 30 Years Away โ€” Ege Erdil & Tamay Besiroglu

evaluation metrics, like here's a model, get its loss on this data set as low as possible, or whatever.

Dwarkesh Podcast
AGI is Still 30 Years Away โ€” Ege Erdil & Tamay Besiroglu

Or here's another model and its embedding matrix has been scrambled, just fix it to recover most of its original performance, et cetera.

Dwarkesh Podcast
AGI is Still 30 Years Away โ€” Ege Erdil & Tamay Besiroglu

Those are not problems that you actually work on in AI R&D.

Dwarkesh Podcast
AGI is Still 30 Years Away โ€” Ege Erdil & Tamay Besiroglu

They're very artificial problems.

Dwarkesh Podcast
AGI is Still 30 Years Away โ€” Ege Erdil & Tamay Besiroglu

Now, if a human was good at doing those problems, you would infer, I think logically, that that human is likely to actually be a good researcher.

Dwarkesh Podcast
AGI is Still 30 Years Away โ€” Ege Erdil & Tamay Besiroglu

But if an AI is able to do them, like the AI lacks so many other competences that a human would have, not just a researcher, just an ordinary human that we don't think about in the process of research.

Dwarkesh Podcast
AGI is Still 30 Years Away โ€” Ege Erdil & Tamay Besiroglu

So our view would be automating research is, first of all, more difficult than people give it credit for.

Dwarkesh Podcast
AGI is Still 30 Years Away โ€” Ege Erdil & Tamay Besiroglu

I think you need more skills to do it

Dwarkesh Podcast
AGI is Still 30 Years Away โ€” Ege Erdil & Tamay Besiroglu

and definitely more than models are displaying right now.