Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Andrew Ilyas

๐Ÿ‘ค Speaker
638 total appearances

Appearances Over Time

Podcast Appearances

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

And so the interesting thing I think about that quote is that it can be interpreted in two totally different ways.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

One is you could be like, you have no idea how much data that is.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

Clearly, this emergence of reasoning could happen because of how much data that is.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

The other is that you have no idea how much data that is.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

Anything you see is just going to be in the training data.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

So there's no reasoning or emergence of reasoning necessary.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

It's all just going to be there.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

And I think disentangling between those two things requires looking beyond performance into things like trying to understand where in the training data predictions are coming from, trying to understand what exactly are the steps of a problem that were present in the training data versus what's new.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

And for what's new, where is that new behavior actually coming from?

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

And so one particularly interesting direction, I think, is trying to understand from a data perspective where exactly these surprising, amazing advancements of large language models are coming from, because they're clearly not

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

I think it's slightly reductionist now to say that they're copy pasting from the training data, but that doesn't mean that they're reasoning or it doesn't mean that anything crazy is happening.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

It could just be that they've learned a good enough abstraction to copy paste from the training data in a much more abstract way.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

And then you get into this philosophical discussion of like, is that what humans are doing?

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

And I don't want to think about that.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

Yeah, I think I totally agree.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

And that's a big driving force, I think, behind our work.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

I'm very curious to see what happens when you apply the methods you're developing to these large language models and the ones that we claim are doing reasoning and things like that.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

I think the only thing stopping us right now is just not having quite fast enough methods to actually scale.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

I think there's a really nice paper out of Anthropic that exactly tried to do this for Claude, I believe, to understand where different behaviors are coming from the training data.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

But even then, these methods are so expensive that I don't think they were able to do, for example, actual data counterfactuals.