Andrew Ilyas
Yeah, absolutely.
I think, you know, machine teaching, I've heard data set distillation as well.
Oh, yeah.
And, you know, coreset finding, I think, is another name.
But I think there are a variety of these applications where what we really care about is like narrowing down the data or like preserving the information that's present in the data set while cutting down on the number of data points.
And so I think it's a really interesting complementary goal to data modeling, or almost one that could benefit from data models being used.
And I'm sure we'll talk about this later, but I think that where I'm most excited about things like data models being used is in tasks where you can write down an optimization function in terms of your training data.
So if you can really write down like, you know, I want to minimize over my training set some function of model predictions, anything that looks like that, I think, you know, just plugging in what something like data models predicts will happen can be a really powerful primitive.
And so I'd be excited to see what happens for machine teaching, for example.
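To make that concrete, here is a minimal sketch of the kind of primitive being described, assuming a linear datamodel has already been fit (the fitting procedure comes up below). Everything here is hypothetical: `theta` stands in for fitted datamodel weights, and maximizing the predicted output over size-k subsets is just one simple instance of an objective written in terms of the training data.

```python
import numpy as np

# Hypothetical fitted linear datamodel for a single target example:
# theta[i] estimates how much including training point i changes the
# model's output on that target; theta0 is the intercept.
rng = np.random.default_rng(0)
n_train = 1_000
theta = rng.normal(size=n_train)  # stand-in for real fitted weights
theta0 = 0.5

def predicted_output(mask: np.ndarray) -> float:
    """Datamodel prediction for training on the subset encoded by a 0/1 mask."""
    return float(theta @ mask + theta0)

# With a linear surrogate, "choose k training points to maximize the
# predicted output on the target" is solved exactly by taking the
# top-k weights -- no retraining required.
k = 100
chosen = np.argsort(theta)[-k:]
mask = np.zeros(n_train)
mask[chosen] = 1.0
print(f"predicted output on target: {predicted_output(mask):.3f}")
```

For something like machine teaching or coreset selection, you would plug in whatever function of the predictions you care about; the point is that the datamodel makes that function cheap to query without retraining.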
Yeah, I would say it works surprisingly well.
It's obviously not perfect, but across a bunch of data sets we've tried, what you can look at to evaluate data models is basically the correlation between what your data model says is going to happen and reality, which you can access by just training the model yourself on that data set.
And so there are two sort of interesting ways of evaluating these.
One is that we fit the data models by sampling a bunch of random data sets, training a model on each one, and then fitting basically a linear regression from the 0/1 encoding of the data set we trained on to the model's output on a specific target.
It doesn't have to be a linear regression, but in our paper we did a linear regression. And the nice thing is that this is just a linear regression problem: you can sample a bunch of new subsets, see what your data model predicts, measure reality, and compute the correlation. So, as I was saying, there are two interesting ways of evaluating this. One is an on-distribution evaluation, where you sample data sets from the same distribution of data sets that you used to fit the parameters.
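As a rough end-to-end sketch of what's being described (sample random subsets, get the model's output on a fixed target for each, regress those outputs on the 0/1 subset encodings, then check correlation on fresh subsets): the expensive per-subset training step is replaced here by a synthetic `train_and_eval` stand-in so the sketch runs, and plain Pearson correlation is used where the conversation just says "correlation".

```python
import numpy as np
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(0)
n_train = 1_000   # size of the full training set
m = 5_000         # number of random subsets used to fit the datamodel
alpha = 0.5       # inclusion probability for each training point

# Hypothetical stand-in for the expensive step: train a model on the
# subset encoded by `mask` and return its output on one fixed target
# example. A synthetic linear-plus-noise ground truth fakes it here so
# the sketch runs end to end.
true_effect = rng.normal(size=n_train) / np.sqrt(n_train)

def train_and_eval(mask: np.ndarray) -> float:
    return float(mask @ true_effect + rng.normal(scale=0.1))

# Fit: a linear regression from 0/1 subset encodings to model outputs.
masks = (rng.random((m, n_train)) < alpha).astype(float)
outputs = np.array([train_and_eval(mask) for mask in masks])
datamodel = LinearRegression().fit(masks, outputs)

# On-distribution evaluation: sample fresh subsets from the same
# distribution and compare datamodel predictions against "reality".
test_masks = (rng.random((200, n_train)) < alpha).astype(float)
predictions = datamodel.predict(test_masks)
reality = np.array([train_and_eval(mask) for mask in test_masks])
print("held-out correlation:", np.corrcoef(predictions, reality)[0, 1])
```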
And there, data models do amazingly well, way better than we ever would have thought.
On CIFAR-10, for example, the correlation between data model predictions and reality is 0.9 or something like that, and the relationship is strikingly linear, I think.
Then you can go to non-random data sets, because I think the more interesting question is: does this work once you leave the distribution of data sets that you sampled from?
The answer is yes, but slightly worse.
So the correlation degrades a little bit.
But trend-wise it holds up; we tried a bunch of different ways of breaking the data model.