Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Andrew Ilyas

๐Ÿ‘ค Speaker
638 total appearances

Appearances Over Time

Podcast Appearances

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

So yeah, that was just one of the findings.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

But we did this basically study into what kinds of biases can creep up as a function of how you collect data, not just the data itself.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

And then on the theoretical side, probably my favorite work so far has been this work on understanding what's called self-selection bias.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

So this is, again, very far from the deep learning regime.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

We're just back to doing linear regression or something like that.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

And what self-selection bias looks like is, I think the best way to illustrate it is with an example.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

Let's say you like went to the canonical example from like the 50s is that you go to a village and everyone in the village is either like a hunter or a fisher.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

That's the only two jobs available in the village.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

And you're interested in sort of understanding how do people's features, like their height and their weight and how fast they are, translate into the money that they make from hunting or from fishing.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

And so a very natural approach to this would be you go to the village, you survey people, you record their features, then you run a linear regression for hunting, and you run a linear regression for fishing, and then you're done.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

What people have known for a really long time, though, is that this is a super biased approach.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

It won't actually tell you how important the different features are for your hunting revenue or your fishing revenue.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

And the reason is that all of these villagers had the choice of whether they wanted to do hunting or whether they wanted to do fishing.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

And assuming they're rational people, they chose the thing that they were better at.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

So like you don't get to see how good this hunter is at fishing and you don't get to see how good the fisher is at hunting.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

And like the way they partitioned into those two groups is extremely non-random.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

It's actually based on whichever one would have made them more money.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

And so there's been a ton of work sort of throughout econometrics, economics, stats on like dealing with the self-selection problem.

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

And so

Machine Learning Street Talk (MLST)
Adversarial Examples and Data Modelling - Andrew Ilyas (MIT)

with some collaborators at MIT and Berkeley and my advisor Costas, we basically devised an algorithm, like an efficient algorithm that could recover from self-selection bias, as we called it, in high dimensions.