Andrew Ilyas
Oftentimes people will just say yes.
And so there are, for example, a lot of misclassified inputs in the ImageNet dataset.
But I think what's even more interesting is these ambiguous classes that we found.
So for example, I'm going to cite this wrong, but I think "suit" and "tie" are two different ImageNet classes.
And these each have their own images, but most of the images in "suit" show someone wearing a suit with a tie.
And most of the images in "tie" are someone wearing a tie with a suit.
And so if you show these to humans, humans have a very hard time distinguishing between these two classes.
But the interesting thing is that models trained on ImageNet
are much better than random at telling these two classes apart.
And so what that means is that not only did this sort of bias creep in at the data collection stage, but models have actually picked up on this bias, and they're really good at exploiting it.
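To make that concrete, here's a minimal sketch of how you might check this yourself with a pretrained torchvision model: restrict its prediction to just the two ambiguous classes and compare against the 50% chance baseline. The class indices and the `ambiguous_pairs/` folder layout here are illustrative assumptions, not anything from the original discussion.

```python
import torch
from torch.utils.data import DataLoader
from torchvision import models, transforms
from torchvision.datasets import ImageFolder

# Hypothetical ImageNet class indices for the two ambiguous classes;
# verify against the actual class-index file before trusting results.
SUIT_IDX, TIE_IDX = 834, 906

preprocess = transforms.Compose([
    transforms.Resize(256),
    transforms.CenterCrop(224),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225]),
])

# Assumed layout: ambiguous_pairs/suit/*.jpg and ambiguous_pairs/tie/*.jpg.
# ImageFolder assigns labels alphabetically, so suit -> 0 and tie -> 1,
# matching the [SUIT_IDX, TIE_IDX] column order below.
dataset = ImageFolder("ambiguous_pairs/", transform=preprocess)
loader = DataLoader(dataset, batch_size=32)

model = models.resnet50(weights=models.ResNet50_Weights.IMAGENET1K_V2).eval()

correct, total = 0, 0
with torch.no_grad():
    for images, labels in loader:
        logits = model(images)
        # Force a two-way decision between just the ambiguous classes.
        preds = logits[:, [SUIT_IDX, TIE_IDX]].argmax(dim=1)
        correct += (preds == labels).sum().item()
        total += labels.numel()

print(f"two-way accuracy: {correct / total:.2%} (chance = 50%)")
```

If the two-way accuracy lands well above 50% on images that humans can't tell apart, the model has learned whatever cue separated the two classes' images at collection time.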
Yeah, absolutely.
So there's this lexical database called WordNet.
And they basically went down the WordNet hierarchy.
They sampled 1,000 leaf nodes of this hierarchy.
And then for each of those leaf nodes, they generated some search terms.
They searched for those terms on Flickr.
They took all the images they could find.
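As a rough sketch of that WordNet step, here's how you could walk a hyponym subtree with NLTK and turn leaf synsets into search queries. The actual ImageNet pipeline had its own tooling and query-expansion details, so treat this purely as an illustration.

```python
# Requires: pip install nltk, then nltk.download("wordnet") once.
from nltk.corpus import wordnet as wn

def leaf_synsets(root):
    """Yield the synsets under `root` that have no hyponyms (the leaves)."""
    children = root.hyponyms()
    if not children:
        yield root
    for child in children:
        yield from leaf_synsets(child)

# Example subtree; the real pipeline sampled on the order of 1,000 leaves
# across the whole hierarchy.
root = wn.synset("dog.n.01")
for leaf in list(leaf_synsets(root))[:5]:
    # Lemma names like "golden_retriever" become queries like "golden retriever".
    queries = [name.replace("_", " ") for name in leaf.lemma_names()]
    print(leaf.name(), "->", queries)
```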
They uploaded each of those images to Mechanical Turk with this yes/no, selection-frequency-type question.
They showed each image to multiple annotators and kept the images for which enough annotators agreed that, yes, this is a good image of this class.
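Here's a minimal sketch of that agreement-based filtering, assuming we already have per-image yes/no votes; the 0.7 threshold is an arbitrary stand-in, not ImageNet's actual cutoff.

```python
def filter_by_selection_frequency(votes, threshold=0.7):
    """Keep image IDs whose fraction of 'yes' votes meets the threshold.

    `votes` maps image ID -> list of per-annotator booleans.
    """
    kept = []
    for image_id, answers in votes.items():
        selection_frequency = sum(answers) / len(answers)
        if selection_frequency >= threshold:
            kept.append(image_id)
    return kept

votes = {
    "img_001.jpg": [True, True, True, False],    # 0.75 -> kept
    "img_002.jpg": [True, False, False, False],  # 0.25 -> dropped
}
print(filter_by_selection_frequency(votes))  # ['img_001.jpg']
```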