Andrew Ilyas
At the time, the space of adversarial example attacks was much less mature than it is now.
There were fast gradient sign method-based attacks, which just take a single step in the direction of the sign of the gradient.
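As a rough illustration of that single-step idea, here is a minimal FGSM-style sketch in PyTorch; `model`, `image`, and `label` are hypothetical placeholders, not anything from the work discussed here.

```python
import torch
import torch.nn.functional as F

def fgsm_attack(model, image, label, epsilon=0.03):
    """Single-step fast gradient sign method: move the input by epsilon
    in the direction of the sign of the loss gradient."""
    image = image.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(image), label)
    loss.backward()
    # One signed gradient step, clipped back to the valid pixel range
    return (image + epsilon * image.grad.sign()).clamp(0, 1).detach()
```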
There were a couple of gradient-based attacks.
And this field of gradient-free or black box attacks was just emerging.
And the idea here is that, OK, if you're an adversary and you're trying to attack a production machine learning system, there's no way that that production machine learning system is going to be like, here are our model weights.
Take some gradients.
Go ahead.
What you usually have is an API, some kind of thing where you can query it with an image and then it'll reply with, here are the labels.
And generally, under that threat model where you have only query access to a machine learning system, existing methods couldn't do adversarial attacks well.
So there were sort of two lines of work emerging as we started that black box adversarial attacks work.
One was on using adversarial transferability.
So the idea is that if you wanted to attack a production model, what you should do is train your own model locally, attack that model, and then deploy the attacked image against the production system and see what happens.
And so that was interesting, but sort of separate from what we were interested in.
We wanted to see if we could really attack the system directly.
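A rough sketch of that transfer recipe, reusing the hypothetical `fgsm_attack` helper above; `local_model` and `production_api` are illustrative stand-ins for a locally trained surrogate and the query-only system.

```python
# Craft the adversarial image against a locally trained surrogate model,
# then send it to the black-box production system and check its answer.
adversarial_image = fgsm_attack(local_model, image, label, epsilon=0.03)
production_prediction = production_api(adversarial_image)
print("Production model predicts:", production_prediction)
```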
And so there was one really nice piece of work out at the time called, I think, ZOO, which was a zeroth-order optimization-based attack.
And what they basically did is estimate the gradient component-wise using a zeroth-order gradient estimator, which is basically: you start with your image.
For each pixel, you add epsilon.
You subtract epsilon.
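A minimal sketch of that coordinate-wise, zeroth-order estimate, assuming a hypothetical `query_loss` function that returns a scalar loss from the black-box API (an illustration of the finite-difference idea, not the actual ZOO implementation):

```python
import numpy as np

def zeroth_order_gradient(query_loss, image, epsilon=1e-4):
    """Estimate the gradient of a black-box loss coordinate by coordinate
    with symmetric finite differences: query at +epsilon and -epsilon."""
    flat = image.ravel().astype(np.float64)
    grad = np.zeros_like(flat)
    for i in range(flat.size):
        plus, minus = flat.copy(), flat.copy()
        plus[i] += epsilon
        minus[i] -= epsilon
        # Two black-box queries per pixel
        grad[i] = (query_loss(plus.reshape(image.shape)) -
                   query_loss(minus.reshape(image.shape))) / (2 * epsilon)
    return grad.reshape(image.shape)
```

Note the cost: two queries per pixel for every gradient estimate, which adds up quickly for even modestly sized images.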