Andrew Ilyas
And so if that was your mental model, what you would expect is for things to generally get worse as the projection dimension goes down, because you're progressively losing more and more information through these random projections.
What we find, though, is that there's actually this peaking behavior, where for a while, making the random projection smaller actually makes things better and better, and then it starts getting worse.
And so what that indicates is that there's actually some non-compression effect of the random projections that we're doing.
They're adding something.
And so there's a lot of really interesting work out of a couple of groups in the statistics community suggesting that computing the influence function on randomly projected vectors is actually equivalent to computing some sort of regularized influence function on the original vectors.
And so there's some regularization element or something like that happening that we didn't originally account for when we were writing the TRAK paper.
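The effect described above can be sketched numerically. Below is a minimal, illustrative example (not the TRAK implementation): per-example gradients are stand-ins drawn at random, and attribution-style scores are inner products between a test gradient and each training gradient, computed both exactly and after a Johnson-Lindenstrauss-style random projection. The dimensions and the projection scaling are assumptions chosen for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

d = 10_000  # original gradient dimension (hypothetical)
n = 100     # number of training examples (hypothetical)
k = 256     # projection dimension

# Stand-ins for per-example training gradients and one test gradient.
train_grads = rng.standard_normal((n, d))
test_grad = rng.standard_normal(d)

# Johnson-Lindenstrauss-style projection: x -> P @ x, with entries
# scaled by 1/sqrt(k) so that inner products are preserved in expectation.
P = rng.standard_normal((k, d)) / np.sqrt(k)

# Attribution-style scores, before and after projection.
exact_scores = train_grads @ test_grad
projected_scores = (train_grads @ P.T) @ (P @ test_grad)

# Sanity check: the projection approximately preserves the test
# gradient's squared norm (the core JL guarantee).
rel_err = abs(
    np.linalg.norm(P @ test_grad) ** 2 - np.linalg.norm(test_grad) ** 2
) / np.linalg.norm(test_grad) ** 2
```

The surprise in the discussion is that shrinking `k` does not just degrade these scores monotonically; past a point it does, but for a while smaller projections can help, consistent with the regularized-influence-function interpretation mentioned above.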
Yeah, so that's probably my favorite application from the TRAK paper: we took this dataset, which is this brilliantly designed dataset out of MIT.
I think Ekin Akyürek is the lead author on that, in Jacob Andreas' group.
But they designed this really cool dataset called FTRACE.
And the idea behind Ftrace is that they construct this artificial data set where you have a test set that contains a bunch of facts and a train set of Wikipedia abstracts.
And they have annotations for which abstracts entail which facts.
So for example, if your training set contains something like "France is a country whose capital is Paris," and then your test set asks "What's the capital of France? Paris," that pair would be labeled as a logical entailment.
And so the really nice thing about this dataset is that it gives you a technique for evaluating different data attribution methods. If I take the test example "What's the capital of France? Paris" and look at which training examples the method ranks as most important for it, those should be the abstracts that entail that fact.
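The evaluation loop just described can be sketched as follows. Everything here is a toy stand-in: the abstracts, the entailment annotations, and the `scores` matrix are made up, whereas in the real benchmark the scores would come from a data attribution method (or an information-retrieval baseline). The quality metric used here, mean reciprocal rank of the first entailing abstract, is one reasonable choice, not necessarily the paper's exact metric.

```python
import numpy as np

# Hypothetical training abstracts (the candidate "proponents").
train_abstracts = [
    "France is a country whose capital is Paris.",         # idx 0
    "Berlin has been the capital of Germany since 1990.",  # idx 1
    "The Seine flows through Paris.",                      # idx 2
]

# Ground-truth annotations: which training examples entail each test fact.
entails = {
    "What is the capital of France? Paris": {0},
    "What is the capital of Germany? Berlin": {1},
}
test_facts = list(entails)

# Hypothetical attribution scores: rows = test facts, cols = train examples.
scores = np.array([
    [0.9, 0.1, 0.4],
    [0.2, 0.8, 0.1],
])

# Mean reciprocal rank of the best-ranked entailing abstract per fact:
# a good attribution method should put entailing abstracts near the top.
rrs = []
for i, fact in enumerate(test_facts):
    ranking = np.argsort(-scores[i])  # training indices, best-first
    positions = np.where(np.isin(ranking, list(entails[fact])))[0]
    best_rank = int(positions.min()) + 1  # 1-indexed rank
    rrs.append(1.0 / best_rank)
mrr = float(np.mean(rrs))
print(mrr)  # 1.0 here: the entailing abstract is ranked first for both facts
```

Swapping in real attribution scores versus, say, BM25 retrieval scores is exactly the comparison discussed next.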
Yeah, so in practice, what they found in their paper, which came out a while before TRAK, is that an information retrieval system beat every single data attribution baseline they tried.
In particular, the data attribution methods did not work well.