Illia Polosukhin

Speaker
1100 total appearances

Podcast Appearances

The Neuron: AI Explained
Illia Polosukhin: Fixing the Broken System He Helped Create

Well, it does that.

And so it's very prone to errors, especially because there were all these fun stories of your model figuring out that it can actually look in the file where the answers are if you give it file system tools.

or search or anything, it actually finds out how to get the answers.

And this is way cheaper and better than actually thinking about stuff.

So this is why we need better training mechanisms.

And that's why, again, from a research perspective, I look at fixed-size models.

Can we make them better?

Because that effectively shows we have a better training procedure.

And I mean, there are actually improvements. A new optimizer came out; effectively, for many years, Adam was the one that was used.

And so now there's this new Muon optimizer, which has different algebraic properties around how gradients are propagated.

And there are probably, I don't know, 50 people in the world who actually understand the exact mechanism of that beyond "it's magic."

But it literally shows that the model just trains faster.

Same data, same model size, and it trains faster, right?

So clearly we haven't squeezed out everything we can from just how we're training these models.

So that's, I would say, a big area for me, and I'm excited about that.

I think the other one is just context.

A lot of these models are actually really good now, but they just don't know stuff.

They don't know stuff about you.

They don't know stuff about your company and the work you're doing.

They don't know things about what's happening in the world right now.