Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing

Illia Polosukhin

👤 Person
552 total appearances

Appearances Over Time

Podcast Appearances

The Neuron: AI Explained
Illia Polosukhin: Fixing the Broken System He Helped Create

And I've kind of expressed some of them.

The Neuron: AI Explained
Illia Polosukhin: Fixing the Broken System He Helped Create

I mean, like a few years ago, RL was one of them.

The Neuron: AI Explained
Illia Polosukhin: Fixing the Broken System He Helped Create

And kind of we saw really kind of improvements coming from reinforcement learning.

The Neuron: AI Explained
Illia Polosukhin: Fixing the Broken System He Helped Create

I definitely think like there is only that much you can stuff, you know, random articles into a model until, you know, it stops learning.

The Neuron: AI Explained
Illia Polosukhin: Fixing the Broken System He Helped Create

But what we see is if you take some size of the model, like 8 billion parameter, the quality of that model keeps improving.

The Neuron: AI Explained
Illia Polosukhin: Fixing the Broken System He Helped Create

Meaning like at the same scale of the parameters, we're getting better at how we train them.

The Neuron: AI Explained
Illia Polosukhin: Fixing the Broken System He Helped Create

And so that to me is the main, like the way I look at these things is like, hey, let's fix the size and see the progression there.

The Neuron: AI Explained
Illia Polosukhin: Fixing the Broken System He Helped Create

And that progression defines to me the kind of, are we getting better at improving these models?

The Neuron: AI Explained
Illia Polosukhin: Fixing the Broken System He Helped Create

Yeah, I mean, well, bigger the problem is it's hard to compare apples to apples, right?

The Neuron: AI Explained
Illia Polosukhin: Fixing the Broken System He Helped Create

Like, hey, is this model better than what it was three months ago?

The Neuron: AI Explained
Illia Polosukhin: Fixing the Broken System He Helped Create

It is on some metrics, but if you spend 10x more compute, is that actually the improvement we're looking for?

The Neuron: AI Explained
Illia Polosukhin: Fixing the Broken System He Helped Create

So to me, the kind of...

The Neuron: AI Explained
Illia Polosukhin: Fixing the Broken System He Helped Create

One of the next improvements is, in general, better training.

The Neuron: AI Explained
Illia Polosukhin: Fixing the Broken System He Helped Create

And so IRL is kind of part of it, but IRL is still very spotty.

The Neuron: AI Explained
Illia Polosukhin: Fixing the Broken System He Helped Create

In general, AI is like alchemy.

The Neuron: AI Explained
Illia Polosukhin: Fixing the Broken System He Helped Create

I don't know if you've read any of the technical papers, but there's like...

The Neuron: AI Explained
Illia Polosukhin: Fixing the Broken System He Helped Create

We're using learning rate of 0.01 until step 10,000, and then we switch to 0.01, and then at 100 million steps, we're going to anneal it at rate 2x.

The Neuron: AI Explained
Illia Polosukhin: Fixing the Broken System He Helped Create

It's like, how did you come up with this?

The Neuron: AI Explained
Illia Polosukhin: Fixing the Broken System He Helped Create

Where did this come from?

The Neuron: AI Explained
Illia Polosukhin: Fixing the Broken System He Helped Create

Yeah.