Illia Polosukhin

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

And I've kind of expressed some of them.

352.659 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

I mean, like a few years ago, RL was one of them.

354.441 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

And kind of we saw really kind of improvements coming from reinforcement learning.

357.564 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

I definitely think like there is only that much you can stuff, you know, random articles into a model until, you know, it stops learning.

363.511 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

But what we see is if you take some size of the model, like 8 billion parameter, the quality of that model keeps improving.

373.382 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

Meaning like at the same scale of the parameters, we're getting better at how we train them.

382.637 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

And so that to me is the main, like the way I look at these things is like, hey, let's fix the size and see the progression there.

388.865 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

And that progression defines to me the kind of, are we getting better at improving these models?

396.838 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

Yeah, I mean, well, bigger the problem is it's hard to compare apples to apples, right?

405.892 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

Like, hey, is this model better than what it was three months ago?

411.402 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

It is on some metrics, but if you spend 10x more compute, is that actually the improvement we're looking for?

417.392 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

So to me, the kind of...

427.971 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

One of the next improvements is, in general, better training.

430.816 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

And so IRL is kind of part of it, but IRL is still very spotty.

435.264 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

In general, AI is like alchemy.

441.956 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

I don't know if you've read any of the technical papers, but there's like...

444.561 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

We're using learning rate of 0.01 until step 10,000, and then we switch to 0.01, and then at 100 million steps, we're going to anneal it at rate 2x.

450.492 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

It's like, how did you come up with this?

461.957 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

Where did this come from?

464.402 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

Yeah.

468.11 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment