Illia Polosukhin

👤 Speaker

552 total appearances

Appearances Over Time

Podcast Appearances

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

Well, it's all kind of half made up and half is from experience.

468.972 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

They were trying to do something.

472.621 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

It didn't work.

474.025 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

They were changing a bunch of stuff until it worked.

474.867 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

And now they're not going to go and redo everything, figuring out if other options work.

477.454 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

They're just going to keep whatever worked.

483.409 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

Yeah.

485.744 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

And so like figuring out how to like go away from that.

486.706 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

And so RL is even worse.

489.871 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

RL is like literally, you know, we have no idea, but you know, hopefully like this reward function works, you know, we run it, it works great, you know, ship the paper, ship the model.

491.174 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

So it's a very like kind of semi-arbitrary.

504.678 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

There is no like actual science around reward distribution and kind of reward provocation.

507.864 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

Well, it does that.

521.468 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

It's also like, and so it's very prone to like errors because especially like there was like all this fun stories of, you know, your model figuring out that actually it can look in the file where the answers are if you give it like file system tools.

522.651 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

or search or anything, it actually finds out how to get the answers.

537.044 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

And this is way cheaper and better than actually thinking about stuff.

540.668 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

So this is why we kind of need a better kind of training mechanisms.

544.032 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

And that's why, again, from a research perspective, I look at fixed size model.

550.879 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

Can we make them better?

555.905 View full episode →

The Neuron: AI Explained

Illia Polosukhin: Fixing the Broken System He Helped Create

Because that effectively shows we have a better training procedure.

557.286 View full episode →

← Previous Page 4 of 28 Next →

Report any issue