Illia Polosukhin
One of the next improvements is, in general, better training.
And so RL is kind of part of it, but RL is still very spotty.
In general, AI is like alchemy.
I don't know if you've read any of the technical papers, but there's like...
We're using a learning rate of 0.01 until step 10,000, and then we switch to 0.01, and then at 100 million steps we're going to anneal it at a 2x rate.
It's like, how did you come up with this?
Where did this come from?
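[For illustration only, a minimal Python sketch of the kind of hand-tuned learning-rate schedule being mocked here; the function name and the behavior after step 10,000 are assumptions, not from the conversation or any real training recipe.]

```python
# A minimal sketch (not from the source) of an arbitrary, hand-tuned
# learning-rate schedule: breakpoints and rates picked by trial and error.
def learning_rate(step: int) -> float:
    base_lr = 0.01
    if step < 10_000:
        return base_lr          # flat 0.01 until step 10,000
    if step < 100_000_000:
        return base_lr / 10     # arbitrary drop after step 10,000 (assumed value)
    return base_lr / 10 / 2     # "anneal at a 2x rate" past 100 million steps
```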
Yeah.
Well, it's all kind of half made up and half is from experience.
They were trying to do something.
It didn't work.
They were changing a bunch of stuff until it worked.
And now they're not going to go and redo everything, figuring out if other options work.
They're just going to keep whatever worked.
Yeah.
So part of it is figuring out how to move away from that.
And so RL is even worse.
RL is literally: we have no idea, but hopefully this reward function works. We run it, it works great, ship the paper, ship the model.
So it's all very semi-arbitrary.
There is no actual science around reward distribution and reward propagation.
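[To make the point concrete, a hypothetical Python sketch of the kind of hand-crafted reward function being criticized; the signature and every constant below are invented for illustration and are not from the source.]

```python
# A minimal sketch (not from the source) of an ad hoc RL reward function:
# every weight and bonus is arbitrary, kept only because it "worked".
def reward(solved: bool, num_steps: int, used_tool: bool) -> float:
    r = 1.0 if solved else 0.0   # sparse task-completion reward
    r -= 0.001 * num_steps       # hand-picked step penalty
    if used_tool:
        r += 0.1                 # bonus added during tuning and never revisited
    return r
```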