Stefano Ermon

The Neuron: AI Explained

Diffusion for Text: Why Mercury Could Make LLMs 10x Faster

So diffusion is a type of generative AI model.

232.872 View full episode →

The Neuron: AI Explained

Diffusion for Text: Why Mercury Could Make LLMs 10x Faster

It's the kind of model that is commonly used to generate images, video, music.

237.778 View full episode →

The Neuron: AI Explained

Diffusion for Text: Why Mercury Could Make LLMs 10x Faster

And you're probably familiar with the, you know, ChatGPTs or Geminis or Cloud, where you kind of like see the models generate text, kind of like left to right, one token at a time.

243.586 View full episode →

The Neuron: AI Explained

Diffusion for Text: Why Mercury Could Make LLMs 10x Faster

A diffusion model works very differently in the sense that it generates the full object from the beginning, and then it refines it by kind of like

253.699 View full episode →

The Neuron: AI Explained

Diffusion for Text: Why Mercury Could Make LLMs 10x Faster

fixing mistakes, making it sharper, making it look better and better.

261.121 View full episode →

The Neuron: AI Explained

Diffusion for Text: Why Mercury Could Make LLMs 10x Faster

And it's a very different kind of like solution that it's more parallel in the sense that the neural network is able to modify many components of the image or the text at the same time.

264.324 View full episode →

The Neuron: AI Explained

Diffusion for Text: Why Mercury Could Make LLMs 10x Faster

And that's why diffusion models tend to be a lot faster than traditional autoregressive models that kind of like work left to right one token at a time.

277.037 View full episode →

The Neuron: AI Explained

Diffusion for Text: Why Mercury Could Make LLMs 10x Faster

Yeah.

297.168 View full episode →

The Neuron: AI Explained

Diffusion for Text: Why Mercury Could Make LLMs 10x Faster

That's a great question.

298.249 View full episode →

The Neuron: AI Explained

Diffusion for Text: Why Mercury Could Make LLMs 10x Faster

Yeah, that's a great question.

299.551 View full episode →

The Neuron: AI Explained

Diffusion for Text: Why Mercury Could Make LLMs 10x Faster

And it really stems from the way the models are trained.

300.693 View full episode →

The Neuron: AI Explained

Diffusion for Text: Why Mercury Could Make LLMs 10x Faster

A traditional autoregressive model, like a GPT model, is trained to, there is a neural network and it's trained to predict the next token, the next word.

304.479 View full episode →

The Neuron: AI Explained

Diffusion for Text: Why Mercury Could Make LLMs 10x Faster

And that's how you use it at inference time.

312.351 View full episode →

The Neuron: AI Explained

Diffusion for Text: Why Mercury Could Make LLMs 10x Faster

You give it a question and then it will try to predict the answer left to right, one token at a time.

314.26 View full episode →

The Neuron: AI Explained

Diffusion for Text: Why Mercury Could Make LLMs 10x Faster

The fusion language model, it's trained to remove mistakes, fix mistakes.