
Stefano Ermon

👤 Speaker
359 total appearances

Podcast Appearances

The Neuron: AI Explained
Diffusion for Text: Why Mercury Could Make LLMs 10x Faster

Yeah, that's the benefit, I think, of being in academia: everything is open, and you're allowed to publish all of your work. That's the whole point, advancing the field together as a community. I love that aspect, and I think a lot of the researchers do. And I could sense a lot of unhappiness from colleagues and other researchers in industry, working in the big labs, as the publication policies started to tighten and people were not allowed to publish anymore. I think a lot of people were not happy.

Yeah.

At the moment, we're going after what we think of as instant AI: applications of LLMs where latency is critical, which typically means there's a human in the loop and the human cannot wait. And that human could be a developer. So we're seeing a lot of usage of Mercury models in IDEs, where you're essentially providing suggestions or edits to the code directly to a developer. There you maybe have a few hundred milliseconds of latency budget, and you want to provide the best possible suggestion within that budget. But it could also be customer support, voice agents, edtech.
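The latency-budget trade-off described here can be sketched in a few lines: given several candidate models, serve the highest-quality one whose expected latency still fits the budget. This is an illustrative toy, not Inception's actual serving logic; the model names, latency estimates, and quality scores below are made up.

```python
# Hypothetical catalog of suggestion models: name -> (estimated latency in ms, quality score).
# Both numbers are invented for illustration; they are not real benchmarks.
MODELS = {
    "small-fast": (40, 0.70),
    "medium": (120, 0.82),
    "large-slow": (600, 0.95),
}

def best_within_budget(models, budget_ms):
    """Pick the highest-quality model whose estimated latency fits the budget.

    Returns the model name, or None if nothing fits.
    """
    candidates = [(quality, name)
                  for name, (latency_ms, quality) in models.items()
                  if latency_ms <= budget_ms]
    return max(candidates)[1] if candidates else None

# A few hundred milliseconds of budget, as in the IDE example:
print(best_within_budget(MODELS, 200))  # medium
print(best_within_budget(MODELS, 50))   # small-fast
```

The point of a faster decoder (diffusion rather than autoregressive, in Mercury's case) is that it shifts every model's latency estimate down, so a larger, higher-quality model fits inside the same budget.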

I want to ask about that.

Yeah, in any other situation where you have to give an answer, where you have to interact with a human in real time, latency becomes critical, and the game becomes, again, what's the best-quality result you can provide within the latency budget for a reasonable cost. And that's where we dominate existing autoregressive solutions, and that's where we're seeing a lot of the initial traction. I think eventually, as the intelligence of the models keeps improving, as we do more R&D, as we catch up with frontier-quality models, there are going to be more and more applications that we can go after. But right now, we're going after latency-sensitive applications.

Yeah, and you're absolutely right that diffusion does work, and works really well, for speech and music generation. I know that some of the open-source models, and some of the state-of-the-art closed-source models, are based on diffusion for text-to-speech.

I didn't even know that.