Stefano Ermon
Yeah.
And so, you know, it does make a lot of sense.
One of the challenges is that, if you wanted to go straight from voice to voice, these kinds of interactions often still involve tool calls.
So if you're doing customer support, you might still need to be able to query a database, or check a calendar for availabilities, or look up the menu to get the prices.
And so there still needs to be some text, I think, some code involved, which makes it a little bit more tricky to develop.
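To make this concrete, here is a minimal sketch of the pattern being described: even in a voice pipeline, the model emits a tool call as structured text, the application runs it, and the text result is fed back for the model to speak. The function names (`check_calendar`, `handle_tool_call`) and the JSON shape are illustrative assumptions, not any specific product's API.

```python
# Sketch: a voice agent still routes through text for tool calls.
# All names and schemas here are hypothetical, for illustration only.
import json

def check_calendar(date: str) -> list[str]:
    """Stand-in for a real calendar lookup."""
    slots = {"2025-01-15": ["10:00", "14:30"]}
    return slots.get(date, [])

def handle_tool_call(call_json: str) -> str:
    """Dispatch a model-emitted tool call (JSON text) and return
    the result as text for the model to turn back into speech."""
    call = json.loads(call_json)
    if call["name"] == "check_calendar":
        result = check_calendar(call["arguments"]["date"])
    else:
        result = "unknown tool"
    return json.dumps(result)

print(handle_tool_call(
    '{"name": "check_calendar", "arguments": {"date": "2025-01-15"}}'
))
```

The point of the sketch is that the JSON in and out is plain text, which is why some text or code handling stays in the loop even for a voice-to-voice product.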
But we are very excited about eventually getting to something that is actually multimodal.
The existing Mercury models are just text only or code only.
But we know these kinds of models work really well for image, video, and music.
And so if we put everything together, we could get to something truly phenomenal that handles different kinds of modalities, a real world model that understands everything and puts together the learnings and the signals from all the different modalities. It's definitely something we want to do at some point.

Yeah, that would be so awesome. Would that be something that would be useful for, like, a robotic kind of situation? Or would it be more for, like, simulations that you can use to train robots, in your opinion?

It could be a mix. It could be decision making, like if you're using video or other kinds of sensors as input, and then use the model to make decisions or
kind of like analyze what's going on in the surroundings.
It's a very useful kind of application of this technology.
In fact, we've already heard it from some early adopters that they would love for our models to have image
inputs because they're building computer agents.
And so that's another space where you really need to be quick.
You need to be able to interact fast with whatever software, whatever application the agent interacts with.
But it's important not to just look at the text and the HTML code of a web page, let's say, but to actually see what's happening.
And so that would open up a lot of other applications, I think.
For computer use, no, it would be more like controlling the actions.