Stefano Ermon
And so the costs are actually significantly lower.
Yeah, so that really depends more on the architecture than whether it's a diffusion model or an autoregressive model.
Right now, as I mentioned, we're still using self-attention, which unfortunately scales pretty poorly with the context length.
So I would say there is no difference.
It's neither better nor worse than an autoregressive model as you think about longer contexts.
Our models support roughly 100K tokens of context length.
We could potentially scale that up more.
Again, it's not something that is very different.
If you think about an autoregressive model versus a diffusion model, it's more a function of the underlying architecture.
And in fact, we can actually use alternative architectures that scale better with respect to the context like state-space models or other attention variants that are more efficient.
We have some preliminary results showing that everything is compatible with different kinds of backbones, but nothing is in production at the moment.
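The scaling contrast described here can be sketched with a back-of-the-envelope cost model. This is purely illustrative (the cost functions and the hidden dimension are assumptions, not Ermon's numbers): per layer, self-attention compares every token with every other token, so its cost grows quadratically with context length, while a state-space-style recurrence updates a fixed-size state once per token and grows only linearly.

```python
def self_attention_cost(n, d):
    # Self-attention scores every pair of tokens:
    # roughly O(n^2 * d) multiply-adds per layer.
    return n * n * d

def state_space_cost(n, d):
    # A state-space / linear-recurrence layer updates a fixed-size
    # state once per token: roughly O(n * d^2) multiply-adds.
    return n * d * d

d = 1024  # hidden dimension (illustrative choice)
for n in (1_000, 10_000, 100_000):  # context lengths up to ~100K tokens
    ratio = self_attention_cost(n, d) / state_space_cost(n, d)
    print(f"n={n:>7}: attention costs ~{ratio:.0f}x the recurrent layer")
```

At 1K tokens the two are comparable; at 100K tokens the quadratic term dominates by roughly two orders of magnitude, which is why longer contexts push toward state-space models or more efficient attention variants regardless of whether the model is diffusion-based or autoregressive.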
Nothing particularly.
I think it's just a fundamental problem for which it's going to be hard to get a real breakthrough.
There are just inherent trade-offs.
I think of them in terms of sufficient statistics: what do you store about your past, and how do you keep track of it? You want to remember the things that are useful and discard the things that are not.
And that's just fundamentally a hard problem.
There is always some kind of no-free-lunch involved: ahead of time, you don't know what you should remember and what you should discard.
And some things are going to be useful for one task and not useful for another.
And so I think it's a fundamentally very difficult problem where you have to make trade-offs.
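The sufficient-statistics trade-off can be made concrete with a toy example (my illustration, not anything from the interview): a fixed-size running summary, here an exponential moving average over a stream of values, keeps memory constant no matter how long the past is, but it irreversibly discards detail. Whether that was the right thing to keep depends on which question you ask afterwards.

```python
def ema_summary(stream, alpha=0.1):
    # A fixed-size "sufficient statistic" for the past: one number.
    # Recent values are weighted more heavily; old detail is lost.
    state = 0.0
    for x in stream:
        state = (1 - alpha) * state + alpha * x
    return state

# A stream with a regime shift halfway through.
stream = [1.0] * 50 + [5.0] * 50

summary = ema_summary(stream)
print(summary)  # close to 5.0: tracks the recent regime well

# But a different downstream question, e.g. the overall range of the
# stream, cannot be answered from the EMA alone: that information was
# discarded when the state was compressed.
print(max(stream) - min(stream))
```

The EMA is a good summary if the question is "what is happening now" and a useless one if the question is "how much did things vary," which is exactly the sense in which no fixed choice of what to remember is right for everything.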