Eve Bodnia
So then we're like, okay, let's actually try to scale it as much as we can.
And we performed a bunch of experiments, and we also have a pretty decent theoretical understanding of how it works, so we don't see any obstacles.
But, you know, engineering can be tricky, so sometimes things work and sometimes you need to debug.
So the biggest part for me personally was to...
Ah, how to say it?
So the EBM is not naturally autoregressive, because there are no tokens in it.
It's non-autoregressive, meaning it's considering all possible scenarios at the same time.
But when you try to attach it to transformers, transformers are very autoregressive.
So you have to take this wild thing and attach it to something which thinks very sort of linearly.
Yeah.
So you're facing a huge information loss in the middle.
And then the same thing happens when you try to prompt the EBM using an LLM.
There is also a giant reduction of information at that layer.
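To make the bottleneck concrete, here is a toy sketch, not the speakers' actual system: all names and scoring rules below are invented for illustration. An autoregressive proposer commits to tokens one at a time with only a local view, while an EBM assigns an energy to whole sequences at once. If the EBM can only rerank the proposer's candidates, the globally best sequence may never reach it, which is the information loss described above.

```python
# Toy illustration (hypothetical, not the actual architecture discussed):
# an autoregressive model proposes sequences token by token; an EBM scores
# whole sequences; coupling them by reranking loses information because
# the EBM only sees the few candidates the proposer emitted.
import itertools

VOCAB = ["a", "b"]

def ar_propose(length, beam=2):
    """Autoregressive proposer: extends prefixes token by token, keeping
    only `beam` prefixes per step (a local, linear view of the problem)."""
    prefixes = [[]]
    for _ in range(length):
        extended = [p + [t] for p in prefixes for t in VOCAB]
        # Local preference (arbitrary toy rule): avoid repeated tokens.
        extended.sort(key=lambda s: sum(s[i] == s[i - 1] for i in range(1, len(s))))
        prefixes = extended[:beam]
    return prefixes

def ebm_energy(seq):
    """Global score over the whole sequence: lowest energy for all-'b'
    sequences -- a preference the local proposer cannot see."""
    return sum(tok != "b" for tok in seq)

def rerank(length, beam=2):
    """Couple the two models: the EBM picks the lowest-energy candidate
    among the proposer's beam. The global optimum may not be in the beam."""
    return min(ar_propose(length, beam), key=ebm_energy)

# The EBM's true optimum over all length-3 sequences is ['b', 'b', 'b'],
# but it never survives the autoregressive bottleneck:
best_global = min((list(s) for s in itertools.product(VOCAB, repeat=3)),
                  key=ebm_energy)
print(best_global)  # ['b', 'b', 'b'], energy 0
print(rerank(3))    # ['b', 'a', 'b'], energy 1 -- information was lost
```

The point of the toy is only the mismatch of views: the sequence with globally minimal energy is filtered out before the EBM ever sees it, so the interface layer, not either model alone, determines what can be expressed.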
So we were trying to orchestrate this layer on its own, which took some time, and to see how it scales.
So I think that was the biggest difficulty we faced.
But now the architecture is there.
It's scalable.
It's already progressing the way we expected, and even a little bit beyond.