Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Eve Bodnia

๐Ÿ‘ค Speaker
495 total appearances

Appearances Over Time

Podcast Appearances

The Neuron: AI Explained
Why Energy-Based Models Could Be the Next Big Shift in AI

So then we're like, okay, let's actually try to scale it as much as we can.

The Neuron: AI Explained
Why Energy-Based Models Could Be the Next Big Shift in AI

And we performed a bunch of experiments, and also we have pretty decent theoretical understanding how it works, so we don't see any obstacles.

The Neuron: AI Explained
Why Energy-Based Models Could Be the Next Big Shift in AI

But, you know, engineering can be tricky, so sometimes things work and sometimes things you need to debug.

The Neuron: AI Explained
Why Energy-Based Models Could Be the Next Big Shift in AI

So the biggest part for me personally was to...

The Neuron: AI Explained
Why Energy-Based Models Could Be the Next Big Shift in AI

Ah, how to say it?

The Neuron: AI Explained
Why Energy-Based Models Could Be the Next Big Shift in AI

So the EBM is not naturally autoregressive because there's no tokens in it.

The Neuron: AI Explained
Why Energy-Based Models Could Be the Next Big Shift in AI

It's also non-autoregressive.

The Neuron: AI Explained
Why Energy-Based Models Could Be the Next Big Shift in AI

So meaning it's overseeing all possible scenarios at the same time.

The Neuron: AI Explained
Why Energy-Based Models Could Be the Next Big Shift in AI

But when you try to attach it to transformers,

The Neuron: AI Explained
Why Energy-Based Models Could Be the Next Big Shift in AI

Transformers are very autoregressive.

The Neuron: AI Explained
Why Energy-Based Models Could Be the Next Big Shift in AI

So you have to take this wild thing and attach to something which is thinking very sort of linearly.

The Neuron: AI Explained
Why Energy-Based Models Could Be the Next Big Shift in AI

Yeah.

The Neuron: AI Explained
Why Energy-Based Models Could Be the Next Big Shift in AI

So you're facing a huge information loss in the middle.

The Neuron: AI Explained
Why Energy-Based Models Could Be the Next Big Shift in AI

And then the same thing when you try to prompt using LLM the EBM.

The Neuron: AI Explained
Why Energy-Based Models Could Be the Next Big Shift in AI

So there is also a giant reduction of the information on that layer.

The Neuron: AI Explained
Why Energy-Based Models Could Be the Next Big Shift in AI

So we were trying to like orchestrate this layer alone, which took some time and try to see how it scales.

The Neuron: AI Explained
Why Energy-Based Models Could Be the Next Big Shift in AI

So I think that was the biggest difficulty we faced.

The Neuron: AI Explained
Why Energy-Based Models Could Be the Next Big Shift in AI

But now the architecture is there.

The Neuron: AI Explained
Why Energy-Based Models Could Be the Next Big Shift in AI

It's scalable.

The Neuron: AI Explained
Why Energy-Based Models Could Be the Next Big Shift in AI

It's already like progressing the way we expected and a little bit even beyond.