Eve Bodnia

👤 Speaker

495 total appearances

Appearances Over Time

Podcast Appearances

The Neuron: AI Explained

Why Energy-Based Models Could Be the Next Big Shift in AI

Um, no, it's, um, so the first, when we started this company, I just had like some theoretical idea, right?

1624.486 View full episode →

The Neuron: AI Explained

Why Energy-Based Models Could Be the Next Big Shift in AI

And then I was surrounded by talented engineers who just brought this idea into a form of proof of concept.

1633.458 View full episode →

The Neuron: AI Explained

Why Energy-Based Models Could Be the Next Big Shift in AI

And that was like a few months ago.

1640.368 View full episode →

The Neuron: AI Explained

Why Energy-Based Models Could Be the Next Big Shift in AI

And the natural question was like, oh, can the proof of concept be the actual, like a toy model for the model we have today?

1642.691 View full episode →

The Neuron: AI Explained

Why Energy-Based Models Could Be the Next Big Shift in AI

So the answer was yes, but to get there, we had to perform like a series of experiments to evaluate like what's right, what works, what doesn't.

1650.862 View full episode →

The Neuron: AI Explained

Why Energy-Based Models Could Be the Next Big Shift in AI

So when the architecture was fully designed, the next step would be, oh, is it compatible with LLMs or with transformers in general?

1660.095 View full episode →

The Neuron: AI Explained

Why Energy-Based Models Could Be the Next Big Shift in AI

Because it's so fundamentally different.

1668.547 View full episode →

The Neuron: AI Explained

Why Energy-Based Models Could Be the Next Big Shift in AI

We didn't even know like

1670.89 View full episode →

The Neuron: AI Explained

Why Energy-Based Models Could Be the Next Big Shift in AI

it's possible to do.

1672.512 View full episode →

The Neuron: AI Explained

Why Energy-Based Models Could Be the Next Big Shift in AI

So the first step was to attach Transformer and try to scale it a little bit and then kind of shrink it back to the toy model version.

1674.515 View full episode →

The Neuron: AI Explained

Why Energy-Based Models Could Be the Next Big Shift in AI

So we successfully done so.

1682.045 View full episode →

The Neuron: AI Explained

Why Energy-Based Models Could Be the Next Big Shift in AI

And then we're like, oh, can we like not even have an LLM, but the most simplest version of something related to LLM attached to Transformant, which is small and we understand.

1684.168 View full episode →

The Neuron: AI Explained

Why Energy-Based Models Could Be the Next Big Shift in AI

So we attach that, we also scale it and then scale it back and like, okay, that works.

1696.405 View full episode →

The Neuron: AI Explained

Why Energy-Based Models Could Be the Next Big Shift in AI

How about we just attach the real LLM to the EBM and see how it is as a user interface.

1701.908 View full episode →

The Neuron: AI Explained

Why Energy-Based Models Could Be the Next Big Shift in AI

Can it prompt the EBM in the way we want?

1707.676 View full episode →

The Neuron: AI Explained

Why Energy-Based Models Could Be the Next Big Shift in AI

And the answer was yes.

1710.921 View full episode →

The Neuron: AI Explained

Why Energy-Based Models Could Be the Next Big Shift in AI

So we again scale it and then test it.

1712.083 View full episode →

The Neuron: AI Explained

Why Energy-Based Models Could Be the Next Big Shift in AI

We have a set of benchmarks, which is related to spatial thinking and hierarchical planning.

1715.087 View full episode →

The Neuron: AI Explained

Why Energy-Based Models Could Be the Next Big Shift in AI

So we had baselines for the smallest version of

1720.836 View full episode →

The Neuron: AI Explained

Why Energy-Based Models Could Be the Next Big Shift in AI

smallest version of the model, then proof of concept, then the real version of the model and kind of like compare it back and forth and it seems to be working.

1725.803 View full episode →

← Previous Page 13 of 25 Next →

Report any issue