Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Yannis Antonoglou

๐Ÿ‘ค Speaker
329 total appearances

Appearances Over Time

Podcast Appearances

The Neuron: AI Explained
This DeepMind Vet Raised $2B to Open-Source Frontier AI

But like fundamentally what it is, is you can think of like an environment even like in the more reinforcement learning

The Neuron: AI Explained
This DeepMind Vet Raised $2B to Open-Source Frontier AI

kind of like terminology as a system where like your model can interact with.

The Neuron: AI Explained
This DeepMind Vet Raised $2B to Open-Source Frontier AI

So it can take actions like in our case, actions are like tool use, you can actually use certain tools, it can or MCPs.

The Neuron: AI Explained
This DeepMind Vet Raised $2B to Open-Source Frontier AI

And when it calls these tools or MCPs, it actually gets a new observation, like a new state of the world.

The Neuron: AI Explained
This DeepMind Vet Raised $2B to Open-Source Frontier AI

And this way, you know, it can interact

The Neuron: AI Explained
This DeepMind Vet Raised $2B to Open-Source Frontier AI

it can interact with the world so that it can complete the task.

The Neuron: AI Explained
This DeepMind Vet Raised $2B to Open-Source Frontier AI

So a world, for example, can be a terminal.

The Neuron: AI Explained
This DeepMind Vet Raised $2B to Open-Source Frontier AI

You can have a bash and then you type commands into the bash and then you just get the output of these commands and then the whole point is for the model to just solve the task that you asked it to solve.

The Neuron: AI Explained
This DeepMind Vet Raised $2B to Open-Source Frontier AI

Right, yeah.

The Neuron: AI Explained
This DeepMind Vet Raised $2B to Open-Source Frontier AI

I mean, I think data and environments are kind of quite similar, and

The Neuron: AI Explained
This DeepMind Vet Raised $2B to Open-Source Frontier AI

It's also like the terminology that people use depends on their background.

The Neuron: AI Explained
This DeepMind Vet Raised $2B to Open-Source Frontier AI

So synthetic data, for example.

The Neuron: AI Explained
This DeepMind Vet Raised $2B to Open-Source Frontier AI

When people say synthetic data and that the model can learn through synthetic data, what that means is that the model generates data and then it can actually... You do some form of filtering and then you can learn on the data that you've selected based on the filtering.

The Neuron: AI Explained
This DeepMind Vet Raised $2B to Open-Source Frontier AI

And...

The Neuron: AI Explained
This DeepMind Vet Raised $2B to Open-Source Frontier AI

You know, this is kind of like exactly how reinforcement learning works.

The Neuron: AI Explained
This DeepMind Vet Raised $2B to Open-Source Frontier AI

Like the whole idea of reinforcement learning is like trial and error, as in the model tries something and then based on the outcome of like what you tried, it either wants to just like do more of it or less of it.

The Neuron: AI Explained
This DeepMind Vet Raised $2B to Open-Source Frontier AI

And so like synthetic data and reinforcement learning, like inherently they're like really kind of almost the same thing at some level of abstraction.

The Neuron: AI Explained
This DeepMind Vet Raised $2B to Open-Source Frontier AI

And what reinforcement learning tries to do is just like be much more efficient in the way that it learns from this data, it learns from this experience.

The Neuron: AI Explained
This DeepMind Vet Raised $2B to Open-Source Frontier AI

Yeah, so we actually, you know, we believe in the Transformers and I understand why people and especially like, you know, people like Reed Sutton or Andre Kapathy feel that Transformers have like some inherent limitations that will stop us from like

The Neuron: AI Explained
This DeepMind Vet Raised $2B to Open-Source Frontier AI

you know, going all the way to AGI.