Yannis Antonoglou

You can have a bash and then you type commands into the bash and then you just get the output of these commands and then the whole point is for the model to just solve the task that you asked it to solve.

1048.196 View full episode →

The Neuron: AI Explained

This DeepMind Vet Raised $2B to Open-Source Frontier AI

Right, yeah.

1062.7 View full episode →

The Neuron: AI Explained

This DeepMind Vet Raised $2B to Open-Source Frontier AI

I mean, I think data and environments are kind of quite similar, and

1075.683 View full episode →

The Neuron: AI Explained

This DeepMind Vet Raised $2B to Open-Source Frontier AI

It's also like the terminology that people use depends on their background.

1082.866 View full episode →

The Neuron: AI Explained

This DeepMind Vet Raised $2B to Open-Source Frontier AI

So synthetic data, for example.

1089.216 View full episode →

The Neuron: AI Explained

This DeepMind Vet Raised $2B to Open-Source Frontier AI

When people say synthetic data and that the model can learn through synthetic data, what that means is that the model generates data and then it can actually... You do some form of filtering and then you can learn on the data that you've selected based on the filtering.

1091.719 View full episode →

The Neuron: AI Explained

This DeepMind Vet Raised $2B to Open-Source Frontier AI

And...

1108.745 View full episode →

The Neuron: AI Explained

This DeepMind Vet Raised $2B to Open-Source Frontier AI

You know, this is kind of like exactly how reinforcement learning works.

1109.967 View full episode →

The Neuron: AI Explained

This DeepMind Vet Raised $2B to Open-Source Frontier AI

Like the whole idea of reinforcement learning is like trial and error, as in the model tries something and then based on the outcome of like what you tried, it either wants to just like do more of it or less of it.

1112.63 View full episode →

The Neuron: AI Explained

This DeepMind Vet Raised $2B to Open-Source Frontier AI

And so like synthetic data and reinforcement learning, like inherently they're like really kind of almost the same thing at some level of abstraction.

1123.043 View full episode →

The Neuron: AI Explained

This DeepMind Vet Raised $2B to Open-Source Frontier AI

And what reinforcement learning tries to do is just like be much more efficient in the way that it learns from this data, it learns from this experience.

1131.774 View full episode →

The Neuron: AI Explained

This DeepMind Vet Raised $2B to Open-Source Frontier AI

Yeah, so we actually, you know, we believe in the Transformers and I understand why people and especially like, you know, people like Reed Sutton or Andre Kapathy feel that Transformers have like some inherent limitations that will stop us from like

1156.66 View full episode →

The Neuron: AI Explained

This DeepMind Vet Raised $2B to Open-Source Frontier AI

you know, going all the way to AGI.

1173.299 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment