Yannis Antonoglou

This DeepMind Vet Raised $2B to Open-Source Frontier AI

is to try new algorithms.

So you can take this model, run two different reinforcement learning algorithms on it, for example, see which one behaves best, and do all of that with a model that's a frontier model.

881.154

The Neuron: AI Explained

Yes.

902.675

I mean, we are true believers in reinforcement learning.

903.176

Many of the team have a strong reinforcement learning background, including myself.

910.226

So for us, it's one of the big bets.

914.292

Yeah, I mean, I guess the final architecture is something that you have to wait and see when the model lands.

928.78

Fair enough.

934.903

I cannot really share anything at this point.

935.164

At the same time,

938.156

what we're building is a frontier agentic model.

940.508

So think of a system that can do multi-step reasoning.

947.057

One that can really interact with tools and environments and complete the task end to end.

952.585

So that means that it needs to understand long context.

959.375

It needs to be able to self-correct.

962.722

There are certain capabilities that are really important in order to have agentic intelligence.

965.649

There are many things that go into training these models, from instruction following and pre-training and the data mixtures, but also a lot of reinforcement learning and what environments you use to really train this model.

973.33

An environment can be anything.

999.335

It can be something simple like a coding environment.

1000.839

It can be an actual website or something that you want to simulate.
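
To make the idea of an "environment" concrete, here is a minimal sketch of a coding environment with a reset/step loop. Everything in it is an assumption for illustration and not something described in the episode: the names `Environment`, `StepResult`, `ToyCodingEnv`, `run_episode`, and `scripted_policy` are hypothetical, and the reward scheme (1.0 when the submitted code passes two checks, 0.0 otherwise) is just one simple choice.

```python
from dataclasses import dataclass
from typing import Callable, Protocol


@dataclass
class StepResult:
    observation: str  # what the agent sees next (here: test feedback)
    reward: float     # scalar feedback for the last action
    done: bool        # True once the episode is over


class Environment(Protocol):
    """Hypothetical interface: anything with reset() and step() qualifies."""

    def reset(self) -> str: ...
    def step(self, action: str) -> StepResult: ...


class ToyCodingEnv:
    """Toy coding task: each action is Python source that should define add(a, b)."""

    def __init__(self, max_attempts: int = 3) -> None:
        self.max_attempts = max_attempts
        self.attempts = 0

    def reset(self) -> str:
        self.attempts = 0
        return "Write a function add(a, b) that returns a + b."

    def step(self, action: str) -> StepResult:
        self.attempts += 1
        namespace: dict = {}
        try:
            # Toy only: real systems must sandbox untrusted code.
            exec(action, namespace)
            add = namespace["add"]
            passed = add(2, 3) == 5 and add(-1, 1) == 0
            feedback = "tests passed" if passed else "tests failed"
        except Exception as err:
            passed = False
            feedback = f"error: {err!r}"
        done = passed or self.attempts >= self.max_attempts
        return StepResult(feedback, 1.0 if passed else 0.0, done)


def run_episode(env: Environment, policy: Callable[[str], str]) -> float:
    """Roll out one episode and return the total reward."""
    obs = env.reset()
    total = 0.0
    while True:
        result = env.step(policy(obs))
        total += result.reward
        if result.done:
            return total
        obs = result.observation


def scripted_policy(obs: str) -> str:
    """Stand-in for a model: submits a buggy answer first, then fixes it on feedback."""
    if "failed" in obs or "error" in obs:
        return "def add(a, b):\n    return a + b"
    return "def add(a, b):\n    return a - b"
```

With these definitions, `run_episode(ToyCodingEnv(), scripted_policy)` takes two steps: the first submission fails the checks, the feedback comes back as the next observation, and the corrected second submission earns the reward. A simulated website could sit behind the same two methods, with page contents as observations and clicks or form inputs as actions.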

1003.585