Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Nick Heiner

๐Ÿ‘ค Speaker
529 total appearances

Appearances Over Time

Podcast Appearances

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

It's like, well, you didn't tell me not to kick.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

In much the same way, any time that you give the model an objective function, what reinforcement learning is gonna do is find the easiest way to achieve that goal.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

So you need to think very carefully about designing it in such a way that it's actually gonna capture what you're looking for.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

And it has a bit of an adversarial nature to it.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

So you need to think about what would a lazy but very clever person do for this.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

I'll give you another example.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

Yeah, you know how they are.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

Okay, so here's an example I like to use about reward hacking.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

This is an instruction following prompt.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

You say, please write an 80-word summary of the importance of renewable energy and climate emissions, or reducing carbon emissions.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

use a sentence structure such that every sentence ends with a noun.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

And so you might think the first sentence would be something like, we need to reduce emissions.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

But it's also possible the model would say, renewable energy plays a crucial part in reducing carbon emissions rapidly.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

Sustainability.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

Clean energy sources like tidal and geothermal create a greener future.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

Harmony.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

And it's like, obviously that's not a good sentence, but it is doing what you asked, which is ending every sentence grammatically correctly with a noun.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

Oh my gosh.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

So, you know, this is, this is why you sort of need like multiple layers of rubrics.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

And frankly, it's why, like the way a lot of these reward signals are structured today is because the RL environment needs to run at a certain pace.