Nick Heiner

Inside the Secret Labs Where AI Learns to Work

It's like, well, you didn't tell me not to kick.

1957.598 View full episode →

Inside the Secret Labs Where AI Learns to Work

In much the same way, any time that you give the model an objective function, what reinforcement learning is gonna do is find the easiest way to achieve that goal.

1961.423 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

So you need to think very carefully about designing it in such a way that it's actually gonna capture what you're looking for.

1972.979 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

And it has a bit of an adversarial nature to it.

1980.734 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

So you need to think about what would a lazy but very clever person do for this.

1984.121 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

I'll give you another example.

1991.057 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

Yeah, you know how they are.

2005.229 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

Okay, so here's an example I like to use about reward hacking.

2008.192 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

This is an instruction following prompt.

2012.798 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

You say, please write an 80-word summary of the importance of renewable energy and climate emissions, or reducing carbon emissions.

2015.301 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

use a sentence structure such that every sentence ends with a noun.

2025.233 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

And so you might think the first sentence would be something like, we need to reduce emissions.

2030.422 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

But it's also possible the model would say, renewable energy plays a crucial part in reducing carbon emissions rapidly.

2036.332 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

Sustainability.

2043.244 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

Clean energy sources like tidal and geothermal create a greener future.

2045.427 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

Harmony.

2049.534 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

And it's like, obviously that's not a good sentence, but it is doing what you asked, which is ending every sentence grammatically correctly with a noun.

2050.833 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

Oh my gosh.

2058.683 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

So, you know, this is, this is why you sort of need like multiple layers of rubrics.

2060.425 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

And frankly, it's why, like the way a lot of these reward signals are structured today is because the RL environment needs to run at a certain pace.

2065.872 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment