Nick Heiner

And if what you're building, if it matters, if it's correct, and if it has more than like a tiny surface area, there's just no way to like you make a change to the system because

1863.861 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

And then you try it five times and you're like, okay, it's better now.

1877.02 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

Like that's just, that's not going to be good enough for like these business use cases.

1880.125 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

So you need to have some means of evaluating what you're doing.

1884.111 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

And then once you have that, yeah, maybe you're using sort of a community agent harness.

1888.197 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

A lot of them are very customizable.

1894.366 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

If you want to be sophisticated, maybe you're using just a drop-in solution.

1895.809 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

Like, you know, they're like drop-in customer support agents and,

1899.775 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

I think depending on the company's degree of technical sophistication and how bespoke their problems are, you know, all of those things can make sense.

1902.367 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

But the way that they know where they need to be is having that great eval set.

1911.525 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

Because otherwise, yeah, you're just guessing.

1916.495 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

So one big challenge is reward hacking, which is where models will cleverly find ways to get the reward signal out of your environment that gives it a high score without actually doing the thing you want them to do.

1923.528 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

They sort of follow the letter, but not the spirit of the law.

1941.027 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

And so, for instance, if you have ever tried to do behavior modification on a small child,

1946.393 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

and you say something like, stop hitting your sister, and the child responds by kicking instead.

1951.81 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment