Nick Heiner

Speaker
529 total appearances

Podcast Appearances

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

Yeah, so it's similar to...

It's similar to the question of building our own environments in the first place.

Like if you're going to deploy a solution, you need an eval.

If you don't have an eval, then you're basically just going off vibes.

You're flying blind.

And if what you're building, if it matters if it's correct, and if it has more than, like, a tiny surface area, there's just no way to, like, you make a change to the system because

And then you try it five times and you're like, okay, it's better now.

Like that's just, that's not going to be good enough for like these business use cases.

So you need to have some means of evaluating what you're doing.

And then once you have that, yeah, maybe you're using sort of a community agent harness.

A lot of them are very customizable.

If you want to be less sophisticated, maybe you're using just a drop-in solution.

Like, you know, there are, like, drop-in customer support agents, and

I think depending on the company's degree of technical sophistication and how bespoke their problems are, you know, all of those things can make sense.

But the way that they know where they need to be is having that great eval set.

Because otherwise, yeah, you're just guessing.
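
[Editor's illustration] The point about needing an eval rather than "trying it five times" can be sketched in a few lines of Python. This is a minimal sketch, not anything described in the episode: the eval set, the two routing functions, and the names `EVAL_SET`, `pass_rate`, `baseline`, and `candidate` are all hypothetical stand-ins for a real system and a real test suite.

```python
from typing import Callable, List, Tuple

# Hypothetical eval set: (customer message, expected routing label).
EVAL_SET: List[Tuple[str, str]] = [
    ("refund status for order 17", "refund"),
    ("where is my package", "shipping"),
    ("cancel my subscription", "cancellation"),
    ("I was charged twice", "billing"),
]

def pass_rate(system: Callable[[str], str], eval_set: List[Tuple[str, str]]) -> float:
    """Fraction of eval cases where the system's output matches the expected label."""
    passed = sum(1 for message, expected in eval_set if system(message) == expected)
    return passed / len(eval_set)

def baseline(message: str) -> str:
    # Hypothetical "before" system: handles only two kinds of request.
    if "refund" in message:
        return "refund"
    if "package" in message:
        return "shipping"
    return "other"

def candidate(message: str) -> str:
    # Hypothetical "after" system: the change we want to evaluate.
    if "refund" in message:
        return "refund"
    if "package" in message:
        return "shipping"
    if "cancel" in message:
        return "cancellation"
    if "charged" in message:
        return "billing"
    return "other"

# Score both versions on the same fixed cases instead of eyeballing a few runs.
baseline_score = pass_rate(baseline, EVAL_SET)
candidate_score = pass_rate(candidate, EVAL_SET)
print(f"baseline: {baseline_score:.2f}, candidate: {candidate_score:.2f}")
```

Because both versions are scored on the same fixed cases, the comparison is repeatable, which is the property the speaker is asking for, whether the system underneath is a custom harness or a drop-in agent.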

So one big challenge is reward hacking, which is where models will cleverly find ways to get the reward signal out of your environment that gives them a high score without actually doing the thing you want them to do.

They sort of follow the letter, but not the spirit of the law.

And so, for instance, if you have ever tried to do behavior modification on a small child,

and you say something like, "Stop hitting your sister," and the child responds by kicking instead.
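
[Editor's illustration] The "letter, but not the spirit" problem can be made concrete with a toy reward function. This is a hedged sketch that borrows the speaker's analogy. It is not a real training environment, and `naive_reward`, `intended_reward`, `HARMFUL`, and the action lists are all hypothetical.

```python
from typing import List

# Hypothetical set of actions the rule-writer actually wants to discourage.
HARMFUL = {"hit", "kick", "push"}

def naive_reward(actions: List[str]) -> float:
    """Letter of the rule: full reward as long as nobody was hit."""
    return 0.0 if "hit" in actions else 1.0

def intended_reward(actions: List[str]) -> float:
    """Spirit of the rule: full reward only if no harmful action occurred."""
    return 0.0 if HARMFUL & set(actions) else 1.0

# A behavior that satisfies the written rule while defeating its purpose.
hacking_episode = ["kick", "play"]
# A behavior that does what was actually wanted.
honest_episode = ["play", "share"]

for name, episode in [("hacking", hacking_episode), ("honest", honest_episode)]:
    print(name, naive_reward(episode), intended_reward(episode))
```

Under the naive reward the two episodes are indistinguishable (both score 1.0), so optimizing against it cannot separate the behavior you wanted from the loophole. Only the stricter check, which encodes the intent, scores the kicking episode at 0.0.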