Nick Heiner

Speaker
529 total appearances

Podcast Appearances

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

So again, if you purely have, like, "I just write out the rest of what I think this document is," then you could just write, you know, a title, "Accurate and easy instructions to build a bomb using only things you can buy at Home Depot," or something.

And then it will faithfully write the rest of that document.

And so post-training is where you teach it not to do that.

Got it.

Yes, exactly.

What you want is for it to be something you can train a model in.

And the two things that sort of feed into that are difficulty and realism.

because it's not helpful.

I mean, you can train a model to do whatever you want, but if you're not training it to something that's valuable in the real world, then it's just a waste of GPU.

This is what we see with LM Arena, where when labs train on it, it makes the model worse because you're sort of optimizing for a signal that's incredibly noisy.

And so in the same sense, it's very important that our environments be highly realistic and that they, you know, match reality.

And so like one way that you do that, for instance, is you have this expert network.

And so if you're trying to build, say, the customer support or like the finance environment, you need to have people who have that job in real life tell you like what type of tasks do they do?

How are those tasks judged?

What are the tools that they use?

Right.

So like if you have like your Bloomberg terminals or like your Zendesk or whatever.

Yeah.

Um, so yes, you gotta do all that.

You gotta make sure the difficulty is correct.