Nick Heiner

Inside the Secret Labs Where AI Learns to Work

And when they see ChatGPT asking for that same information for free, like some of them have actually complained, you know, sort of in like, I mean, not like, I mean, they're just sort of vetting.

433.938 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

It's not a serious thing, but yeah.

444.46 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

But yes, but that is exactly what they're doing is they're gathering training data.

446.244 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

Yeah, I guess like, you know, if you ask me to learn how to golf just by watching YouTube videos, like I think I would truly struggle to do that.

466.338 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

However, the reason it's taken us this long to get this far is in part because it is substantially more complicated to build a training golf course for you than it is to give you an iPad and say, here's a thousand hours of Tiger Woods.

474.651 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

Yeah, so reinforcement learning is a technique that's applied during post-training.

501.37 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

Pre-training is the step where you basically have the model read, you know, the whole internet.

506.135 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

It's not actually the whole internet, but you just shove a bunch of tokens through the model.

511.34 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

And that's what gives you a prior on language.

515.785 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

It's what gives you, like, knowledge.

517.607 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

And then post-training is what gives you behavior.

520.57 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

So, you know, the most, the earliest example was if you remember in 2020, GPT-3 came out and

523.113 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

It was not a chat tuned model like it would not have a conversation with you.

530.26 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

It would literally just you your prompt is a document and it just writes whatever it thinks the rest of the document is going to be.

534.685 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

So the way to sort of induce it into having a conversation with you might be you say Q and then write your question and then you say A and leave a spot for its response.

542.735 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

But half the time, what it would do is it would write your answer and then it would write a question of its own.

555.27 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

Because, you know, many documents are in a Q&A format or whatever.

560.213 View full episode →

The Neuron: AI Explained

Inside the Secret Labs Where AI Learns to Work

Right, right.