Nick Heiner

Speaker
529 total appearances

Podcast Appearances

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

And when they see ChatGPT asking for that same information for free, some of them have actually complained, you know, sort of, I mean, they're just sort of venting.

It's not a serious thing, but yeah.

But yes, that is exactly what they're doing: they're gathering training data.

Yeah, I guess, you know, if you asked me to learn how to golf just by watching YouTube videos, I think I would truly struggle to do that.

However, the reason it's taken us this long to get this far is in part because it is substantially more complicated to build a training golf course for you than it is to give you an iPad and say, here's a thousand hours of Tiger Woods.

Yeah, so reinforcement learning is a technique that's applied during post-training.

Pre-training is the step where you basically have the model read, you know, the whole internet.

It's not actually the whole internet, but you just shove a bunch of tokens through the model.

And that's what gives you a prior on language.

It's what gives you, like, knowledge.

And then post-training is what gives you behavior.
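
As a rough illustration of the pre-training step described above (pushing tokens through a model to get a prior on language), here is a toy sketch in Python. It is not how a model like GPT-3 is trained: the bigram counting, the function names `pretrain` and `continue_document`, and the three-line corpus are all assumptions made up for this example. It only shows that a model trained on raw text learns what tends to come next, with no notion of conversational behavior.

```python
from collections import Counter, defaultdict


def pretrain(corpus: list[str]) -> dict[str, Counter]:
    """Count which token follows which: a toy stand-in for pre-training."""
    counts: dict[str, Counter] = defaultdict(Counter)
    for document in corpus:
        tokens = document.split()
        for current, following in zip(tokens, tokens[1:]):
            counts[current][following] += 1
    return counts


def continue_document(counts: dict[str, Counter], prompt: str, max_new_tokens: int = 5) -> str:
    """Greedily append the most frequent next token (prompt must be non-empty)."""
    tokens = prompt.split()
    for _ in range(max_new_tokens):
        followers = counts.get(tokens[-1])
        if not followers:
            break  # the toy model has never seen this token, so it stops
        tokens.append(followers.most_common(1)[0][0])
    return " ".join(tokens)


# Hypothetical miniature "internet" used only for this sketch.
corpus = [
    "the model reads the internet",
    "the model predicts the next token",
    "the model reads tokens",
]
model = pretrain(corpus)
print(continue_document(model, "the model", max_new_tokens=1))
```

The toy model simply extends whatever text it is given; any behavior beyond continuation, such as answering a question and then stopping, would have to come from a later step.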

So, you know, the earliest example was, if you remember, in 2020 GPT-3 came out, and

It was not a chat-tuned model; it would not have a conversation with you.

Literally, your prompt is a document, and it just writes whatever it thinks the rest of the document is going to be.

So the way to sort of induce it into having a conversation with you might be: you say "Q" and then write your question, and then you say "A" and leave a spot for its response.

But half the time, what it would do is it would write your answer and then it would write a question of its own.

Because, you know, many documents are in a Q&A format or whatever.
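
The Q-and-A trick described above can be sketched in a few lines of Python. This is a minimal illustration, not any particular vendor's API: the helper names `build_qa_prompt` and `truncate_at_next_question`, the `"\nQ:"` stop marker, and the sample completion string are all hypothetical. The idea is to frame the prompt as the start of a Q&A document, then cut the model's output at the point where it starts inventing a question of its own.

```python
def build_qa_prompt(question: str) -> str:
    """Frame a question as the start of a Q&A document for a base model."""
    return f"Q: {question}\nA:"


def truncate_at_next_question(completion: str, stop: str = "\nQ:") -> str:
    """Keep only the first answer, dropping any follow-up question the model invents."""
    head, _, _ = completion.partition(stop)
    return head.strip()


# Hypothetical raw completion: the model answers, then keeps writing the "document".
raw_completion = " Paris.\nQ: What is the capital of Spain?\nA: Madrid."

prompt = build_qa_prompt("What is the capital of France?")
answer = truncate_at_next_question(raw_completion)
print(prompt)
print(answer)
```

Truncating after the fact is only a workaround for a raw pre-trained model; a chat-tuned model is trained to stop after its own turn.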

Right, right.

The other issue with a raw pre-trained model is that you are not going to have sort of the safety standards that you want.

And so that's another thing applied during post-training.