Dr. Jeff Beck
Podcast Appearances
So this would include things like the number of people who are going hungry, and all the stats that describe the inputs and outputs to our policy distribution.
And then we could just tell an AI:
Your reward function is the one that results in the same outcome that we currently have, right?
On average.
And it would execute it, and to the extent that it works, it would ultimately result in an AI algorithm that is just mimicking human behavior, right?
Or at least achieving the same outcome that we were achieving before.
Now, here's the safe way to improve the situation.
You don't say end world hunger, right?
You perturb that distribution over outcomes, and just over outcomes, a little bit, and then you evaluate the consequences.
That's all you're doing.
You make these little changes to an empirically estimated reward function rather than specifying one by hand, because that's the dangerous thing.
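[Editor's illustration] The procedure described above can be sketched in a few lines of Python. This is only a toy under stated assumptions, not anything the speaker specified: the outcome labels and counts are made up, the "empirically estimated reward" is assumed to be the log of the observed outcome frequencies, and the helper names (`estimate_reward`, `target_distribution`, `perturb`, `kl`) are hypothetical. It estimates a reward from observed outcomes, nudges it slightly, and measures how far the resulting outcome distribution moves.

```python
import math

def estimate_reward(outcome_counts):
    """Assumed empirical reward: log of each outcome's observed frequency."""
    total = sum(outcome_counts.values())
    return {k: math.log(v / total) for k, v in outcome_counts.items()}

def target_distribution(reward):
    """Softmax of the reward; for the unperturbed reward this recovers
    the observed outcome frequencies (the status quo)."""
    z = sum(math.exp(r) for r in reward.values())
    return {k: math.exp(r) / z for k, r in reward.items()}

def perturb(reward, outcome, delta):
    """Return a copy of the reward with one outcome shifted by a small delta."""
    new_reward = dict(reward)
    new_reward[outcome] += delta
    return new_reward

def kl(p, q):
    """KL divergence KL(p || q) between two discrete distributions."""
    return sum(p[k] * math.log(p[k] / q[k]) for k in p if p[k] > 0)

# Made-up outcome statistics, for illustration only.
counts = {"fed": 900, "hungry": 100}

reward = estimate_reward(counts)
baseline = target_distribution(reward)       # mimics the current outcomes

# A small change: slightly lower the reward attached to the "hungry" outcome.
nudged = target_distribution(perturb(reward, "hungry", -0.1))

# One way to evaluate the consequence: how far did the outcomes move?
shift = kl(nudged, baseline)
print(baseline["hungry"], nudged["hungry"], shift)
```

In this sketch the KL divergence stands in for "evaluating the consequences": a small value indicates the nudged target stays close to the status quo, which is the point of making small changes to an estimated reward instead of writing down a goal such as "end world hunger" by hand.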
Jeff, thank you so much for joining us.
It's my pleasure.
Amazing.