David Duvenaud
๐ค SpeakerAppearances Over Time
Podcast Appearances
And basically it's like an arms race ends up taking money from the poor sort of thing.
And then there was a reply, someone actually applied to the conference saying, oh, in contrast to McInnes et al., we have this sort of game theory model that shows that actually in a bilateral situation where there's only two powers, there is actually a stable equilibrium where they do still end up spending a lot of money on their welfare.
And I was so happy that there was somebody making a position clear enough to be rebutted by someone else.
And this is exactly the sort of thing that I'm hoping will happen more often, like how we provided a venue for it.
Maybe the other concrete thing that was obviously counterfactually good was Jacob Steinhardt.
We invited him to give a talk.
He's the CEO of Transluce, and he told me he didn't actually have an idea when he agreed to give a talk, but he came up with his idea for addressing some of these
dangers in the future.
And his concrete proposal was let's flood the Internet with high quality data showing A.I.
's doing valuable work, but in a morally aligned way.
And so this is kind of like moral fables for A.I.'s.
And if we flood enough of the Internet with this data, then anybody in the future who scrapes the Internet for their own new LLM is going to train something that's basically aligned by default and basically raise the cost of misaligning A.I.'s.
And I don't think this is like that big of a deal in terms of like it being
like fully addressing any problem satisfactorily, but it made me feel like, okay, there is alpha here in that if you push in this direction, we can sort of get people to think hard about this and make like some progress on concrete questions.
That's an improvement on what we have.
Yeah, exactly.
Yeah.
I mean, the way I kind of think about it is that we've just been living in easy mode this whole time where we weren't really steering our civilization.
And it was sort of fine because we're the fuel on which civilization runs.
And so we have almost no ability to control the whole thing.