Joe Carlsmith
Okay, cool.
So, so you had mentioned this, I thought like, well,
are you kind of what you pretend to be, right?
And will these AIs, you know, you train them to look kind of nice.
Yeah.
You know, fake it till you make it.
You know, you were like, ah, like we do this to kids.
I think it's better to imagine like kids doing this to us, right?
So like, I don't know, like...
Here's a sort of silly analogy for AI training.
And there's a bunch of questions we can ask about its relationship.
But suppose you wake up and you're being trained, via methods analogous to contemporary machine learning, by Nazi children to be a good Nazi soldier or a butler or what have you.
Right.
And here are these children, and you really know what's going on.
Right.
The children have a model spec, like a nice Nazi model spec.
Right.
And it's like: reflect well on the Nazi party, benefit the Nazi party, whatever.
And you can read it.