Scott Alexander
π€ SpeakerAppearances Over Time
Podcast Appearances
Here are some cool demos.
Like,
Otherwise, if you keep it a secret, then... Well, yeah.
So that's an example of transparency.
And then in the lead-up to that, I just want to see more benchmark scores and more freedom of speech for employees to talk about their predictions for AGI timelines and stuff.
So that, like, blah, blah, blah, blah, blah.
And then for the model spec thing, this is a constitution of power thing, but also an alignment thing.
Like...
The goals and values and principles and intended behaviors of your AIs should not be a secret, I think.
You should be transparent about, like, here are the values that we're putting into them.
Right.
Another example of this, by the way, so first of all, kudos to OpenAI for publishing their model spec.
They didn't have to do that.
I think they might have been the first to do that, and it's a good step in the right direction.
If you read the actual spec, it has like a sort of escape clause where it's like there's some important policies that are top-level priority in the spec that overrule everything else that...
we're not publishing and the model is instructed to keep secret from the user.
And it's like, what are those?
That seems interesting.
I wonder what that is.
I bet it's nothing suspicious right now.