Daniel Kokotajlo
👤 PersonAppearances Over Time
Podcast Appearances
This is actually a significant part of your worldview, so can you explain...
Yeah, your thoughts on why transparency through this period is important.
Let me just add on to that one of the many other reasons why I worry about nationalization or some kind of public-private partnership or even just very stringent regulation.
Actually, this is more an argument against very stringent regulation in favor of safety rather than deferring more to the labs on the implementation.
is that it just seems like we don't know what we don't know about alignment.
Every few weeks, there's this new result.
OpenAI had this really interesting result recently where they're like, hey, they often tell you if they want to hack, like in the chain of thought itself, and it's important that you don't train against
the chain of thought where they tell you they're going to hack because they'll still do the hacking if you train against it.
They just won't tell you about it.
You can imagine very naive regulatory responses.
It doesn't just have to be regulations.
One might be more optimistic that if it's an executive order or something, it'll be more flexible.
I just think that relies on a level of goodwill and...
flexibility on the behalf of a regulator.
But suppose the
There's some department that says, if we catch you, if we catch your AI saying that they want to take over or do something bad, then you'll be really heavily punished.
Your immediate response is allowed to just be like, okay, let's train them away from saying this.
So you can imagine all kinds of ways in which a top-down mandate from the government to the labs of safety would just really backfire.
And given how fast things are moving, maybe it makes more sense to,
leave these kinds of implementation decisions or even high-level overall... What is the word?