Trenton Bricken
๐ค SpeakerAppearances Over Time
Podcast Appearances
I think as much of a chunk as is necessary.
I mean, I think at least like, yeah, hard to define.
But I don't know.
At Anthropic, I feel like all of the different portfolios are like being very well supported and growing.
I think language models are also just really weird.
With the emergent misalignment work,
I don't know if they took predictions they should have of like, hey, I'm going to fine tune ChatGPT on code vulnerabilities.
Is it going to become a Nazi?
And I think most people would have said no.
And that's what happened.
And so what are the different โ And how did they discover that it became a Nazi?
they started asking it a ton of different questions.
And it will do all sorts of, like, vile and harmful things.
Like, the whole persona just totally changes.
And, I mean, we are dealing with alien brains here who don't have the social norms of humans or even a clear notion of, like, what they have and haven't learned.
That we have of them, I mean.
And so I think you really want to go into this with eyes wide open.
Yeah.
Dylan Patel has some scary forecasts on U.S.
energy.