Mike Krieger
👤 SpeakerAppearances Over Time
Podcast Appearances
And then other harms that are against the way we think the model should behave and ideally have been caught earlier, but if they aren't, then they can get caught at that last mile. So it's like our head of trust and safety was talking to him last week.
And then other harms that are against the way we think the model should behave and ideally have been caught earlier, but if they aren't, then they can get caught at that last mile. So it's like our head of trust and safety was talking to him last week.
He calls it the Swiss cheese method, which is like no one layer will catch everything, but ideally enough layer stack will catch a lot of it before it reaches the end.
He calls it the Swiss cheese method, which is like no one layer will catch everything, but ideally enough layer stack will catch a lot of it before it reaches the end.
He calls it the Swiss cheese method, which is like no one layer will catch everything, but ideally enough layer stack will catch a lot of it before it reaches the end.
Yeah, and I would maybe split internal to Anthropic and just what I've just seen out in the world. The Grok image generation stuff that came out like two weeks ago was fascinating. Cause it was almost like, you know, cause I think there was a, maybe they've introduced some, but at launch it felt like there was almost, it was a total free for all.
Yeah, and I would maybe split internal to Anthropic and just what I've just seen out in the world. The Grok image generation stuff that came out like two weeks ago was fascinating. Cause it was almost like, you know, cause I think there was a, maybe they've introduced some, but at launch it felt like there was almost, it was a total free for all.
Yeah, and I would maybe split internal to Anthropic and just what I've just seen out in the world. The Grok image generation stuff that came out like two weeks ago was fascinating. Cause it was almost like, you know, cause I think there was a, maybe they've introduced some, but at launch it felt like there was almost, it was a total free for all.
It's like, do you want to see Kamala with a machine gun? It was like, it was, you know, crazy stuff. I go between believing that like actually having examples like that in the world are actually helpful and almost like inoculating, like what you take for granted as a photograph or not, you know, or even a video or not. I don't think we're far from that as well. And like getting, you know,
It's like, do you want to see Kamala with a machine gun? It was like, it was, you know, crazy stuff. I go between believing that like actually having examples like that in the world are actually helpful and almost like inoculating, like what you take for granted as a photograph or not, you know, or even a video or not. I don't think we're far from that as well. And like getting, you know,
It's like, do you want to see Kamala with a machine gun? It was like, it was, you know, crazy stuff. I go between believing that like actually having examples like that in the world are actually helpful and almost like inoculating, like what you take for granted as a photograph or not, you know, or even a video or not. I don't think we're far from that as well. And like getting, you know,
Maybe it's calling the Denver post or like a trusted source, or maybe it's like creating some hierarchy of trust that we can go after. You know, I don't, there's no easy answers there as well, but like, that's, I would say like. A industry, almost like a society-wide thing that we're going to reckon with as well, like the image and video pieces.
Maybe it's calling the Denver post or like a trusted source, or maybe it's like creating some hierarchy of trust that we can go after. You know, I don't, there's no easy answers there as well, but like, that's, I would say like. A industry, almost like a society-wide thing that we're going to reckon with as well, like the image and video pieces.
Maybe it's calling the Denver post or like a trusted source, or maybe it's like creating some hierarchy of trust that we can go after. You know, I don't, there's no easy answers there as well, but like, that's, I would say like. A industry, almost like a society-wide thing that we're going to reckon with as well, like the image and video pieces.
And then on text, I think what changes with AI is the mass production. So one thing that we look at is any type of coordinated effort. We looked at this as well at Instagram. At individual levels, it might be hard to catch the one person that's like...
And then on text, I think what changes with AI is the mass production. So one thing that we look at is any type of coordinated effort. We looked at this as well at Instagram. At individual levels, it might be hard to catch the one person that's like...
And then on text, I think what changes with AI is the mass production. So one thing that we look at is any type of coordinated effort. We looked at this as well at Instagram. At individual levels, it might be hard to catch the one person that's like...
commenting on a, you know, Facebook group, trying to start some stuff, you know, cause that's probably indistinguishable from a human, but we'll really look for like networks of coordinated activity.
commenting on a, you know, Facebook group, trying to start some stuff, you know, cause that's probably indistinguishable from a human, but we'll really look for like networks of coordinated activity.
commenting on a, you know, Facebook group, trying to start some stuff, you know, cause that's probably indistinguishable from a human, but we'll really look for like networks of coordinated activity.