Mike Krieger
👤 PersonAppearances Over Time
Podcast Appearances
Yeah, deeply. And I saw it in a week, too. So I'm a ship-oriented person. Even with Instagram, early days, it was like, let's not get bogged down in building the 50 features, let's build the two things well and get it out as soon as possible. Some of those decisions to ship a week earlier and not have every feature, I think were actually existential to the company. I feel that in my bones.
Yeah, deeply. And I saw it in a week, too. So I'm a ship-oriented person. Even with Instagram, early days, it was like, let's not get bogged down in building the 50 features, let's build the two things well and get it out as soon as possible. Some of those decisions to ship a week earlier and not have every feature, I think were actually existential to the company. I feel that in my bones.
Yeah, deeply. And I saw it in a week, too. So I'm a ship-oriented person. Even with Instagram, early days, it was like, let's not get bogged down in building the 50 features, let's build the two things well and get it out as soon as possible. Some of those decisions to ship a week earlier and not have every feature, I think were actually existential to the company. I feel that in my bones.
Week two I was here, our research team put out a paper on interpretability of our models and kind of buried in the paper was this idea that they found a feature inside one of the models that if amplified would make Claude believe it was the Golden Gate Bridge.
Week two I was here, our research team put out a paper on interpretability of our models and kind of buried in the paper was this idea that they found a feature inside one of the models that if amplified would make Claude believe it was the Golden Gate Bridge.
Week two I was here, our research team put out a paper on interpretability of our models and kind of buried in the paper was this idea that they found a feature inside one of the models that if amplified would make Claude believe it was the Golden Gate Bridge.
Not just like kind of believe it, like prompt it, like, hey, you're the Golden Gate Bridge, but like deeply, like in the way that my five-year-old- We'll make everything about turtles. It made everything about the Golden Gate Bridge. How are you today? I'm feeling great. I'm feeling international orange. And I'm feeling in the foggy cloud of San Francisco.
Not just like kind of believe it, like prompt it, like, hey, you're the Golden Gate Bridge, but like deeply, like in the way that my five-year-old- We'll make everything about turtles. It made everything about the Golden Gate Bridge. How are you today? I'm feeling great. I'm feeling international orange. And I'm feeling in the foggy cloud of San Francisco.
Not just like kind of believe it, like prompt it, like, hey, you're the Golden Gate Bridge, but like deeply, like in the way that my five-year-old- We'll make everything about turtles. It made everything about the Golden Gate Bridge. How are you today? I'm feeling great. I'm feeling international orange. And I'm feeling in the foggy cloud of San Francisco.
And somebody in our Slack was like, hey, should we build and release Golden Gate Cloud? It was almost like an offhand comment. And a few of us were like, Absolutely, yes. For two reasons. One, this was actually quite fun. But two, getting people to actually have some firsthand contact with what a model that has had some of its parameters tuned, we thought was valuable.
And somebody in our Slack was like, hey, should we build and release Golden Gate Cloud? It was almost like an offhand comment. And a few of us were like, Absolutely, yes. For two reasons. One, this was actually quite fun. But two, getting people to actually have some firsthand contact with what a model that has had some of its parameters tuned, we thought was valuable.
And somebody in our Slack was like, hey, should we build and release Golden Gate Cloud? It was almost like an offhand comment. And a few of us were like, Absolutely, yes. For two reasons. One, this was actually quite fun. But two, getting people to actually have some firsthand contact with what a model that has had some of its parameters tuned, we thought was valuable.
So from that IRC message to having GoldenGate clawed out on the website was, I think, basically 24 hours. And in that time, we had to do some product engineering, some model work. But we also ran through a whole battery of safety evals. And I think that was just an interesting piece where you can move quickly, and not every time can you do only a 24-hour safety valve.
So from that IRC message to having GoldenGate clawed out on the website was, I think, basically 24 hours. And in that time, we had to do some product engineering, some model work. But we also ran through a whole battery of safety evals. And I think that was just an interesting piece where you can move quickly, and not every time can you do only a 24-hour safety valve.
So from that IRC message to having GoldenGate clawed out on the website was, I think, basically 24 hours. And in that time, we had to do some product engineering, some model work. But we also ran through a whole battery of safety evals. And I think that was just an interesting piece where you can move quickly, and not every time can you do only a 24-hour safety valve.
There's lengthier ones for new models. This one was a derivative, so it was easier. But the fact that that wasn't even a question like, wait, should we run safety valves? No, absolutely. That's what we do before we launch models. We make sure that it's both safe from the things that we know about, and let's also model out what are some novel harms.
There's lengthier ones for new models. This one was a derivative, so it was easier. But the fact that that wasn't even a question like, wait, should we run safety valves? No, absolutely. That's what we do before we launch models. We make sure that it's both safe from the things that we know about, and let's also model out what are some novel harms.
There's lengthier ones for new models. This one was a derivative, so it was easier. But the fact that that wasn't even a question like, wait, should we run safety valves? No, absolutely. That's what we do before we launch models. We make sure that it's both safe from the things that we know about, and let's also model out what are some novel harms.
The bridge is unfortunately associated with suicides. Like, let's make sure that the model doesn't guide people in that direction. And if it does, let's put in the right safeguards. So that's kind of a, like, trivial example because it was like an Easter egg we shipped for basically two days and then wound down. But it was, like, very much at its core there.
The bridge is unfortunately associated with suicides. Like, let's make sure that the model doesn't guide people in that direction. And if it does, let's put in the right safeguards. So that's kind of a, like, trivial example because it was like an Easter egg we shipped for basically two days and then wound down. But it was, like, very much at its core there.