Gerard Cole
๐ค SpeakerAppearances Over Time
Podcast Appearances
You can't really have the dialogue work very well between two people. You can't really make that consistent. And so when you watch with an eye for the technical constraints, you can really see like, oh, yeah, they kind of had to make something that was like this.
You can't really have the dialogue work very well between two people. You can't really make that consistent. And so when you watch with an eye for the technical constraints, you can really see like, oh, yeah, they kind of had to make something that was like this.
Yeah, no, and I'll take you through as simply as I can, but it is pretty complicated. So we decided we wanted to have two characters, me, and I exist in real life, and this robot, which does not exist in real life. And so we created these digital versions of the characters. The robot named Max, or OptiMax 5000, we created using an AI image generator called MidJourney.
Yeah, no, and I'll take you through as simply as I can, but it is pretty complicated. So we decided we wanted to have two characters, me, and I exist in real life, and this robot, which does not exist in real life. And so we created these digital versions of the characters. The robot named Max, or OptiMax 5000, we created using an AI image generator called MidJourney.
Yeah, no, and I'll take you through as simply as I can, but it is pretty complicated. So we decided we wanted to have two characters, me, and I exist in real life, and this robot, which does not exist in real life. And so we created these digital versions of the characters. The robot named Max, or OptiMax 5000, we created using an AI image generator called MidJourney.
And so we kind of iterated in that. We worked through, okay, what does he look like? What does he look like? And so we finally landed on some images we liked. As for me, I took a bunch of photos of myself, different angles. And so then we went into Runway, which is an AI video generation tool, and we uploaded those photos.
And so we kind of iterated in that. We worked through, okay, what does he look like? What does he look like? And so we finally landed on some images we liked. As for me, I took a bunch of photos of myself, different angles. And so then we went into Runway, which is an AI video generation tool, and we uploaded those photos.
And so we kind of iterated in that. We worked through, okay, what does he look like? What does he look like? And so we finally landed on some images we liked. As for me, I took a bunch of photos of myself, different angles. And so then we went into Runway, which is an AI video generation tool, and we uploaded those photos.
And then we said, OK, create a scene where you see the robot working out alongside Joanna and make it in a suburban background with houses on a paved street. And so then the runway would spit out what we would call the first frame of that. And so we'd have an image, and then we would take that image, and we'd put it into VO, Google's tool, and say what we wanted the motion to look like.
And then we said, OK, create a scene where you see the robot working out alongside Joanna and make it in a suburban background with houses on a paved street. And so then the runway would spit out what we would call the first frame of that. And so we'd have an image, and then we would take that image, and we'd put it into VO, Google's tool, and say what we wanted the motion to look like.
And then we said, OK, create a scene where you see the robot working out alongside Joanna and make it in a suburban background with houses on a paved street. And so then the runway would spit out what we would call the first frame of that. And so we'd have an image, and then we would take that image, and we'd put it into VO, Google's tool, and say what we wanted the motion to look like.
And here's where things got really complicated, and Gerard really did a lot of this work. But... you really have to give the model very specific instructions on what you want to be done. And so he worked alongside Google's Gemini, which is their large language model, to really craft detailed prompts of what we wanted the videos to look like.
And here's where things got really complicated, and Gerard really did a lot of this work. But... you really have to give the model very specific instructions on what you want to be done. And so he worked alongside Google's Gemini, which is their large language model, to really craft detailed prompts of what we wanted the videos to look like.
And here's where things got really complicated, and Gerard really did a lot of this work. But... you really have to give the model very specific instructions on what you want to be done. And so he worked alongside Google's Gemini, which is their large language model, to really craft detailed prompts of what we wanted the videos to look like.
And so these were long texts, like hundreds of words that you would put in with the photo and the text into Google VO, tell it what we'd want it, and out we would get a bunch of videos. And we'd pick from those videos what would look the best for the scene.
And so these were long texts, like hundreds of words that you would put in with the photo and the text into Google VO, tell it what we'd want it, and out we would get a bunch of videos. And we'd pick from those videos what would look the best for the scene.
And so these were long texts, like hundreds of words that you would put in with the photo and the text into Google VO, tell it what we'd want it, and out we would get a bunch of videos. And we'd pick from those videos what would look the best for the scene.
What was crazy was how mixed the reviews were. A lot of people wrote in saying they were blown away and they could not believe how real it looked. They laughed because we played a lot of bloopers. So there was a lot of people that really enjoyed watching this.
What was crazy was how mixed the reviews were. A lot of people wrote in saying they were blown away and they could not believe how real it looked. They laughed because we played a lot of bloopers. So there was a lot of people that really enjoyed watching this.
What was crazy was how mixed the reviews were. A lot of people wrote in saying they were blown away and they could not believe how real it looked. They laughed because we played a lot of bloopers. So there was a lot of people that really enjoyed watching this.