Pieter Levels
๐ค SpeakerAppearances Over Time
Podcast Appearances
So now I mix full body photos in the training with face photos, face crops. And it's all automatic. And I know that other people, they use, again, AI models to detect what are the best photos in this training set and then train on those. But it's all about training data and that's with everything in AI. Like how good your training data is,
So now I mix full body photos in the training with face photos, face crops. And it's all automatic. And I know that other people, they use, again, AI models to detect what are the best photos in this training set and then train on those. But it's all about training data and that's with everything in AI. Like how good your training data is,
So now I mix full body photos in the training with face photos, face crops. And it's all automatic. And I know that other people, they use, again, AI models to detect what are the best photos in this training set and then train on those. But it's all about training data and that's with everything in AI. Like how good your training data is,
is in many ways more important than how many steps you train for, like how many months or whatever with the GPUs, like the gold.
is in many ways more important than how many steps you train for, like how many months or whatever with the GPUs, like the gold.
is in many ways more important than how many steps you train for, like how many months or whatever with the GPUs, like the gold.
Like the photos should be diverse. So for example, if I only upload photos with a brown shirt or green shirt, the model will think that I'm training the green shirt. So the things that are the same every photo are the concepts that are trained. What you want is your face to be the concept that's trained. And everything else to be diverse, like different.
Like the photos should be diverse. So for example, if I only upload photos with a brown shirt or green shirt, the model will think that I'm training the green shirt. So the things that are the same every photo are the concepts that are trained. What you want is your face to be the concept that's trained. And everything else to be diverse, like different.
Like the photos should be diverse. So for example, if I only upload photos with a brown shirt or green shirt, the model will think that I'm training the green shirt. So the things that are the same every photo are the concepts that are trained. What you want is your face to be the concept that's trained. And everything else to be diverse, like different.
Yeah, outside, inside. But there's no like, this is the problem, there's no like manual for this. And nobody knew, we were all just, especially two years ago, we were all hacking, trying to test anything, anything you can think of. And it's frustrating. It's one of the most frustrating and also fun and challenging things to do because with AI, because... It's a black box.
Yeah, outside, inside. But there's no like, this is the problem, there's no like manual for this. And nobody knew, we were all just, especially two years ago, we were all hacking, trying to test anything, anything you can think of. And it's frustrating. It's one of the most frustrating and also fun and challenging things to do because with AI, because... It's a black box.
Yeah, outside, inside. But there's no like, this is the problem, there's no like manual for this. And nobody knew, we were all just, especially two years ago, we were all hacking, trying to test anything, anything you can think of. And it's frustrating. It's one of the most frustrating and also fun and challenging things to do because with AI, because... It's a black box.
And like Carpati, I think, says this. Like, we don't really know how this thing works, but it does something, but nobody really knows why, right? Like, we cannot look into the model of an LLM. Like, what is actually in there? We just know it's like a 3D matrix of numbers, right? So...
And like Carpati, I think, says this. Like, we don't really know how this thing works, but it does something, but nobody really knows why, right? Like, we cannot look into the model of an LLM. Like, what is actually in there? We just know it's like a 3D matrix of numbers, right? So...
And like Carpati, I think, says this. Like, we don't really know how this thing works, but it does something, but nobody really knows why, right? Like, we cannot look into the model of an LLM. Like, what is actually in there? We just know it's like a 3D matrix of numbers, right? So...
It's very frustrating because some things you think they're obvious that they will improve things will make them worse. And there's so many parameters you can tweak. So you're testing everything to improve things.
It's very frustrating because some things you think they're obvious that they will improve things will make them worse. And there's so many parameters you can tweak. So you're testing everything to improve things.
It's very frustrating because some things you think they're obvious that they will improve things will make them worse. And there's so many parameters you can tweak. So you're testing everything to improve things.
In a very vain way. Like me, you know? Like, I want to look good in your podcast, for example. Yeah, for sure.
In a very vain way. Like me, you know? Like, I want to look good in your podcast, for example. Yeah, for sure.