Joscha Bach
๐ค SpeakerAppearances Over Time
Podcast Appearances
Yeah. And of course, we can combine the language model with models that get coupled to reality in real time and can build multimodal model and bridge between vision models and language models and so on. So there is no reason to believe that the language models will necessarily run into some problem that will prevent them from becoming generally intelligent. But I don't know that.
Yeah. And of course, we can combine the language model with models that get coupled to reality in real time and can build multimodal model and bridge between vision models and language models and so on. So there is no reason to believe that the language models will necessarily run into some problem that will prevent them from becoming generally intelligent. But I don't know that.
It's just I don't see proof that they wouldn't. My issue is I don't like them. I think that they're inefficient. I think that they use way too much compute. I think that given the amazing hardware that we have, we could build something that is much more beautiful than our own mind. And this thing is not as beautiful as our own mind, despite being so much larger.
It's just I don't see proof that they wouldn't. My issue is I don't like them. I think that they're inefficient. I think that they use way too much compute. I think that given the amazing hardware that we have, we could build something that is much more beautiful than our own mind. And this thing is not as beautiful as our own mind, despite being so much larger.
It's just I don't see proof that they wouldn't. My issue is I don't like them. I think that they're inefficient. I think that they use way too much compute. I think that given the amazing hardware that we have, we could build something that is much more beautiful than our own mind. And this thing is not as beautiful as our own mind, despite being so much larger.
It's the only thing that works right now. So it's not the only game in town, but it's the only thing that has this utility with so much simplicity. There's a bunch of relatively simple algorithms that you can understand in relatively few weeks that can be scaled up massively.
It's the only thing that works right now. So it's not the only game in town, but it's the only thing that has this utility with so much simplicity. There's a bunch of relatively simple algorithms that you can understand in relatively few weeks that can be scaled up massively.
It's the only thing that works right now. So it's not the only game in town, but it's the only thing that has this utility with so much simplicity. There's a bunch of relatively simple algorithms that you can understand in relatively few weeks that can be scaled up massively.
Yeah, Claude Shannon, when you described chess, suggested that there are two main strategies in which you could play chess. One is that you are making a very complicated plan that reaches far into the future and you try not to make a mistake while enacting it. And this is basically the human strategy.
Yeah, Claude Shannon, when you described chess, suggested that there are two main strategies in which you could play chess. One is that you are making a very complicated plan that reaches far into the future and you try not to make a mistake while enacting it. And this is basically the human strategy.
Yeah, Claude Shannon, when you described chess, suggested that there are two main strategies in which you could play chess. One is that you are making a very complicated plan that reaches far into the future and you try not to make a mistake while enacting it. And this is basically the human strategy.
And the other strategy is that you are brute forcing your way to success, which means you make a tree of possible moves where you look at, in principle, every move that is open to you or the possible answers. And you try to make this as deeply as possible.
And the other strategy is that you are brute forcing your way to success, which means you make a tree of possible moves where you look at, in principle, every move that is open to you or the possible answers. And you try to make this as deeply as possible.
And the other strategy is that you are brute forcing your way to success, which means you make a tree of possible moves where you look at, in principle, every move that is open to you or the possible answers. And you try to make this as deeply as possible.
Of course, you optimize, you cut off trees that don't look very promising, and you use libraries of endgame and earlygame and so on to optimize this entire process. But this brute force strategy is how most of the chess programs were built. And this is how computers get better than humans at playing chess. And I look at the large language models, I feel that I'm observing the same thing.
Of course, you optimize, you cut off trees that don't look very promising, and you use libraries of endgame and earlygame and so on to optimize this entire process. But this brute force strategy is how most of the chess programs were built. And this is how computers get better than humans at playing chess. And I look at the large language models, I feel that I'm observing the same thing.
Of course, you optimize, you cut off trees that don't look very promising, and you use libraries of endgame and earlygame and so on to optimize this entire process. But this brute force strategy is how most of the chess programs were built. And this is how computers get better than humans at playing chess. And I look at the large language models, I feel that I'm observing the same thing.
It's basically the brute force strategy to thought by training the thing on pretty much the entire internet and then in the limit it gets coherent to a degree that approaches human coherence. And on a side effect, it's able to do things that no human could do. It's able to sift through massive amounts of text relatively quickly and summarize them quickly and never lapses in attention and
It's basically the brute force strategy to thought by training the thing on pretty much the entire internet and then in the limit it gets coherent to a degree that approaches human coherence. And on a side effect, it's able to do things that no human could do. It's able to sift through massive amounts of text relatively quickly and summarize them quickly and never lapses in attention and
It's basically the brute force strategy to thought by training the thing on pretty much the entire internet and then in the limit it gets coherent to a degree that approaches human coherence. And on a side effect, it's able to do things that no human could do. It's able to sift through massive amounts of text relatively quickly and summarize them quickly and never lapses in attention and