Dylan Patel

#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

hell yeah that's that's what i'm saying and yeah and that's why it's like we want this whole open language models thing the olmo thing is to try to keep the model where everything is open with the data as close to the frontier as possible so we're compute constrained we're personnel constrained we're

17712.391 View full episode →

Lex Fridman Podcast

#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

hell yeah that's that's what i'm saying and yeah and that's why it's like we want this whole open language models thing the olmo thing is to try to keep the model where everything is open with the data as close to the frontier as possible so we're compute constrained we're personnel constrained we're

17712.391 View full episode →

Lex Fridman Podcast

#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

We rely on getting insights from people like John Schulman tells us to do RL on outputs like we can make these big jumps, but it just takes a long time to push the frontier of open source. And fundamentally, I would say that that's because open source AI does not have the same feedback loops as open source software. We talked about open source software for security.

17729.57 View full episode →

Lex Fridman Podcast

#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

We rely on getting insights from people like John Schulman tells us to do RL on outputs like we can make these big jumps, but it just takes a long time to push the frontier of open source. And fundamentally, I would say that that's because open source AI does not have the same feedback loops as open source software. We talked about open source software for security.

17729.57 View full episode →

Lex Fridman Podcast

#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

We rely on getting insights from people like John Schulman tells us to do RL on outputs like we can make these big jumps, but it just takes a long time to push the frontier of open source. And fundamentally, I would say that that's because open source AI does not have the same feedback loops as open source software. We talked about open source software for security.

17729.57 View full episode →

Lex Fridman Podcast

#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

Also, it's just because you build something once and you can reuse it. If you go into a new company, there's so many benefits. But if you open source a language model, you have this data sitting around, you have this training code. It's not like that easy for someone to come and build on and improve because you need to spend a lot on compute. You need to have expertise.

17748.949 View full episode →

Lex Fridman Podcast

#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

Also, it's just because you build something once and you can reuse it. If you go into a new company, there's so many benefits. But if you open source a language model, you have this data sitting around, you have this training code. It's not like that easy for someone to come and build on and improve because you need to spend a lot on compute. You need to have expertise.

17748.949 View full episode →

Lex Fridman Podcast

#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

Also, it's just because you build something once and you can reuse it. If you go into a new company, there's so many benefits. But if you open source a language model, you have this data sitting around, you have this training code. It's not like that easy for someone to come and build on and improve because you need to spend a lot on compute. You need to have expertise.

17748.949 View full episode →

Lex Fridman Podcast

#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

So until there are feedback loops of open source AI, it seems like mostly an ideological mission. Like people like Mark Zuckerberg, which is like America needs this. And I agree with him, but In the time where the motivation ideologically is high, we need to capitalize and build this ecosystem around what benefits do you get from seeing the language model data. And there's not a lot about that.

17766.496 View full episode →

Lex Fridman Podcast

#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

So until there are feedback loops of open source AI, it seems like mostly an ideological mission. Like people like Mark Zuckerberg, which is like America needs this. And I agree with him, but In the time where the motivation ideologically is high, we need to capitalize and build this ecosystem around what benefits do you get from seeing the language model data. And there's not a lot about that.

17766.496 View full episode →

Lex Fridman Podcast

#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

So until there are feedback loops of open source AI, it seems like mostly an ideological mission. Like people like Mark Zuckerberg, which is like America needs this. And I agree with him, but In the time where the motivation ideologically is high, we need to capitalize and build this ecosystem around what benefits do you get from seeing the language model data. And there's not a lot about that.

17766.496 View full episode →

Lex Fridman Podcast

#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

We're going to try to launch a demo soon where you can look at an OMO model and a query and see what pre-training data is similar to it, which is like legally risky and complicated, but it's like... what does it mean to see the data that the AI was trained on? It's hard to parse. It's terabytes of files. It's like, I don't know what I'm going to find in there.

17788.892 View full episode →

Lex Fridman Podcast

#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

We're going to try to launch a demo soon where you can look at an OMO model and a query and see what pre-training data is similar to it, which is like legally risky and complicated, but it's like... what does it mean to see the data that the AI was trained on? It's hard to parse. It's terabytes of files. It's like, I don't know what I'm going to find in there.

17788.892 View full episode →

Lex Fridman Podcast

#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

We're going to try to launch a demo soon where you can look at an OMO model and a query and see what pre-training data is similar to it, which is like legally risky and complicated, but it's like... what does it mean to see the data that the AI was trained on? It's hard to parse. It's terabytes of files. It's like, I don't know what I'm going to find in there.

17788.892 View full episode →

Lex Fridman Podcast

#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

But that's what we need to do as an ecosystem if people want open source AI to be financially useful.

17808.788 View full episode →

Lex Fridman Podcast

#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

But that's what we need to do as an ecosystem if people want open source AI to be financially useful.

17808.788 View full episode →

Lex Fridman Podcast

#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

But that's what we need to do as an ecosystem if people want open source AI to be financially useful.

17808.788 View full episode →

Lex Fridman Podcast

#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

I had a while to think about this while listening to Dylan's beautiful response. He didn't listen to me. He was so dumb. No, I knew this was coming. And it's like, realistically, training models is very fun because there's so much low-hanging fruit. And the thing that makes my job entertaining, I train models. I write analysis about what's happening with models.

18541.499 View full episode →

Lex Fridman Podcast

#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

I had a while to think about this while listening to Dylan's beautiful response. He didn't listen to me. He was so dumb. No, I knew this was coming. And it's like, realistically, training models is very fun because there's so much low-hanging fruit. And the thing that makes my job entertaining, I train models. I write analysis about what's happening with models.

18541.499 View full episode →

Lex Fridman Podcast

#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

I had a while to think about this while listening to Dylan's beautiful response. He didn't listen to me. He was so dumb. No, I knew this was coming. And it's like, realistically, training models is very fun because there's so much low-hanging fruit. And the thing that makes my job entertaining, I train models. I write analysis about what's happening with models.

18541.499 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment