Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Dylan Patel

๐Ÿ‘ค Speaker
See mentions of this person in podcasts
3551 total appearances

Appearances Over Time

Podcast Appearances

Lex Fridman Podcast
#459 โ€“ DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

And the idea with Monte Carlo tree search is that you would take an intermediate point in that train, do some sort of expansion, spend more compute, and then select the right one. That's like a very complex form of search that has been used in things like Mu0 and Alpha0 potentially. I know Mu0 does this.

Lex Fridman Podcast
#459 โ€“ DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

And the idea with Monte Carlo tree search is that you would take an intermediate point in that train, do some sort of expansion, spend more compute, and then select the right one. That's like a very complex form of search that has been used in things like Mu0 and Alpha0 potentially. I know Mu0 does this.

Lex Fridman Podcast
#459 โ€“ DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

There are many extensions to this. I would say the simplest one is that our language models to date have been designed to give the right answer the highest percentage of the time in one response. And we are now opening the door to different ways of running inference on our models in which we need to reevaluate many parts of the training process.

Lex Fridman Podcast
#459 โ€“ DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

There are many extensions to this. I would say the simplest one is that our language models to date have been designed to give the right answer the highest percentage of the time in one response. And we are now opening the door to different ways of running inference on our models in which we need to reevaluate many parts of the training process.

Lex Fridman Podcast
#459 โ€“ DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

There are many extensions to this. I would say the simplest one is that our language models to date have been designed to give the right answer the highest percentage of the time in one response. And we are now opening the door to different ways of running inference on our models in which we need to reevaluate many parts of the training process.

Lex Fridman Podcast
#459 โ€“ DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

which normally opens the door to more progress, but we don't know if OpenAI changed a lot or if just sampling more and multiple choice is what they're doing or if it's something more complex where they change the training and they know that the inference mode is going to be different.

Lex Fridman Podcast
#459 โ€“ DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

which normally opens the door to more progress, but we don't know if OpenAI changed a lot or if just sampling more and multiple choice is what they're doing or if it's something more complex where they change the training and they know that the inference mode is going to be different.

Lex Fridman Podcast
#459 โ€“ DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

which normally opens the door to more progress, but we don't know if OpenAI changed a lot or if just sampling more and multiple choice is what they're doing or if it's something more complex where they change the training and they know that the inference mode is going to be different.

Lex Fridman Podcast
#459 โ€“ DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

We are both NVIDIA bulls here, I would say. And in some ways, the market response is reasonable. NVIDIA's biggest customers in the US are major tech companies, and they're spending a ton on AI. And a simple interpretation of DeepSeek is you can get really good models without spending as much on AI.

Lex Fridman Podcast
#459 โ€“ DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

We are both NVIDIA bulls here, I would say. And in some ways, the market response is reasonable. NVIDIA's biggest customers in the US are major tech companies, and they're spending a ton on AI. And a simple interpretation of DeepSeek is you can get really good models without spending as much on AI.

Lex Fridman Podcast
#459 โ€“ DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

We are both NVIDIA bulls here, I would say. And in some ways, the market response is reasonable. NVIDIA's biggest customers in the US are major tech companies, and they're spending a ton on AI. And a simple interpretation of DeepSeek is you can get really good models without spending as much on AI.

Lex Fridman Podcast
#459 โ€“ DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

So in that capacity, it's like, oh, maybe these big tech companies won't need to spend as much on AI and go down. The actual thing that happened, it's much more complex where there's social factors, where there's the rising in the app store, the social contagion that is happening. And then I think a lot of some of it is just like, I don't trade. I don't know anything about financial markets.

Lex Fridman Podcast
#459 โ€“ DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

So in that capacity, it's like, oh, maybe these big tech companies won't need to spend as much on AI and go down. The actual thing that happened, it's much more complex where there's social factors, where there's the rising in the app store, the social contagion that is happening. And then I think a lot of some of it is just like, I don't trade. I don't know anything about financial markets.

Lex Fridman Podcast
#459 โ€“ DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

So in that capacity, it's like, oh, maybe these big tech companies won't need to spend as much on AI and go down. The actual thing that happened, it's much more complex where there's social factors, where there's the rising in the app store, the social contagion that is happening. And then I think a lot of some of it is just like, I don't trade. I don't know anything about financial markets.

Lex Fridman Podcast
#459 โ€“ DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

But it builds up over the weekend or the social pressure where it's like if it was during the week and there was multiple days of trading when this was really becoming, but it comes on the weekend and then everybody wants to sell. Yeah. And that is a social contagion.

Lex Fridman Podcast
#459 โ€“ DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

But it builds up over the weekend or the social pressure where it's like if it was during the week and there was multiple days of trading when this was really becoming, but it comes on the weekend and then everybody wants to sell. Yeah. And that is a social contagion.

Lex Fridman Podcast
#459 โ€“ DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

But it builds up over the weekend or the social pressure where it's like if it was during the week and there was multiple days of trading when this was really becoming, but it comes on the weekend and then everybody wants to sell. Yeah. And that is a social contagion.

Lex Fridman Podcast
#459 โ€“ DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

We were trying to get GPUs on a short notice this week for a demo and it wasn't that easy. We were trying to get just like 16 or 32 H100s for a demo and it was not very easy.

Lex Fridman Podcast
#459 โ€“ DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

We were trying to get GPUs on a short notice this week for a demo and it wasn't that easy. We were trying to get just like 16 or 32 H100s for a demo and it was not very easy.

Lex Fridman Podcast
#459 โ€“ DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

We were trying to get GPUs on a short notice this week for a demo and it wasn't that easy. We were trying to get just like 16 or 32 H100s for a demo and it was not very easy.