Dylan Patel

Speaker
See mentions of this person in podcasts
3551 total appearances

Podcast Appearances

Invest Like the Best with Patrick O'Shaughnessy
Dylan Patel - Inside the Trillion-Dollar AI Buildout - [Invest Like the Best, EP.442]

Like, I'm very bad at those things, and thankfully I've, like, been able to surround myself in my life, whether it's through birth or not, with people who help me with the things I'm bad at, because I'm very bad at a lot of things.

Invest Like the Best with Patrick O'Shaughnessy
Dylan Patel - Inside the Trillion-Dollar AI Buildout - [Invest Like the Best, EP.442]

When I don't, like, call people or, like,

Invest Like the Best with Patrick O'Shaughnessy
Dylan Patel - Inside the Trillion-Dollar AI Buildout - [Invest Like the Best, EP.442]

be considerate of what they're thinking because I'm just vibing and I'm doing whatever, you know, I'm like focused in on like this path.

Invest Like the Best with Patrick O'Shaughnessy
Dylan Patel - Inside the Trillion-Dollar AI Buildout - [Invest Like the Best, EP.442]

That path ends up hurting someone else, right?

Invest Like the Best with Patrick O'Shaughnessy
Dylan Patel - Inside the Trillion-Dollar AI Buildout - [Invest Like the Best, EP.442]

Whether it's like, hey, I didn't call someone or I didn't like think about their feelings when I did an action or when I said something, but that makes me an asshole.

Invest Like the Best with Patrick O'Shaughnessy
Dylan Patel - Inside the Trillion-Dollar AI Buildout - [Invest Like the Best, EP.442]

And yes, I should be more conscious of this.

Invest Like the Best with Patrick O'Shaughnessy
Dylan Patel - Inside the Trillion-Dollar AI Buildout - [Invest Like the Best, EP.442]

And I try to be, but it's like, it's just one of the things I'm going to wrestle with in my life forever.

Invest Like the Best with Patrick O'Shaughnessy
Dylan Patel - Inside the Trillion-Dollar AI Buildout - [Invest Like the Best, EP.442]

And a lot of times I don't even realize I'm being a freaking idiot until my brother's like, you're a freaking idiot.

Invest Like the Best with Patrick O'Shaughnessy
Dylan Patel - Inside the Trillion-Dollar AI Buildout - [Invest Like the Best, EP.442]

That's the kindest thing anyone's ever done for me is like my brother through my whole life.

Invest Like the Best with Patrick O'Shaughnessy
Dylan Patel - Inside the Trillion-Dollar AI Buildout - [Invest Like the Best, EP.442]

Thank you so much.

Invest Like the Best with Patrick O'Shaughnessy
Dylan Patel - Inside the Trillion-Dollar AI Buildout - [Invest Like the Best, EP.442]

Yeah.

Lex Fridman Podcast
#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

Yeah, so DeepSeek-V3 is a new mixture-of-experts transformer language model from DeepSeek, who is based in China. They have some new specifics in the model that we'll get into. Largely, this is an open-weight model, and it's an instruction model like what you would use in ChatGPT. They also released what is called the base model, which is before these techniques of post-training.

Lex Fridman Podcast
#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

Most people use instruction models today, and those are what's served in all sorts of applications. This was released on, I believe, December 26th, or that week. And then weeks later, on January 20th, DeepSeek released DeepSeek-R1, which is a reasoning model, which really accelerated a lot of this discussion.

Lex Fridman Podcast
#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

This reasoning model has a lot of overlapping training steps with DeepSeek-V3, and it's confusing that you have a base model called V3 that you do something to in order to get a chat model, and then you do some different things to get a reasoning model. I think a lot of the AI industry is going through this challenge of communications right now, where OpenAI makes fun of their own naming schemes.