Andy Halliday

Speaker
3893 total appearances

Podcast Appearances

The Daily AI Show
From DeepSeek to Desktop Agents

the major companies or using the free version or paying roughly $20 a month to get access to those models.

Whereas DeepSeek is less expensive as an open source model.

And it is used more,

Two to four times higher across the African continent than the other ones.

And a Chinese company, Huawei, which has a lot of the telephone infrastructure, mobile phone infrastructure in those developing countries.

has also partnered with DeepSeek to advance the use of DeepSeek in those countries.

Now, I'm weaving over to something about DeepSeek on the technical side.

So take notes.

There'll be a quiz on this afterward.

DeepSeek has just introduced a new technique in LLM inference that's advancing its capability in pure reasoning in a dramatic way.

So, you know, the Chinese companies have been starved of the scaling compute capabilities that are available if you can acquire top-end data center infrastructure like the NVIDIA Blackwell chips and so on, so they've innovated around efficiencies along two different dimensions, and

I'll circle back to this, but one of those two dimensions is the use of sparsity.

Now, sparsity is the opposite of dense in the terminology of AI.

Dense means that you're using every layer of the network in each inference run.

That's a dense, deep neural network.

And sparsity means you're only activating certain portions of it.

So if you have a 100-billion-parameter model, any one inference run is dynamically assessing which portions of that deep neural network, the LLM, which layers of it, have to be activated.

And this has given rise to the primary architecture for LLMs today, which is called mixture of experts.

So the only experts that are activated in this context are the ones which are relevant to the query.
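
The routing idea described here can be sketched in a few lines. This is a toy illustration, not DeepSeek's implementation: the experts, the router scores, and the function names `route_top_k` and `moe_forward` are all hypothetical. In common mixture-of-experts designs the router picks a few expert sub-networks per token inside a layer, rather than skipping whole layers, and that is what the sketch assumes.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def route_top_k(router_logits, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts."""
    top = sorted(range(len(router_logits)),
                 key=lambda i: router_logits[i], reverse=True)[:k]
    # Renormalize the weights over the selected experts only.
    weights = softmax([router_logits[i] for i in top])
    return list(zip(top, weights))

def moe_forward(x, experts, router_logits, k=2):
    """Run only the selected experts and mix their outputs by router weight."""
    selected = route_top_k(router_logits, k)
    output = sum(w * experts[i](x) for i, w in selected)
    return output, [i for i, _ in selected]

# Hypothetical toy experts: each one just scales its input.
experts = [lambda x, s=s: s * x for s in (1.0, 2.0, 3.0, 4.0)]
router_logits = [0.1, 2.0, -1.0, 1.5]  # made-up router scores for one token

output, used = moe_forward(3.0, experts, router_logits, k=2)
print(used)  # [1, 3]
```

Only experts 1 and 3 are evaluated for this input; the other two are never called, which is where the compute saving comes from.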

And that reduces the computational overhead, makes for a more efficient and effective inference run, reduces the cost in both energy and compute time, and allows for a larger context window to be executed.
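
To put rough numbers on that saving, here is a small back-of-the-envelope calculation using the 100-billion-parameter example from earlier. The split into shared and per-expert parameters, the expert count, and the number of active experts are invented for illustration and do not describe any real model.

```python
def active_parameters(shared, per_expert, num_experts, k):
    """Return (total, active) parameter counts for a toy mixture-of-experts model.

    shared      -- parameters every token uses (attention, embeddings, etc.)
    per_expert  -- parameters in one expert
    num_experts -- experts available
    k           -- experts activated per token
    """
    total = shared + per_expert * num_experts
    active = shared + per_expert * k
    return total, active

# Hypothetical configuration that adds up to 100 billion parameters.
total, active = active_parameters(
    shared=20_000_000_000,
    per_expert=10_000_000_000,
    num_experts=8,
    k=2,
)
print(f"{active / total:.0%} of parameters active per token")  # 40% ...
```

Under these assumed numbers, each token touches 40 billion of the 100 billion parameters, so the per-token compute is closer to that of a 40-billion-parameter dense model.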