Andy Halliday
So all of those things are improving on the efficiency scale.
The second dimension has to do with memory.
And this is where the new DeepSeek technique comes in.
We know that a model left as a plain dense model, injected only with your prompt and whatever additional context you type in at inference time, can be subject to hallucinations.
And so we like to ground that with retrieval-augmented generation, where you have an external memory, a database that gets referenced for context.
And semantic relevance is used to selectively retrieve the relevant pieces of the grounding data held in that retrieval-augmented generation store, typically a vector database, in order to achieve that semantic retrieval.
So that's this outboard memory that's used to inform the inference process in an efficient way.
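To make that retrieval step concrete, here is a minimal sketch of semantic retrieval over an in-memory vector store. The embedding function, document set, and top-k rule are illustrative assumptions on my part, not the specific stack being described; a real system would use a learned text-embedding model and a proper vector database.

```python
import numpy as np

def embed(text: str, dim: int = 64) -> np.ndarray:
    # Placeholder embedding: a deterministic hash-seeded vector stands in for a
    # learned embedding model. Illustrative only; not semantically meaningful.
    rng = np.random.default_rng(abs(hash(text)) % (2**32))
    vec = rng.standard_normal(dim)
    return vec / np.linalg.norm(vec)

# "Vector database": grounding documents stored alongside their embeddings.
documents = [
    "The company was founded in 2011.",
    "The product supports offline mode.",
    "Refunds are processed within 14 days.",
]
doc_vectors = np.stack([embed(d) for d in documents])

def retrieve(query: str, k: int = 2) -> list[str]:
    """Return the k documents most semantically similar to the query."""
    q = embed(query)
    scores = doc_vectors @ q                 # cosine similarity (unit-norm vectors)
    top = np.argsort(scores)[::-1][:k]
    return [documents[i] for i in top]

# The retrieved passages are prepended to the prompt to ground the model.
context = "\n".join(retrieve("When was the company started?"))
prompt = f"Context:\n{context}\n\nQuestion: When was the company started?"
print(prompt)
```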
So those are the two big dimensions of advancement.
One has to do with sparsity and the other has to do with memory.
Okay, so memory being like external caching: placement, and then retrieval, of static knowledge that doesn't really change and isn't subject to reasoning and manipulation by the computational process during inference.
Okay, so what did DeepSeek do?
DeepSeek introduced this thing called Engram.
It's a novel module that's added to their LLM that provides conditional memory.
And here's the jargon.
It's a complementary axis of sparsity that adds to the conditional-computation paradigm of mixture-of-experts models in large language models.
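For context on the mixture-of-experts side of that sentence, here is a minimal sketch of conditional computation: a router scores the experts for each token and only the top-k experts actually run. The dimensions, expert count, and routing rule are illustrative assumptions, not any particular model's configuration.

```python
import numpy as np

rng = np.random.default_rng(0)
d_model, n_experts, top_k = 16, 8, 2

# Each "expert" is a small feed-forward block; here, just a weight matrix.
experts = [rng.standard_normal((d_model, d_model)) * 0.1 for _ in range(n_experts)]
router_w = rng.standard_normal((d_model, n_experts)) * 0.1

def moe_layer(token: np.ndarray) -> np.ndarray:
    """Conditional computation: only the top-k scored experts run for this token."""
    logits = token @ router_w
    chosen = np.argsort(logits)[::-1][:top_k]                         # selected experts
    weights = np.exp(logits[chosen]) / np.exp(logits[chosen]).sum()   # softmax over chosen
    # Sparse activation: the remaining n_experts - top_k experts are never evaluated.
    return sum(w * (token @ experts[i]) for w, i in zip(weights, chosen))

out = moe_layer(rng.standard_normal(d_model))
print(out.shape)  # (16,)
```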
So what is this doing?
What it's doing is efficiently identifying the things in the input context that are static knowledge and putting those into, in effect, a file, a sort of scratchpad memory.
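Here is a minimal sketch of that idea under my own assumptions, not DeepSeek's actual design: static spans in the input are matched against a fixed n-gram lookup table and served as precomputed embeddings, so the transformer layers don't have to recompute that knowledge. The table contents and lookup rule below are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
d_model = 16

# Hypothetical conditional-memory table: fixed embeddings for known static n-grams.
# This stands in for the lookup module; the real design will differ in detail.
memory_table = {
    ("eiffel", "tower"): rng.standard_normal(d_model),
    ("speed", "of", "light"): rng.standard_normal(d_model),
}

def conditional_memory(tokens: list[str], max_n: int = 3) -> list[np.ndarray | None]:
    """For each position, return a looked-up embedding if a known static n-gram ends there."""
    hits: list[np.ndarray | None] = [None] * len(tokens)
    for i in range(len(tokens)):
        for n in range(max_n, 0, -1):
            key = tuple(tokens[i - n + 1:i + 1]) if i - n + 1 >= 0 else None
            if key in memory_table:
                hits[i] = memory_table[key]   # static knowledge served from memory
                break
    return hits

tokens = "the speed of light is constant".split()
hits = conditional_memory(tokens)
print([t for t, h in zip(tokens, hits) if h is not None])  # ['light'] -> matched span ends here
```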
And this then frees up.