Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Steve Hsu

๐Ÿ‘ค Speaker
See mentions of this person in podcasts
318 total appearances

Appearances Over Time

Podcast Appearances

Azeem Azhar's Exponential View
The difference between early and late AI adopters

And the way we program, we use sort of old-style programming in this platform.

Azeem Azhar's Exponential View
The difference between early and late AI adopters

We sort of force the model to only use that knowledge base in answering the fact part of questions.

Azeem Azhar's Exponential View
The difference between early and late AI adopters

And so that, that extra constraint, it solves the hallucination problem and makes both the behavior and the knowledge base of the AI reliable.

Azeem Azhar's Exponential View
The difference between early and late AI adopters

So if you look at the architecture, it is actually a piece of it is RAG.

Azeem Azhar's Exponential View
The difference between early and late AI adopters

And interestingly, we actually, when we founded the startup, we actually filed a patent.

Azeem Azhar's Exponential View
The difference between early and late AI adopters

The company filed a patent on our architecture.

Azeem Azhar's Exponential View
The difference between early and late AI adopters

And that was actually before the word RAG was in wide usage.

Azeem Azhar's Exponential View
The difference between early and late AI adopters

So it is possible, who knows how the USPTO, Patent and Trademark Office, operates, but we might be issued a patent on RAG.

Azeem Azhar's Exponential View
The difference between early and late AI adopters

Of course, it's not referred to RAG in the patent filing.

Azeem Azhar's Exponential View
The difference between early and late AI adopters

It has a lot of similarity.

Azeem Azhar's Exponential View
The difference between early and late AI adopters

Another thing you might do is you might have multiple models involved in the generation of the response in which some models are just error-checking

Azeem Azhar's Exponential View
The difference between early and late AI adopters

the proposed response of the big model against what the little models can see in the knowledge base.

Azeem Azhar's Exponential View
The difference between early and late AI adopters

And all that, if you're doing voice, which we do, all that has to happen in a latency time of less than two seconds.

Azeem Azhar's Exponential View
The difference between early and late AI adopters

So humans, if I stop speaking and I'm waiting for you to respond to me, if it goes more than a couple seconds, it's kind of strange.

Azeem Azhar's Exponential View
The difference between early and late AI adopters

And so all of that stuff that I just described to you is engineered down so that the latency is between one and two seconds, so it sounds natural.

Azeem Azhar's Exponential View
The difference between early and late AI adopters

Well, it is improved.

Azeem Azhar's Exponential View
The difference between early and late AI adopters

The situation is improved if you're using a model that has reasoning capabilities.

Azeem Azhar's Exponential View
The difference between early and late AI adopters

What's going on in the reasoning is the model has been taught as it sort of talks to itself in trying to solve, generate a good response to your query.

Azeem Azhar's Exponential View
The difference between early and late AI adopters

It has been taught to double check facts or components of the reasoning.

Azeem Azhar's Exponential View
The difference between early and late AI adopters

However, if the model doesn't really have access to the actual ground truth, it can still go off the rails because it can think X is true.