Steve Hsu
And the way we program this platform, we use a sort of old-style approach: we force the model to answer the factual part of a question using only that knowledge base.
And that extra constraint solves the hallucination problem and makes both the behavior and the knowledge base of the AI reliable.
So if you look at the architecture, a piece of it is actually RAG.
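The constraint described above can be sketched as a minimal retrieval-constrained loop: retrieve passages from the knowledge base, and refuse to answer when nothing relevant is found rather than letting the model improvise. This is a toy illustration, not the company's actual implementation; the keyword retriever, the knowledge base contents, and the function names are all hypothetical stand-ins (a real system would use embedding search and an actual LLM call).

```python
# Toy sketch of retrieval-constrained ("RAG-style") answering.
# All facts and names here are hypothetical examples.

KNOWLEDGE_BASE = [
    "The platform was founded in 2020.",
    "The assistant supports voice interaction.",
]

def retrieve(query: str, kb: list[str]) -> list[str]:
    """Toy retriever: return KB passages sharing any word with the query."""
    words = set(query.lower().split())
    return [p for p in kb if words & set(p.lower().split())]

def build_prompt(query: str, passages: list[str]) -> str:
    """Constrain the model: answer ONLY from the retrieved passages."""
    context = "\n".join(f"- {p}" for p in passages)
    return (
        "Answer using ONLY the passages below. "
        "If they do not contain the answer, say 'I don't know.'\n"
        f"Passages:\n{context}\n\nQuestion: {query}"
    )

def answer(query: str, kb: list[str]) -> str:
    passages = retrieve(query, kb)
    if not passages:
        return "I don't know."  # refuse rather than hallucinate
    prompt = build_prompt(query, passages)
    # A real system would call the LLM with `prompt` here;
    # this sketch just returns the top grounding passage.
    return passages[0]
```

The key design point is the empty-retrieval branch: with no supporting passage, the system declines instead of generating an unsupported answer.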
And interestingly, when we founded the startup, the company filed a patent on our architecture.
And that was before the term RAG was in wide usage.
So it's possible, who knows how the USPTO, the Patent and Trademark Office, operates, that we might be issued a patent on RAG.
Of course, it's not referred to as RAG in the patent filing, but it has a lot of similarity.
Another thing you might do is have multiple models involved in generating the response, where some models just error-check the big model's proposed response against what those little models can see in the knowledge base.
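That checker pattern can be sketched as follows: small verifier models vet each claim in the big model's draft against the knowledge base, and the draft is rejected if any claim is unsupported. This is a hedged illustration, not the actual architecture; the checker here is a trivial lookup standing in for a small verifier model, and the knowledge base entries are invented.

```python
# Sketch of the multi-model pattern: small "checker" models validate each
# claim in a draft response against the knowledge base. Hypothetical data.

KNOWLEDGE_BASE = {
    "the office opens at 9am",
    "support is available by phone",
}

def checker(claim: str, kb: set[str]) -> bool:
    """Stand-in for a small verifier model: is this claim supported?"""
    return claim.lower() in kb

def verify_response(draft_claims: list[str], kb: set[str]) -> bool:
    """Accept the big model's draft only if every claim checks out."""
    return all(checker(claim, kb) for claim in draft_claims)
```

In a real system each `checker` call would itself be a model invocation, which is why latency, discussed next, becomes a serious engineering constraint.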
And if you're doing voice, which we do, all of that has to happen with a latency of less than two seconds.
Because for humans, if I stop speaking and I'm waiting for you to respond, and it takes more than a couple of seconds, it feels strange.
So all of the stuff I just described is engineered down so that the latency is between one and two seconds, so it sounds natural.
Well, the situation is improved if you're using a model that has reasoning capabilities.
What's going on in the reasoning is that the model, as it talks to itself while generating a response to your query, has been taught to double-check facts or components of the reasoning.
However, if the model doesn't have access to the actual ground truth, it can still go off the rails, because it can think X is true when it isn't.
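The failure mode described above can be made concrete: a model that double-checks a fact against its own internal beliefs will happily confirm its own error, whereas checking against an external knowledge base catches it. The belief, the ground-truth entry, and the function names below are all invented for illustration.

```python
# Why self-checking without ground truth can fail: the model's "double-check"
# consults the same wrong belief that produced the claim. Hypothetical data.

MODEL_BELIEFS = {"capital_of_australia": "Sydney"}    # wrong internal belief
GROUND_TRUTH  = {"capital_of_australia": "Canberra"}  # external knowledge base

def self_check(key: str, value: str) -> bool:
    """The model 'verifies' the claim against its own internal belief."""
    return MODEL_BELIEFS.get(key) == value

def grounded_check(key: str, value: str) -> bool:
    """Verify the claim against the external knowledge base instead."""
    return GROUND_TRUTH.get(key) == value

claim = ("capital_of_australia", "Sydney")
# self_check confirms the error; grounded_check rejects it.
```

This is the sense in which reasoning helps but is not sufficient: the double-checking loop is only as good as the source of truth it consults.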