Damien Tanner
I mean, you can stream the input, but you need the complete thing,
the complete question, before you can make the request to the LLM and have it start generating a response, right?
There is no duplex LLM that takes input and generates output at the same time.
Technically.
Yeah, yeah, yeah.
So we can do that, because we have the partial transcripts.
We can stream you the partial transcripts and then say, okay, now it's done.
Now make the LLM call.
Then you make the LLM call.
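A minimal sketch of that flow, assuming hypothetical event names (`partial` / `final`) for the transcript stream rather than any real provider's API:

```typescript
// Sketch: buffer partial transcripts as they stream in, and only when
// the final transcript arrives, hand the complete question to the LLM.
// The event shape and the "partial"/"final" names are assumptions.
type TranscriptEvent =
  | { type: "partial"; text: string }
  | { type: "final"; text: string };

class TranscriptBuffer {
  private latest = "";

  constructor(private onComplete: (text: string) => void) {}

  handle(event: TranscriptEvent): void {
    // Each partial supersedes the previous one; only the final
    // transcript triggers the LLM call.
    this.latest = event.text;
    if (event.type === "final") {
      this.onComplete(this.latest); // "okay, now it's done" -- make the LLM call
      this.latest = "";
    }
  }
}
```

Used like this, partials arrive and are ignored until the final event fires the callback:

```typescript
const buf = new TranscriptBuffer((question) => callLLM(question));
buf.handle({ type: "partial", text: "What is" });
buf.handle({ type: "final", text: "What is the weather today?" });
```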
But interestingly, sending text is actually super fast in the context of a voice conversation, right?
And actually the default example is crazy.
I didn't think this would work until we tried it, but it just uses a webhook.
When the user finishes speaking, the basic example sends your Next.js API route a webhook with the user text.
And it turns out that sending a webhook with a few sentences of text in it is fine.
That's fast.
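A sketch of what that webhook receiver could look like, written in the style of a Next.js route handler; the payload shape (`{ text }`), the reply shape, and the `generateReply` helper are all assumptions for illustration:

```typescript
// Sketch of the webhook side: when the user finishes speaking, the
// service POSTs the user's text to your API route. The payload and
// reply shapes here are assumptions, not a documented contract.
async function handleSpeechWebhook(request: Request): Promise<Response> {
  const { text } = (await request.json()) as { text: string };

  // The finished utterance is just a few sentences of text, so the
  // webhook hop itself is cheap; the LLM call below is the slow part.
  const reply = await generateReply(text);
  return Response.json({ reply });
}

// Hypothetical stand-in for the real LLM request.
async function generateReply(text: string): Promise<string> {
  return `You said: ${text}`;
}
```

In a Next.js App Router project, a function with this signature could be exported as `POST` from a `route.ts` file.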
It's all the other stuff, like then waiting for the LLM to respond.
Yeah, and we've got a WebSocket endpoint now, so we can kind of shave off that HTTP connection and everything.
But yeah, that's when the big, heavy latency items come in.
Like generating an LLM response.