Damien Tanner
We didn't touch on it, but interruptions are this other really difficult, dynamic part where, whilst the agent is speaking its response to you, if the user starts speaking again, you then need to decide in real time whether the user is interrupting the agent.
Or are they just going, mm-hmm, yeah, and agreeing with the agent?
Oh, gosh, yes.
Or are they trying to say, ooh, stop?
I bet that's a hard problem to solve.
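The backchannel-versus-interruption decision described above could be sketched as a simple heuristic: short utterances made up entirely of acknowledgement words let the agent keep talking, anything else triggers a barge-in. This is a toy illustration, not the actual classifier being discussed; real systems would also weigh acoustic features and confidence.

```python
# Hypothetical sketch: classify speech heard while the agent is talking.
BACKCHANNELS = {"mm-hmm", "uh-huh", "mhm", "yeah", "yep", "right", "okay", "ok"}

def classify_user_speech(transcript: str, duration_ms: int) -> str:
    """Return "backchannel" (agent keeps speaking) or "interruption"
    (tear down the current response and listen).

    Toy rule: a short utterance made only of acknowledgement words is
    a backchannel; everything else, including a short "stop", interrupts.
    """
    words = [w.strip(".,!?") for w in transcript.lower().split()]
    if duration_ms < 1200 and words and all(w in BACKCHANNELS for w in words):
        return "backchannel"
    return "interruption"
```

Note that "stop" is exactly as short as "yeah", which is why duration alone isn't enough and the content of the transcript has to be checked too.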
We still have to be transcribing audio even while the user's hearing the agent's response.
And we've got to deal with background noise and everything.
And then, when we're confident the user is trying to interrupt the agent, we've got to do this whole state change where we tear down the in-flight LLM request and the in-flight voice-generation request, and then, as quickly as possible, start focusing on the user's new question.
And especially if their interruption is really short, like "stop", suddenly you've got to tear down all the old stuff, transcribe that word "stop", ship it as a new LLM request to the backend, generate the response, and then get the agent speaking back as quickly as possible.
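The tear-down being described maps naturally onto cancelling a group of concurrent tasks. A minimal sketch with `asyncio`, assuming the LLM call and voice generation run as separate tasks per agent turn (the class and names are illustrative, not the actual pipeline's API):

```python
import asyncio

class AgentTurn:
    """Tracks the in-flight work for one agent response so that an
    interruption can cancel all of it at once (illustrative sketch)."""

    def __init__(self) -> None:
        self.tasks: list[asyncio.Task] = []

    def start(self, *coros) -> None:
        # e.g. the in-flight LLM request and the voice-generation request.
        self.tasks = [asyncio.create_task(c) for c in coros]

    async def cancel(self) -> None:
        # Tear everything down, then wait for the cancellations to land
        # before starting work on the user's new question.
        for t in self.tasks:
            t.cancel()
        await asyncio.gather(*self.tasks, return_exceptions=True)
        self.tasks = []
```

On a confirmed interruption you'd `await turn.cancel()` first, then feed the freshly transcribed "stop" into a brand-new turn, so no stale work can race with the new response.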
It's all happening down one pipe, as it were, at the end of the day.
It's like audio from the browser microphone coming in, and then audio playing back out.
And we would have bugs like: you'd interrupt the agent, but then, when it started replying, there'd still be a few chunks of 20-millisecond audio from the old response snuck in there.
Or the old audio would be interleaved with the new audio coming back from the agent.
And you're kind of in the, you know, audacity or something, some audio editor trying to work out like, what's going, why does it sound like this?
And you're like rearranging bits of audio going, ah, okay.
The responses are taking turns every 20 milliseconds.
It had interleaved the two responses, and you're pulling them apart to try and work out what's going on.
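The interleaving bug above is the classic symptom of stale chunks surviving a tear-down. One common fix is to tag every audio chunk with a generation counter and bump the counter on interruption, so chunks from the cancelled response get dropped at the playback boundary. A minimal sketch, with hypothetical names, assuming 20 ms PCM chunks:

```python
from dataclasses import dataclass

@dataclass
class AudioChunk:
    generation: int  # which agent response produced this 20 ms chunk
    pcm: bytes

class Playback:
    """Drops chunks from a cancelled response instead of interleaving
    them with the new reply (illustrative sketch, not the real pipeline)."""

    def __init__(self) -> None:
        self.current_generation = 0
        self.played: list[bytes] = []

    def interrupt(self) -> None:
        # Everything produced before this point is now stale.
        self.current_generation += 1

    def enqueue(self, chunk: AudioChunk) -> None:
        if chunk.generation == self.current_generation:
            self.played.append(chunk.pcm)
        # else: late chunk from the torn-down response; silently drop it
```

The design point is that cancellation alone isn't enough: chunks already in flight down the pipe will still arrive after the tear-down, so the consumer end has to be able to recognise and discard them.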