Damien Tanner
๐ค SpeakerAppearances Over Time
Podcast Appearances
Real, real pain in the ass to debug.
It really depends on the use case.
How you configure the voice agent really depends on how the voice agent is being used, right?
Like a therapy voice agent needs to behave very differently than a vet appointment booking answering phone agent.
MARK BLYTH, Yeah.
There's that.
And when we call that audio environments, that's often an early issue users have.
They're like, well, my user's called from cafes.
And it kind of really misunderstands them.
And big problem with audio transcription, it just transcribes any audio it hears, right?
So if someone's talking behind you, it doesn't know that
The model doesn't quite know that's a relevant conversation.
It's just transcribing it all.
But if you imagine the therapy voice agent, it needs to actually not respond too quickly to the user.
It needs to let the user have long, pondering thoughts, long sentences.
Big pauses.
And so you can choose a few different levels of interruption, right?
You can just interrupt when you hear any word.
By default, we interrupt when we hear any word.
That's not a filler word.