Grant Harvey
๐ค SpeakerVoice Profile Active
This person's voice can be automatically recognized across podcast episodes using AI voice matching.
Appearances Over Time
Podcast Appearances
Have you seen anything over the past year or like six months even that has lit you up in terms of like, ooh, this could be a good alternative for that memory context problem?
Or are you like, we still haven't seen anything that's even close?
Although I will say with that equation, being able to work in parallel, I think makes you maybe perhaps make those trade-offs or you can make some of those calculations more efficiently.
Would you agree with that?
If that makes sense?
How does this work in a Gentic context then?
How would it be different, let's say, in something like Cloud Code, where it's going out, it's doing a bunch of tool calls, it's reasoning.
How does it play out in that scenario, if different?
Yes.
Yes.
Um, you mentioned earlier, so like one of the, the, um, unique things and benefits of being the person who basically made this, um, uh, is that, you know, all of the tricks of how to, you know, run it and deploy it.
Um, and you know, there's a lot of frameworks that support regular LLMs, but maybe not as many frameworks or, you know, you have an in-house framework for dealing with diffusion.
Would you ever make like an open version of that?
Like, do you want everyone to go through you?
Like what, what's your, what's your business plan there?
I guess.
Yeah, that's fair.
No, that's totally fair.
I was just thinking like, you know, like in one of NVIDIA's strengths, right, is CUDA, which kind of keeps everyone locked in.
So I was wondering if there was a similar kind of play there eventually where, you know, you make it easy for everyone to do this, but then you're still the gatekeeper in some way.