Kieran Kunhya
π€ SpeakerAppearances Over Time
Podcast Appearances
And this is also why it's handwritten.
I don't know anyone.
I've never heard any other project than David doing that.
This is why Kiran calls it an art, right?
It is an art.
per dollar invested, right?
And sometimes it's going to be a problem that is limited by your hardware.
A good analogy is what you see in quantization in LLMs, right?
And people are doing, oh, I'm going to do that in FP8 or FP4 or some crazy things like Microsoft Fear who did it in 1.5.
Because you're constrained by memory, because you're constrained by the machine you can run, because at some point we are doing real time.
And I believe this is going to happen on AI inference also, is that at some point you need to get faster and you cannot always get harder, more powerful hardware, right?
So you need to analyze code and see where, like, where is the mission critical?
Where is the things that are called non-stops?
And for example, David is a good example.
It's going to be run,
billions of hours per day.
That makes sense.
It doesn't make sense to be on the glue of FFmpeg CLI.
It makes sense over there.
We are also arriving at a point where we've done so many great things, but the hardware is getting back to us, right?