Dylan Patel
๐ค SpeakerAppearances Over Time
Podcast Appearances
He's got the voting shares.
I don't think it's a diminishing return, right?
I think that's important to recognize, right?
Given it's a log-log chart, scaling laws are.
Given there's no model architecture improvements, you just throw more compute data, model size at it, it gets better at this pace.
Everything has shown that it will continue and it's continued over more than 10 orders of magnitude.
Well, GPT-5 is not necessarily that much bigger than the 4.0, right?
And 4.0 is smaller than 4.
What's changing is sort of the paradigm of how you spend the compute
And also like if they made a bigger model, could they even serve it?
No, they did 4.5 and it was terrible.
No one could, no one could serve it.
It was actually like quite a bit smarter, but they couldn't actually serve it at any reasonable cost and speed.
Anthropic has the same issue.
I wouldn't even call it an issue, but like all of their revenue comes from four sonnet.
It doesn't come from 4.1 Opus, which is the better model.
It's bigger, but it's slow.
Because the hardware is not caught up in terms of inference speed for that.
And so no one wants to use a slow model, right?
The user experience sucks.