Chamath Palihapitiya
π€ SpeakerVoice Profile Active
This person's voice can be automatically recognized across podcast episodes using AI voice matching.
Appearances Over Time
Podcast Appearances
OpenAI, even if they're not at a billion, they're still at 900 million weekly users, which is well ahead of whatever Claude is at.
I think Claude is like probably sub 100 million sacks, you may know.
And then Gemini is probably closer to them at 700 to a billion, somewhere in that range, probably pretty neck and neck with OpenAI.
So, you know, the consumer market looks like it's trending towards a chat GPT slash Google fight for first place and second place, and then probably anthropic in third place.
And maybe Elon emerges and takes off enabled by his compute capacity.
And then the enterprise market is a little bit of a different story.
And that's its own market, which is kind of anthropic.
or probably Google in the lead, actually, if you look at all the Vertex use, Google claims that 75% of GCP customers are active users of Vertex.
So there's probably a pretty sizable market share that Google's captured on the enterprise side as well.
This is also probably why Google stock has absolutely ripped over the last couple of months is they're literally in first place or fighting for first place in enterprise and consumer.
But I still think that there's a lot of opportunity to Chamath's point about the compute and energy capacity constraints in improving how we actually scale and deploy models in both the enterprise and the consumer setting.
And it is such early days.
And I just want to highlight this paper that came out.
from MIT, from these two scientists.
And these guys published a paper on pruning techniques and neural networks.
This paper showed that you could actually reduce the size of these networks by 90% and get the same accuracy out by pruning very large models down to smaller models.
And then you can make a selection on which model to run for inference.
And by doing this, you can actually reduce inference costs by 10x, you can get 10x the output
per energy unit that goes into the data center with no loss of accuracy.
And so it's a really interesting call it algorithmic technique that can be applied to the existing large models to actually make them much lower energy use.