Lennart Heim
👤 PersonAppearances Over Time
Podcast Appearances
Absolutely.
Well, I mean, we just talked about history.
We talked about going from two chips to 100,000 chips.
The two chips was not a transformer architecture.
At some point, we developed transformers.
You know, at some point, we had a new loss function.
Me, as a technologist, I love when people develop new papers, but I'm interested in the macro trends.
That's why I just love, like, line going up, compute Moore's law, more transistors per area.
Generally, too.
How do we do it?
Completely different architectures over time.
If you find the right abstraction layer, again, you see these exponentials being the case there.
Then some people argue, like, oh, we have a new architecture that does this.
And they're like, true.
That's what happens all the time.
This is what we call increasing algorithmic efficiency and computer efficiency.
I think it's quite unlikely that somebody tomorrow pulls out a new architecture and says, oh, look, AGI on a smartphone.
I don't think that's the case.
I think we would first build AGI on a big cluster.
And then a couple of decades later, we would build it on a smartphone.