Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Azeem Azhar

👤 Speaker
6838 total appearances

Appearances Over Time

Podcast Appearances

Azeem Azhar's Exponential View
What NVIDIA’s bet on OpenClaw means for the future of AI and your token budget

And you know exactly what that is like because you have used these tools.

Azeem Azhar's Exponential View
What NVIDIA’s bet on OpenClaw means for the future of AI and your token budget

word after word after word, sequentially at a time, like a slow teletype from the old days.

Azeem Azhar's Exponential View
What NVIDIA’s bet on OpenClaw means for the future of AI and your token budget

The model is producing tokens one at a time, each depending on the previous one.

Azeem Azhar's Exponential View
What NVIDIA’s bet on OpenClaw means for the future of AI and your token budget

This can't be parallelized.

Azeem Azhar's Exponential View
What NVIDIA’s bet on OpenClaw means for the future of AI and your token budget

It's structurally sequential.

Azeem Azhar's Exponential View
What NVIDIA’s bet on OpenClaw means for the future of AI and your token budget

The bottleneck here now is no longer raw compute.

Azeem Azhar's Exponential View
What NVIDIA’s bet on OpenClaw means for the future of AI and your token budget

It's memory bandwidth.

Azeem Azhar's Exponential View
What NVIDIA’s bet on OpenClaw means for the future of AI and your token budget

How fast can you stream the model's weights from memory for each individual token step?

Azeem Azhar's Exponential View
What NVIDIA’s bet on OpenClaw means for the future of AI and your token budget

So now you have lots of GPU calls sitting largely idle waiting on memory reads just to produce one token at a time.

Azeem Azhar's Exponential View
What NVIDIA’s bet on OpenClaw means for the future of AI and your token budget

So GPUs are not fantastic here.

Azeem Azhar's Exponential View
What NVIDIA’s bet on OpenClaw means for the future of AI and your token budget

They weren't designed for this.

Azeem Azhar's Exponential View
What NVIDIA’s bet on OpenClaw means for the future of AI and your token budget

And as we move from the training era to the inference era, well, the workflows are shifting from building models to running them constantly at scale for billions of users and lots of agents.

Azeem Azhar's Exponential View
What NVIDIA’s bet on OpenClaw means for the future of AI and your token budget

That efficiency becomes a serious problem.

Azeem Azhar's Exponential View
What NVIDIA’s bet on OpenClaw means for the future of AI and your token budget

That's why you acquire Grog,

Azeem Azhar's Exponential View
What NVIDIA’s bet on OpenClaw means for the future of AI and your token budget

do whatever deal you did with grok it's a fast move a high conviction move by an incumbent to ensure it can serve that changing market a quick note if you want to support us in bringing more of these conversations to the world please consider subscribing to the show

Azeem Azhar's Exponential View
What NVIDIA’s bet on OpenClaw means for the future of AI and your token budget

So NVIDIA is going to make some new chips or systems with GROP technology embedded later this year.

Azeem Azhar's Exponential View
What NVIDIA’s bet on OpenClaw means for the future of AI and your token budget

The point is that that combined architecture uses NVIDIA's homegrown Vero Rubin GPUs and GROP processing units, and it will result in a 35-fold improvement in the throughput per megawatt of power versus NVIDIA's current Blackwells, which are the posh chips of the moment.

Azeem Azhar's Exponential View
What NVIDIA’s bet on OpenClaw means for the future of AI and your token budget

This isn't the first time NVIDIA has done something like this.

Azeem Azhar's Exponential View
What NVIDIA’s bet on OpenClaw means for the future of AI and your token budget

They acquired Mellanox.

Azeem Azhar's Exponential View
What NVIDIA’s bet on OpenClaw means for the future of AI and your token budget

It was a new networking company and it turned into a real advantage for NVIDIA.