Azeem Azhar

👤 Speaker
6838 total appearances

Podcast Appearances

Azeem Azhar's Exponential View
What it will take for AI to scale (energy, compute, talent)

We are starting to see more and more companies routing requests to cheaper models, which means models that use less electricity and cost less, within their application.

So you put in the query and the query figures out, oh, maybe I should send this to DeepSeek rather than to an OpenAI model to get the result.
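The routing idea described here can be sketched in a few lines. This is a minimal illustration, not how any particular product does it: the model names, relative costs, capability tiers, and the keyword heuristic are all hypothetical placeholders, and a real router would likely use a learned classifier instead.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str                  # placeholder label, not a real product identifier
    cost_per_1k_tokens: float  # made-up relative cost units
    capability: int            # made-up tier: higher handles harder queries


# Hypothetical catalogue; every number here is illustrative only.
MODELS = [
    Model("small-cheap-model", 0.1, 1),
    Model("mid-model", 1.0, 2),
    Model("large-expensive-model", 10.0, 3),
]

# Assumed keywords that hint at a harder query.
HARD_HINTS = ("prove", "step by step", "debug", "derive")


def estimate_difficulty(query: str) -> int:
    """Crude stand-in for a learned classifier: returns a tier from 1 to 3."""
    text = query.lower()
    if any(hint in text for hint in HARD_HINTS):
        return 3
    if len(text.split()) > 50:
        return 2
    return 1


def route(query: str, models=MODELS) -> Model:
    """Pick the cheapest model whose capability tier covers the query."""
    needed = estimate_difficulty(query)
    eligible = [m for m in models if m.capability >= needed]
    return min(eligible, key=lambda m: m.cost_per_1k_tokens)
```

Calling `route("What is the capital of France?")` selects the cheapest entry, while a query containing one of the assumed hard hints falls through to the most capable one.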

And we've made real technical progress both across the algorithms and across the chips that serve them over the last few years.

If you look at a sort of GPT-4-level class of inferencing back in 2022, it would take one watt-hour to generate 50 tokens.

So what is a watt hour?

If you've got a 10 watt LED bulb, one watt hour is sort of leaving that on for six minutes to get your 50 tokens.

Today with the latest NVIDIA chips and more efficient optimized language models, we're getting to about 600 tokens per watt hour.

So that's a 20X improvement just over four years.
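As a quick check of the arithmetic, the figures quoted in the episode can be plugged into a short script. The constants are the numbers as transcribed (50 and 600 tokens per watt-hour, a 10-watt bulb); the function names are just illustrative. Note that these two figures on their own give a ratio of 12, so the "20X" quoted may rest on a different baseline or on rounding that the transcript does not spell out.

```python
# Figures as quoted in the transcript.
TOKENS_PER_WH_2022 = 50
TOKENS_PER_WH_TODAY = 600


def wh_per_token(tokens_per_wh: float) -> float:
    """Energy needed for one token, in watt-hours."""
    return 1.0 / tokens_per_wh


def led_minutes(energy_wh: float, bulb_watts: float = 10.0) -> float:
    """Minutes a bulb of the given wattage runs on energy_wh watt-hours."""
    return energy_wh * 60.0 / bulb_watts


# Ratio implied by the two quoted efficiency figures.
improvement = TOKENS_PER_WH_TODAY / TOKENS_PER_WH_2022  # 12.0

# One watt-hour keeps a 10 W bulb lit for six minutes, matching the analogy.
minutes_for_one_wh = led_minutes(1.0)
```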

Of course, the amount of tokens we want has increased significantly.

And so on the other hand, you have these new reasoning models that might burn 10 to 100 times more tokens per query.
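To see how the efficiency gain and the extra token usage pull against each other, here is a rough sketch. The 50 and 600 tokens-per-watt-hour figures and the 10x to 100x multipliers come from the episode; the baseline of 500 tokens per query is purely an assumption chosen for illustration.

```python
BASELINE_TOKENS = 500  # assumed tokens for an ordinary query; not from the episode


def energy_per_query_wh(tokens: float, tokens_per_wh: float) -> float:
    """Watt-hours consumed to generate the given number of tokens."""
    return tokens / tokens_per_wh


# Ordinary query at the quoted 2022 efficiency (50 tokens per Wh).
old_query = energy_per_query_wh(BASELINE_TOKENS, 50)

# Ordinary query at the quoted current efficiency (600 tokens per Wh).
new_query = energy_per_query_wh(BASELINE_TOKENS, 600)

# Reasoning-style queries at current efficiency, using the quoted multipliers.
reasoning_10x = energy_per_query_wh(BASELINE_TOKENS * 10, 600)
reasoning_100x = energy_per_query_wh(BASELINE_TOKENS * 100, 600)
```

Under these assumptions a 10x reasoning query still uses somewhat less energy than the 2022-era ordinary query, while a 100x one uses several times more, which is the tension the speaker is pointing at.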

So what you have there is these firms, the hyperscalers, who are really hungry for compute. They're hungry for compute not just for AI, but for other workloads.

And they're hungry for that because we as consumers and as businesses want those types of services.

So you've got those on one hand; the grids can't keep up, and the GPUs are being rationed between training and serving.

This is a short-term squeeze, is my sense, and as the industry matures, the trade-offs will become much more apparent.

We'll get through some of the blockages around providing power.

We often see that in markets: you get these squeezes, and ultimately industries take one or two years to reconfigure and to be able to deliver what is required.

It's just not going to happen tomorrow.

I think the third thing is about the economic engine and the question about whether we are going to see results from all of this.