Azeem Azhar
We are starting to see more and more companies routing requests to cheaper models, meaning models that use less electricity and cost less, within their applications.
So you put in the query, and the router figures out, oh, maybe I should send this to DeepSeek rather than to an OpenAI model to get the result.
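A minimal sketch of what this kind of cost-aware routing might look like. The model names, per-token prices, and the toy difficulty heuristic are all illustrative assumptions, not anything described in the conversation:

```python
# Minimal sketch of cost-aware model routing. Model names, prices,
# and the difficulty heuristic are hypothetical placeholders.
from dataclasses import dataclass

@dataclass
class Model:
    name: str
    cost_per_1k_tokens: float  # USD, illustrative only
    quality: int               # rough capability tier

MODELS = [
    Model("cheap-model", 0.0002, 1),
    Model("mid-model", 0.002, 2),
    Model("frontier-model", 0.02, 3),
]

def route(query: str) -> Model:
    """Pick the cheapest model whose tier covers the query's
    estimated difficulty (a stand-in for a learned classifier)."""
    needed = 3 if len(query) > 500 else (2 if "explain" in query.lower() else 1)
    candidates = [m for m in MODELS if m.quality >= needed]
    return min(candidates, key=lambda m: m.cost_per_1k_tokens)

print(route("What is 2 + 2?").name)  # -> cheap-model
```

In a production system the difficulty estimate would typically come from a small classifier rather than string heuristics, but the cost-minimizing selection step is the same idea.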
And we've made real technical progress, both in the algorithms and in the chips that serve them, over the last few years.
If you look at GPT-4-class inference back in 2022, it would take one watt-hour to generate about 50 tokens.
So what is a watt-hour?
If you've got a 10-watt LED bulb, one watt-hour is roughly leaving it on for six minutes to get your 50 tokens.
Today, with the latest NVIDIA chips and more efficient, optimized language models, we're getting to about 600 tokens per watt-hour.
So that's a 12x improvement in just four years.
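As a quick check on those figures, here is the arithmetic written out. The bulb wattage and the 2022/today token rates are the ones quoted above; the script itself is just illustrative:

```python
# Back-of-the-envelope check of the efficiency figures quoted above.
tokens_per_wh_2022 = 50    # GPT-4-class inference, 2022
tokens_per_wh_now = 600    # latest chips + optimized models

# A 10 W LED bulb uses 1 Wh in 1/10 of an hour = 6 minutes.
bulb_watts = 10
minutes_per_wh = 60 / bulb_watts   # 6.0

improvement = tokens_per_wh_now / tokens_per_wh_2022
print(f"1 Wh ~= {minutes_per_wh:.0f} min of a {bulb_watts} W bulb")
print(f"Efficiency improvement: {improvement:.0f}x")   # 12x
```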
Of course, the number of tokens we want has increased significantly.
So on the other hand, you have these new reasoning models that might burn 10 to 100 times more tokens per query.
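To see how those two trends net out, here is an illustrative calculation. The 12x efficiency gain and the 10 to 100x token multipliers come from the figures above; the per-query framing is an assumption:

```python
# Illustrative: efficiency gains vs. reasoning-model token growth.
efficiency_gain = 600 / 50          # 12x more tokens per Wh
for token_multiplier in (10, 100):  # reasoning models burn 10-100x tokens
    # Energy per query scales as tokens used / efficiency gain.
    net_energy_change = token_multiplier / efficiency_gain
    print(f"{token_multiplier}x tokens -> "
          f"{net_energy_change:.1f}x energy per query vs. 2022")
# Output: 10x tokens -> 0.8x ; 100x tokens -> 8.3x
```

In other words, at the high end of the reasoning-token range, efficiency gains alone don't offset the growth in tokens per query.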
So what you have is these firms, the hyperscalers, who are really hungry for compute, not just for AI but for other workloads as well.
And they're hungry for that because we as consumers and as businesses want those types of services.
So you've got that demand on one hand, while the grids can't keep up and GPUs are being rationed between training and serving.
My sense is that this is a short-term squeeze, and that as the industry matures, the trade-offs will become much more apparent.
We'll get through some of the blockages around providing power.
We often see this in markets: you get these squeezes, and ultimately industries take one or two years to reconfigure and deliver what is required.
It's just not going to happen tomorrow.
I think the third thing is about the economic engine and the question of whether we are going to see results from all of this.