Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Azeem Azhar

๐Ÿ‘ค Speaker
5487 total appearances

Appearances Over Time

Podcast Appearances

Azeem Azhar's Exponential View
What it will take for AI to scale (energy, compute, talent)

And earlier this week, Brookfield

Azeem Azhar's Exponential View
What it will take for AI to scale (energy, compute, talent)

which is an asset manager lined up with our estimates that in a few years, about 70 to 75% of compute cycles will be used on inference.

Azeem Azhar's Exponential View
What it will take for AI to scale (energy, compute, talent)

So that's going to be a tension, right?

Azeem Azhar's Exponential View
What it will take for AI to scale (energy, compute, talent)

Do we pay bills today or do we build the next big thing in some different way?

Azeem Azhar's Exponential View
What it will take for AI to scale (energy, compute, talent)

And you see the labs.

Azeem Azhar's Exponential View
What it will take for AI to scale (energy, compute, talent)

I mean, I think the contrast between Anthropic and OpenAI is most marked in how they approach that, right?

Azeem Azhar's Exponential View
What it will take for AI to scale (energy, compute, talent)

Anthropic appears to be rather more focused in thinking through the economics of that particular trade-off between training and inference.

Azeem Azhar's Exponential View
What it will take for AI to scale (energy, compute, talent)

There are levers to address that.

Azeem Azhar's Exponential View
What it will take for AI to scale (energy, compute, talent)

Efficiency gains being one that is an obvious approach.

Azeem Azhar's Exponential View
What it will take for AI to scale (energy, compute, talent)

We are starting to see more and more companies routing requests to cheaper, which means models that use less electricity and cost less models in

Azeem Azhar's Exponential View
What it will take for AI to scale (energy, compute, talent)

within their application.

Azeem Azhar's Exponential View
What it will take for AI to scale (energy, compute, talent)

So you put in the query and the query figures out, oh, maybe I should send this to DeepSeek rather than to an open AI model to get the result.

Azeem Azhar's Exponential View
What it will take for AI to scale (energy, compute, talent)

And we've made real technical progress both across the algorithms and across the chips that serve them over the last few years.

Azeem Azhar's Exponential View
What it will take for AI to scale (energy, compute, talent)

If you look at a sort of GPT-4 level class of inferencing back in 2022, it would take

Azeem Azhar's Exponential View
What it will take for AI to scale (energy, compute, talent)

one watt hour to generate 50 tokens.

Azeem Azhar's Exponential View
What it will take for AI to scale (energy, compute, talent)

So what is a watt hour?

Azeem Azhar's Exponential View
What it will take for AI to scale (energy, compute, talent)

If you've got a 10 watt LED bulb, one watt hour is sort of leaving that on for six minutes to get your 50 tokens.

Azeem Azhar's Exponential View
What it will take for AI to scale (energy, compute, talent)

Today with the latest NVIDIA chips and more efficient optimized language models, we're getting to about 600 tokens per watt hour.

Azeem Azhar's Exponential View
What it will take for AI to scale (energy, compute, talent)

So that's a 20X improvement just over four years.

Azeem Azhar's Exponential View
What it will take for AI to scale (energy, compute, talent)

Of course, the amount of tokens we want has increased significantly.