Azeem Azhar

What it will take for AI to scale (energy, compute, talent)

And earlier this week, Brookfield

What it will take for AI to scale (energy, compute, talent)

which is an asset manager lined up with our estimates that in a few years, about 70 to 75% of compute cycles will be used on inference.

424.69 View full episode →

Azeem Azhar's Exponential View

What it will take for AI to scale (energy, compute, talent)

So that's going to be a tension, right?

433.198 View full episode →

Azeem Azhar's Exponential View

What it will take for AI to scale (energy, compute, talent)

Do we pay bills today or do we build the next big thing in some different way?

434.7 View full episode →

Azeem Azhar's Exponential View

What it will take for AI to scale (energy, compute, talent)

And you see the labs.

439.064 View full episode →

Azeem Azhar's Exponential View

What it will take for AI to scale (energy, compute, talent)

I mean, I think the contrast between Anthropic and OpenAI is most marked in how they approach that, right?

440.485 View full episode →

Azeem Azhar's Exponential View

What it will take for AI to scale (energy, compute, talent)

Anthropic appears to be rather more focused in thinking through the economics of that particular trade-off between training and inference.

446.611 View full episode →

Azeem Azhar's Exponential View

What it will take for AI to scale (energy, compute, talent)

There are levers to address that.

453.698 View full episode →

Azeem Azhar's Exponential View

What it will take for AI to scale (energy, compute, talent)

Efficiency gains being one that is an obvious approach.

455.842 View full episode →

Azeem Azhar's Exponential View

What it will take for AI to scale (energy, compute, talent)

We are starting to see more and more companies routing requests to cheaper, which means models that use less electricity and cost less models in

460.391 View full episode →

Azeem Azhar's Exponential View

What it will take for AI to scale (energy, compute, talent)

within their application.

470.692 View full episode →

Azeem Azhar's Exponential View

What it will take for AI to scale (energy, compute, talent)

So you put in the query and the query figures out, oh, maybe I should send this to DeepSeek rather than to an open AI model to get the result.

471.814 View full episode →

Azeem Azhar's Exponential View

What it will take for AI to scale (energy, compute, talent)

And we've made real technical progress both across the algorithms and across the chips that serve them over the last few years.

478.706 View full episode →

Azeem Azhar's Exponential View

What it will take for AI to scale (energy, compute, talent)

If you look at a sort of GPT-4 level class of inferencing back in 2022, it would take

487.582 View full episode →

Azeem Azhar's Exponential View

What it will take for AI to scale (energy, compute, talent)

one watt hour to generate 50 tokens.

494.013 View full episode →

Azeem Azhar's Exponential View

What it will take for AI to scale (energy, compute, talent)

So what is a watt hour?

497.037 View full episode →

Azeem Azhar's Exponential View

What it will take for AI to scale (energy, compute, talent)

If you've got a 10 watt LED bulb, one watt hour is sort of leaving that on for six minutes to get your 50 tokens.

498.359 View full episode →

Azeem Azhar's Exponential View

What it will take for AI to scale (energy, compute, talent)

Today with the latest NVIDIA chips and more efficient optimized language models, we're getting to about 600 tokens per watt hour.

504.969 View full episode →

Azeem Azhar's Exponential View

What it will take for AI to scale (energy, compute, talent)

So that's a 20X improvement just over four years.

513.862 View full episode →

Azeem Azhar's Exponential View

What it will take for AI to scale (energy, compute, talent)

Of course, the amount of tokens we want has increased significantly.

518.088 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment