Jaeden Schaefer

👤 Speaker
2075 total appearances

Podcast Appearances

AI HR
Microsoft Reveals Maia 200 AI Inference Chip

This is very purpose-built.

It's a silicon platform aimed at one of the most expensive and most complex parts of modern AI systems, if you're looking at this from an operational perspective, and that is large-scale inference.

The Maia 200 is the successor to the Maia 100, which Microsoft launched back in 2023 as their first serious in-house AI chip.

This new generation, now that they've made the 200, is a really big step forward.

So there's a couple of things that it does.

Number one is just raw performance.

And then also how tightly the chip is integrated into Microsoft's broader cloud and AI stack.

So according to them, the Maia 200 has more than 100 billion transistors and is capable of delivering up to 10 petaflops of performance at 4-bit precision and roughly 5 petaflops at 8-bit, which is a massive increase over the last generation.

And I think it's really trying to optimize for running larger language models efficiently in production.

It's interesting to me seeing Microsoft get into the chips game.

There are a lot of competitors in this space, but not many that can really compete at this level.

And Microsoft, I think, sees just how much money they'll have to spend, not to mention how they're not able to customize everything the way they'd like if they're using outside suppliers for this.

So it's interesting for me seeing them get into this.

And for those that are curious, inference is essentially just the process of executing a trained AI model to generate outputs, as opposed to training, which involves teaching the model in the first place.
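
To make the training versus inference distinction concrete, here is a minimal sketch in Python. It is an editorial illustration, not something from the episode: the tiny one-weight model, the function names `train` and `predict`, and the sample data are all assumptions chosen for brevity, and they have nothing to do with how the Maia 200 actually runs models.

```python
def predict(weight, bias, x):
    """Inference: run the already-trained model forward to produce an output."""
    return weight * x + bias


def train(samples, steps=2000, lr=0.01):
    """Training: repeatedly adjust the parameters to reduce error on known examples."""
    weight, bias = 0.0, 0.0
    for _ in range(steps):
        for x, y in samples:
            error = predict(weight, bias, x) - y
            # Nudge each parameter against the gradient of the squared error.
            weight -= lr * error * x
            bias -= lr * error
    return weight, bias


# Hypothetical training data drawn from y = 2x + 1.
samples = [(0.0, 1.0), (1.0, 3.0), (2.0, 5.0), (3.0, 7.0)]

# Training happens once and is the expensive part: many passes over the data.
weight, bias = train(samples)

# Inference happens every time someone uses the model: one cheap forward pass.
output = predict(weight, bias, 10.0)
print(output)  # close to 21 for this data
```

Training loops over the data thousands of times, while each inference call is a single forward pass. The cost of inference comes from how many of those calls a production system has to serve.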

Right.

So we have inference, which is getting it to generate for you.

And what's interesting is I think we talk a lot about the GPUs from NVIDIA that are involved if you want to train an AI model, and just how intense that can be.

And yes, it does cost a lot of money.

It is very intense.

But I think it's also important to remember there are millions of people around the world using these AI models.