Jaden Schaefer

Speaker
1542 total appearances

Podcast Appearances

AI in Business
Microsoft Reveals Maya 200 AI Inference Chip

And so I think this Maia 200 chip also reflects a really big shift in the whole industry, right?

The world's largest cloud providers are, more and more, getting into designing their own chips to try to reduce their reliance on NVIDIA.

And let's be honest, NVIDIA's GPUs have become basically the backbone of the AI boom.

But I think it also remains that they're very expensive and very supply constrained; it's hard to get them. And so I think Google was kind of pioneering this whole approach years ago. They had their Tensor Processing Units, their TPUs, which are offered as a cloud service rather than as standalone hardware. Amazon also followed up and kind of copied Google; they did Trainium and Inferentia.

Those are kind of their in-house accelerators for training and inference.

And then recently they also rolled out a new generation aimed at improving price-performance, you know, for larger models.

So we see Google doing it.

We do see Amazon with AWS doing it.

So it kind of only makes sense that we're seeing Microsoft get more serious about this.

And I mean, they already had the 100 version of this chip.

Now this is the 200.

I think Microsoft is now really solidly positioned with Maia as a peer to some of those other alternatives from Google and Amazon.

And so in their big announcement, they said that it delivered roughly three times the FP4 performance of third-generation Amazon Trainium chips and exceeded the FP8 performance of Google's seventh-generation TPUs.

So I think while those types of comparisons often depend on, you know, specific workloads, if we're being 100% honest, I think they do show that Microsoft is really trying to be competitive, not just internally but also across the broader AI cloud market. They know that it isn't just them that's going to be using these for training; they're going to have other customers and other people doing this.

So I think it's really important to remember Maia is not, you know, being treated as sort of an experimental side project.

Microsoft says that the chip is already powering internal workloads, which include models developed by its superintelligence team and also some of the core features of Copilot.