Jaden Schaefer:
And yes, it does cost a lot of money; it is very intense. But I think it's also important to remember that there are millions of people around the world using these AI models, and we also need to optimize the tech stack for the people who are generating things with them.
I think training often gets the headlines because it's this massive upfront compute demand, right? In order to train one of these models, you're spending millions and millions of dollars.
I think inference is quietly becoming a dominant cost center for a lot of these AI companies, because their models are getting deployed to millions of users.
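To put rough numbers on that shift, here's a back-of-envelope sketch in Python. Every figure in it, the training bill, the per-query serving cost, the traffic, is an illustrative assumption, not a reported number from any of these companies.

```python
# Illustrative back-of-envelope: one-time training cost vs. recurring
# inference cost. All inputs are assumptions chosen for the sketch.

TRAINING_COST = 100e6      # assumed one-time training bill: $100M
COST_PER_QUERY = 0.002     # assumed serving cost per query: $0.002
QUERIES_PER_DAY = 500e6    # assumed traffic: 500M queries/day

daily_inference = COST_PER_QUERY * QUERIES_PER_DAY
breakeven_days = TRAINING_COST / daily_inference

print(f"Daily inference spend: ${daily_inference:,.0f}")
print(f"Inference matches the training bill after {breakeven_days:,.0f} days")
# -> Daily inference spend: $1,000,000
# -> Inference matches the training bill after 100 days
```

At those assumed rates, the recurring serving bill catches up to the one-time training bill in about three months, and unlike training, it never stops accruing.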
Those deployments are chatbots. For Google, that's all of the search tools. For Microsoft, it's the Copilots, plus a lot of other enterprise software. Every query, autocomplete, or generated paragraph consumes compute power and cooling.
As a result, even a very small efficiency gain at the chip level can translate into really big cost savings at cloud scale.
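As a toy illustration of that scaling argument, here's a sketch with assumed inputs: a hypothetical per-query energy figure, a datacenter overhead factor, and an electricity price. None of these are Microsoft's numbers.

```python
# Toy model of how a small per-chip efficiency gain compounds at
# cloud scale. All inputs are assumptions chosen for illustration.

QUERIES_PER_YEAR = 1e12     # assumed fleet-wide queries per year
ENERGY_PER_QUERY_WH = 0.3   # assumed energy per query, watt-hours
PUE = 1.3                   # assumed datacenter overhead (cooling etc.)
PRICE_PER_KWH = 0.08        # assumed electricity price, $/kWh
EFFICIENCY_GAIN = 0.05      # a 5% per-chip efficiency improvement

annual_kwh = QUERIES_PER_YEAR * ENERGY_PER_QUERY_WH * PUE / 1000
annual_power_bill = annual_kwh * PRICE_PER_KWH
savings = annual_power_bill * EFFICIENCY_GAIN

print(f"Annual power bill: ${annual_power_bill:,.0f}")
print(f"Saved by a 5% chip-level gain: ${savings:,.0f}")
# -> Annual power bill: $31,200,000
# -> Saved by a 5% chip-level gain: $1,560,000
```

Even under these modest assumptions, a single-digit efficiency gain is worth millions per year, and that's before counting the hardware and cooling capacity it frees up.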
So it's interesting, because this is obviously something Microsoft is concerned about, but every other AI company should be, and is, concerned about it too: they need those cost savings not just when they're training the model, but when they're actually generating output.
Microsoft right now is betting that this new Maia 200 is going to be a really big shift in that financial equation.
They said the chip is designed to run today's largest frontier models, so you can imagine the ones from partners like OpenAI, on a single node, while leaving enough headroom to accommodate larger and more demanding architectures in the future, which is kind of interesting, right?
They're not just asking what OpenAI's models, or their own AI models, need today. They're asking what they're going to need in the future.
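To make the "single node with headroom" idea concrete, here's a hypothetical sketch. The node spec and model sizes below are placeholders, not published Maia 200 figures, and it only counts raw weight memory, ignoring KV cache and activations.

```python
# Sketch of the "single node with headroom" idea: does a model's
# weight footprint fit in one node's aggregate accelerator memory?
# The node spec and model sizes are hypothetical placeholders,
# not published Maia 200 figures.

ACCELERATORS_PER_NODE = 8
HBM_PER_ACCELERATOR_GB = 216  # assumed HBM per chip
NODE_MEMORY_GB = ACCELERATORS_PER_NODE * HBM_PER_ACCELERATOR_GB

def fits_on_node(params_billions: float, bytes_per_param: int = 2) -> None:
    """Report weight footprint (2 bytes/param for FP16/BF16)
    against node memory, ignoring KV cache and activations."""
    weights_gb = params_billions * bytes_per_param
    headroom = NODE_MEMORY_GB - weights_gb
    print(f"{params_billions:.0f}B params -> {weights_gb:,.0f} GB weights, "
          f"headroom {headroom:,.0f} GB")

fits_on_node(405)   # a frontier-scale model today
fits_on_node(700)   # a larger, hypothetical future model
# -> 405B params -> 810 GB weights, headroom 918 GB
# -> 700B params -> 1,400 GB weights, headroom 328 GB
```

That leftover memory is the headroom they're talking about: room for bigger weights, longer contexts, and heavier architectures without spilling onto a second node.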