Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing
Podcast Image

Dwarkesh Podcast

Satya Nadella — How Microsoft is preparing for AGI

12 Nov 2025

Transcription

Full Episode

0.031 - 7.82 Dwarkesh Patel

Today, we are interviewing Satya Nadella. We being me and Dylan Patel, who is founder of Semi Analysis. Satya, welcome.

0

8.04 - 10.623 Unknown

Thank you. It's great. Thanks for coming over to Atlanta.

0

10.783 - 24.418 Dwarkesh Patel

Yeah. Thank you for giving us a tour of the new facility. It's been really cool to see. Absolutely. Satya and Scott Guthrie, Microsoft's EVP of Cloud and AI, give us a tour of their brand new Fairwater 2 data center, the current most powerful in the world.

0

25.36 - 47.275 Satya Nadella

We've tried to 10x the training capacity every 18 to 24 months. And so this would be effectively a 10x increase, 10x from what GPD-5 was trained with. And so to put it in perspective, the number of optics, the network optics in this building is almost as much as all of Azure across all our data centers two and a half years ago. It's kind of what, 5 million network connections.

0

47.656 - 59.53 Dwarkesh Patel

You've got all this bandwidth between different sites in a region and between the two regions. So is this like a big bet on scaling in the future that you anticipate in the future there's going to be some huge model that needs to require two whole different regions to train?

59.97 - 80.579 Unknown

The goal is to be able to kind of aggregate these flops for a large training job and then put these things together across sites. Right. And the reality is you'll use it for training and then you'll use it for data gen, you'll use it for inference in all sort of ways. It's not like it's going to be used only for one workload forever.

80.759 - 95.563 Satya Nadella

Fairwater 4, which you're going to see under construction nearby, will also be on that one petabits network so that we can actually link the two at a very high rate and then basically we do the AIWAN connecting to Milwaukee where we have multiple other Fairwaters being built.

95.543 - 120.384 Unknown

Literally, you can see the model parallelism and the data parallelism. It's kind of built for essentially the training jobs, the pods, the super pods across this campus. And then with the WAN, you can go to the Wisconsin data center and literally run a training job with all of them getting aggregated.

120.404 - 139.314 Satya Nadella

And what we're seeing right here is this is a cell with no servers in it yet, no racks. How many racks are in a cell? Let me think about it. We don't necessarily share that per se, but let me... That's the reason I asked. You'll see upstairs. I'll start counting. You can start counting. We'll let you start counting. How many cells are there in this building? That part also I can't tell you.

Comments

There are no comments yet.

Please log in to write the first comment.