Scott Guthrie

👤 Speaker

See mentions of this person in podcasts

284 total appearances

Appearances Over Time

Podcast Appearances

The Neuron: AI Explained

Inside Microsoft's AI Superfactory with Scott Guthrie

Aerial photography, we showed a video in my keynote this morning for the first time of the Atlanta facility.

945.305 View full episode →

The Neuron: AI Explained

Inside Microsoft's AI Superfactory with Scott Guthrie

They are the densest concentration of GPU power in the world.

952.27 View full episode →

The Neuron: AI Explained

Inside Microsoft's AI Superfactory with Scott Guthrie

And by densest, I mean the most GPUs.

955.558 View full episode →

The Neuron: AI Explained

Inside Microsoft's AI Superfactory with Scott Guthrie

And then the most GB, Grace Blackwell GPUs, the latest all liquid cooled, which means that just from a pure token perspective,

958.325 View full episode →

The Neuron: AI Explained

Inside Microsoft's AI Superfactory with Scott Guthrie

You know, a Grace Blackwell 300 is roughly 12x the throughput of AWS Tranium 2.

969.365 View full episode →

The Neuron: AI Explained

Inside Microsoft's AI Superfactory with Scott Guthrie

Yeah.

975.878 View full episode →

The Neuron: AI Explained

Inside Microsoft's AI Superfactory with Scott Guthrie

So it's, you know, there might be one GPU each, but one is 12 times more powerful than the other.

976.559 View full episode →

The Neuron: AI Explained

Inside Microsoft's AI Superfactory with Scott Guthrie

Yeah, exactly, exactly.

982.751 View full episode →

The Neuron: AI Explained

Inside Microsoft's AI Superfactory with Scott Guthrie

And so you pack them all into one data center, and the data center is a flat network.

984.494 View full episode →

The Neuron: AI Explained

Inside Microsoft's AI Superfactory with Scott Guthrie

So it's basically, I think it was a two-tier flat network as opposed to a traditional cloud data center.

989.564 View full episode →

The Neuron: AI Explained

Inside Microsoft's AI Superfactory with Scott Guthrie

It would be usually three tiers, and so that means that when you want a GPU to talk to another GPU,

995.282 View full episode →

The Neuron: AI Explained

Inside Microsoft's AI Superfactory with Scott Guthrie

You're going through more switches, which means lower throughput, more latency.

1001.798 View full episode →

The Neuron: AI Explained

Inside Microsoft's AI Superfactory with Scott Guthrie

We've tried to keep a very flat network so that each of these GPUs can interconnect with the others at incredibly high volumes.

1006.847 View full episode →

The Neuron: AI Explained

Inside Microsoft's AI Superfactory with Scott Guthrie

And then part of what we've done to kind of optimize speed even is these data centers in the video you'll see are double-decker, meaning they're two floors.

1013.559 View full episode →

The Neuron: AI Explained

Inside Microsoft's AI Superfactory with Scott Guthrie

And the reason we do that is we put the network core of the data center in the center of the building on the second floor.

1021.714 View full episode →

The Neuron: AI Explained

Inside Microsoft's AI Superfactory with Scott Guthrie

And that means that no cable in the data center is more than about 230 meters.

1026.824 View full episode →

The Neuron: AI Explained

Inside Microsoft's AI Superfactory with Scott Guthrie

Yeah.

1032.491 View full episode →

The Neuron: AI Explained

Inside Microsoft's AI Superfactory with Scott Guthrie

Or 230 meters in length.

1034.414 View full episode →

The Neuron: AI Explained

Inside Microsoft's AI Superfactory with Scott Guthrie

And that minimizes latency because you're only as fast as the speed of light.

1036.617 View full episode →

The Neuron: AI Explained

Inside Microsoft's AI Superfactory with Scott Guthrie

But it also means that from a retransmission perspective, you're also really minimizing the number of retransmissions literally at the wire level.

1039.981 View full episode →

← Previous Page 7 of 15 Next →

Report any issue