Scott Guthrie
Podcast Appearances
Aerial photography, we showed a video in my keynote this morning for the first time of the Atlanta facility.
They are the densest concentration of GPU power in the world.
And by densest, I mean the most GPUs.
And then the most GB, Grace Blackwell, GPUs, the latest, all liquid-cooled, which means that just from a pure token perspective,
you know, a Grace Blackwell 300 is roughly 12x the throughput of AWS Trainium 2.
Yeah.
So it's, you know, there might be one GPU each, but one is 12 times more powerful than the other.
Yeah, exactly, exactly.
And so you pack them all into one data center, and the data center is a flat network.
So it's basically, I think it was a two-tier flat network, as opposed to a traditional cloud data center,
which would usually be three tiers. With three tiers, when you want a GPU to talk to another GPU,
you're going through more switches, which means lower throughput, more latency.
We've tried to keep a very flat network so that each of these GPUs can interconnect with the others at incredibly high volumes.
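To make the two-tier versus three-tier comparison concrete, here is a minimal sketch that counts the switches on the longest GPU-to-GPU path. It assumes a folded Clos (leaf/spine style) layout, which the speaker does not confirm, and the per-switch latency figure is a hypothetical placeholder rather than a measured value.

```python
def worst_case_switch_hops(tiers: int) -> int:
    """Switches traversed on the longest path in a folded Clos network.

    Traffic climbs to the top tier and comes back down, so the top-tier
    switch is counted once and every lower tier is counted twice.
    """
    if tiers < 1:
        raise ValueError("tiers must be >= 1")
    return 2 * tiers - 1


# Hypothetical per-switch forwarding latency, for illustration only.
ASSUMED_SWITCH_LATENCY_NS = 500

for tiers in (2, 3):
    hops = worst_case_switch_hops(tiers)
    total_ns = hops * ASSUMED_SWITCH_LATENCY_NS
    print(f"{tiers}-tier: {hops} switches, ~{total_ns} ns of switching")
```

Under these assumptions, dropping from three tiers to two removes two switches from the worst-case path, which is the effect described above.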
And then part of what we've done to optimize speed even further is that these data centers, as you'll see in the video, are double-decker, meaning they're two floors.
And the reason we do that is we put the network core of the data center in the center of the building on the second floor.
And that means that no cable in the data center is more than about 230 meters.
Yeah.
Or 230 meters in length.
And that minimizes latency because you're only as fast as the speed of light.
But it also means that from a retransmission perspective, you're also really minimizing the number of retransmissions literally at the wire level.
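The speed-of-light point can be checked with a quick calculation. This is a rough sketch, not a figure from the talk: it assumes signals travel at about two thirds of the vacuum speed of light, which is typical for optical fiber, and it covers only propagation delay over the cable, not switching or serialization time.

```python
SPEED_OF_LIGHT_M_PER_S = 299_792_458  # exact value in vacuum

# Assumption: typical velocity factor for optical fiber (about 2/3 of c).
ASSUMED_VELOCITY_FACTOR = 0.67


def one_way_delay_ns(length_m: float,
                     velocity_factor: float = ASSUMED_VELOCITY_FACTOR) -> float:
    """One-way propagation delay in nanoseconds over a cable of length_m."""
    if length_m < 0 or not 0 < velocity_factor <= 1:
        raise ValueError("length must be >= 0 and 0 < velocity_factor <= 1")
    return length_m / (SPEED_OF_LIGHT_M_PER_S * velocity_factor) * 1e9


# The longest cable length mentioned in the conversation.
print(f"230 m: ~{one_way_delay_ns(230):.0f} ns one way")
```

With that assumption, a 230 meter run costs a little over one microsecond each way, so capping cable length puts a hard ceiling on the propagation part of the latency.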