Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Reiner Pope

๐Ÿ‘ค Speaker
1157 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Yeah, I mean, let's sort of zoom in on this and look at the wire density.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

I'll draw this diagram just once more so we have a bit of a cleaner version to work with and a larger version.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Let's say I have some switches in the middle.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

And let's say I'm going to have, initially I'm going to start with just two GPUs on each side or two trays of GPUs on each side.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

And let's say maybe each tray wants to have two cables coming out of it.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

So I get some kind of, I physically run vertical cables that look like this running onto the switches.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Now, if I want to double the number of GPUs in a rack,

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

I need to run like literally twice the density of cables.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

So I need to run these as well.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Yeah, so there is space outside the rack.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Inside the rack, like these racks are like, I mean, as they become more optimized, these racks are very tight.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

So there's connector density going from the tray into the rack and the rack's backplane.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

And then the backplane itself has a really high density.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

There are other physical constraints, including bend radius of cables.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

You don't want to snap them and so on.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Yeah, so rack design is not my expertise, but when I talk to folks on what are the constraints they're up against, it's a combination of what are the big physical things you're optimizing for,

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Space, weight of the rack, like it's actually really heavy.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

And so like you need enough metal top to not sag and fall.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

But then you add more metal and it's heavier.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

And then power and cooling.