Reiner Pope
So I drew this as one router.
In reality, you would actually have many copies of the router.
And so you would have as many routers as GPUs, in fact, handling the incoming traffic.
Yeah.
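[For reference, here is a minimal sketch of what each of those per-GPU routers computes. This is illustrative, not the speaker's implementation; the top-1 expert choice and the shapes are assumptions.]

```python
import jax.numpy as jnp

def route(tokens, router_weights):
    # tokens: [tokens_per_device, d_model]; router_weights: [d_model, num_experts].
    # Every device holds its own copy of the (small) router weights and
    # routes only the tokens that arrived on that device.
    logits = tokens @ router_weights   # score each token against each expert
    return jnp.argmax(logits, axis=-1) # top-1 expert id per token (assumed top-1)

expert_ids = route(jnp.ones((8, 16)), jnp.zeros((16, 4)))  # toy shapes
```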
So these are 64 GPUs, and these are 64 GPUs. They're actually the same GPUs; we just draw them as separate because they're serving different purposes.
So at this point, any GPU can be sending to any other GPU.
So this all-to-all pattern of communication that shows up in how the Blackwell racks are configured is a perfect fit for the communication pattern that the MoE actually wants to do.
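[As a concrete illustration of that dispatch pattern, here is a minimal JAX sketch, not the speaker's actual code, in which every device exchanges a chunk of its tokens with every other device over one mesh axis. NUM_DEVICES, CAPACITY, D_MODEL, and the axis name "x" are toy assumptions.]

```python
import jax
import jax.numpy as jnp
from jax.sharding import Mesh, PartitionSpec as P
from jax.experimental.shard_map import shard_map

NUM_DEVICES = jax.device_count()  # e.g. the 64 GPUs in one rack
CAPACITY = 4                      # tokens each device sends to each peer (assumed)
D_MODEL = 16                      # toy model width

mesh = Mesh(jax.devices(), axis_names=("x",))

def dispatch(tokens):
    # tokens: [NUM_DEVICES * CAPACITY, D_MODEL] on each device. A real MoE
    # would first sort tokens into per-destination chunks using the router's
    # expert ids; here we show only the exchange itself. Chunk i on every
    # device is sent to device i; each device then holds one chunk from
    # every peer, concatenated along the leading axis.
    return jax.lax.all_to_all(tokens, "x", split_axis=0, concat_axis=0, tiled=True)

dispatch = shard_map(dispatch, mesh=mesh, in_specs=P("x"), out_specs=P("x"),
                     check_rep=False)  # some JAX versions need this for collectives

tokens = jnp.zeros((NUM_DEVICES * NUM_DEVICES * CAPACITY, D_MODEL))
out = dispatch(tokens)  # same global shape; chunks permuted device-to-device
```

[Within one rack, every leg of this exchange runs over the NVLink domain at full speed, which is the "perfect fit" being described.]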
However, if you think maybe one rack is too slow and you want to do two racks, then you have this challenge that you've now got some sort of rack boundary drawn out here, like this, and you no longer, in fact, have all-to-all communication between all the GPUs in the two racks.
And so the rack-to-rack communication ends up being a substantial bottleneck.
So the fundamental thing here is that one rack actually bounds the size of the expert layer you can do.
And so this has been part of what's been driving towards larger and larger interconnect domains.
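[A rough back-of-envelope shows why the rack boundary dominates. The bandwidth figures below are assumed, illustrative numbers, not measured values: on the order of 900 GB/s per GPU over in-rack NVLink versus ~50 GB/s per GPU over a cross-rack NIC.]

```python
NVLINK_GBPS_PER_GPU = 900.0   # assumed in-rack bandwidth per GPU, GB/s
NETWORK_GBPS_PER_GPU = 50.0   # assumed cross-rack bandwidth per GPU, GB/s

# With experts spread over two racks, roughly half of every GPU's
# all-to-all dispatch traffic must cross the rack boundary.
frac_cross_rack = 0.5
bytes_per_gpu = 1.0  # normalize: 1 GB of dispatch traffic per GPU

t_one_rack = bytes_per_gpu / NVLINK_GBPS_PER_GPU
t_two_rack = (bytes_per_gpu * frac_cross_rack) / NETWORK_GBPS_PER_GPU

print(f"one rack:  {t_one_rack*1e3:.2f} ms per GB of dispatch traffic")
print(f"two racks: {t_two_rack*1e3:.2f} ms "
      f"(cross-rack leg dominates, {t_two_rack/t_one_rack:.0f}x slower)")
```

[Under these assumed numbers, the cross-rack leg is roughly 9x slower than the in-rack exchange, which is why one rack effectively caps the expert layer.]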
Yeah, and this is a place where it starts to be very different, in fact, between NVIDIA, for example, and Google, and then others, including us.
So generally, a rack is a physical structure.