Reiner Pope

👤 Speaker

1157 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

if you're going to hold on to it for a very short amount of time yeah then the um all of this is like multiplied by the um hold time yep this one is and so is this one um and interestingly they have different prices to write for and as you specify this in the api for five minutes versus an hour

7161.95 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Yeah, which suggests that the five minutes is HBM and the hour is DDR.

7187.228 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

I think that's a pretty good assumption.

7191.135 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

If you look at the numbers, it might also turn out that it's one tier down and it's DDR versus Flash.

7193.919 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Yeah, okay, interesting.

7199.128 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

So actually, we might actually be able to determine which memory tier it is by the durations, actually.

7235.741 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Yeah, exactly.

7246.654 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

I think this will probably end up being...

7247.395 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

it's going to be the drain time of the memory tier that you're in.

7251.64 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

And so what that means is like, given that I know I'm going to be holding something for five minutes, I would like to pick a memory that I can read every five minutes.

7254.745 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Like I can read the whole memory once per five minutes, ballpark.

7267.545 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

So that is the drain time of the memory.

7271.071 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

So if I take all the storage capacity over storage bandwidths,

7272.353 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

bandwidth, I would like this to be like equal to five minutes or something like that.

7279.4 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

And so actually we did this calculation for HBM.

7285.246 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

For HBM, we know that this number is 20 milliseconds.

7287.388 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

So HBM is much too short, like much too small.

7290.791 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

DDR could be about an order of magnitude or two off from this.

7295.176 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

And so this is probably in the order of like, actually, I think it might even be in the seconds, like one to 10 seconds.

7299.8 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

And then

7306.367 View full episode →

← Previous Page 53 of 58 Next →

Report any issue