Reiner Pope

👤 Speaker

1157 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

this is really i don't have these numbers memorized but generally as you go to slower tiers uh flash is plausibly in the order of one minute um and then like spinning disk which is massively different i think is on the order of one hour so this might actually identify that the tiers are probably flash and spinning disk sorry why why is this the calculations the storage cap divided by the bandwidth so um you've got a bunch of different memory tiers like we've listed four of them um

7308.914 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

uh the your choice like your choice of which memory tier is like you want to minimize the cost yeah um and so you are like what fraction of the device are you using you're using some fraction of the device for the holding onto it and then using some fraction of the device to retrieve it um and so let's say

7336.1 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

I'm using like 10% of the device and I want to equalize those two fractions.

7357.936 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

That's a sign that I've hit the right thing.

7362.884 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

So let's say I've got some runtime here, like I'm going to hold on for all of this time.

7365.829 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

And then, so this is the time hold.

7370.536 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

And then there's going to be some amount of time here, which is time retrieve.

7375.344 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

And I want, I mean, basically to equalize the costs, these two costs, um, I want the retrieval time to be equal to the hold time, uh, times the like fraction of capacity.

7382.523 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

because this is the retrieval time.

7402.722 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Yeah, I mean, this is how many other things I can hold simultaneously.

7406.11 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

I think that probably indicates that this is the two tiers of Flash and Spinning Disk.

7423.391 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

I'm kind of shocked to see Spinning Disk being used at all because it's such an old technology.

7427.943 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Yeah.

7432.535 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

It's a really unattractive technology, but it's useful in some places.

7438.351 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Yeah.

7515.344 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

So, I mean, like, the mixing, like, I try to look for other examples where mixing, like, scrambling mixing shows up as well.

7516.826 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

There's actually almost even, like, a physical example where...

7523.618 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Like you're stirring something, you're making a cake and you want to stir the batter.

7526.703 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

And like literally the idea, like first stir it this way and then stir it this way is like actually not too bad of an approach.

7529.908 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

But beyond that, like back to the digital world, there are some differences.

7535.836 View full episode →

← Previous Page 54 of 58 Next →

Report any issue