Reiner Pope

👤 Speaker

1157 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

So I guess we want the cost per token, in fact.

6264.181 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Or the time per token.

6266.945 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Well, actually, for processing the entire batch.

6276.961 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

So, at this cost, we have processed this many tokens, like, let it pre-fill.

6279.206 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Yeah.

6285.36 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Well, I guess, pre-fill, yeah, like, of the paths.

6285.981 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Yeah.

6289.529 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Not this prefix, but it's this cost.

6290.391 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Okay, let's proceed to the paths.

6292.436 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

So the result we want to work towards is that pre-fill is compute limited and decode is memory bandwidth limited.

6308.439 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

T... We want the cost per token, so it'll be T over some stuff.

6325.76 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

T over length of the pass.

6330.927 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

But then why is it cheaper?

6369.097 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Why does it cost higher?

6370.62 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Yeah, yeah.

6371.561 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

So, I mean, we're going to... It's this division by length pass that actually makes it all...

6372.723 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

So... Okay, yeah, this is going to divide out, but then we're going to get... All of this is going to divide by length of pass, and it's going to make the memory cost cheaper.

6380.165 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Length of the pass, when it's one, that is decode.

6408.903 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

When it is bigger, that is pre-file.

6412.507 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Okay, I see, I see, I see.

6414.128 View full episode →

← Previous Page 47 of 58 Next →

Report any issue