Reiner Pope

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

It's a little bit nicer actually.

5105.653 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

So data in pre-training plus this, oh, I didn't have the inefficiency over here either.

5107.035 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Inefficiency data in pre-training plus some multiple of like alpha times the data in RL

5116.569 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

is just going to end up equal to the sum of beta times the data in inference.

5126.42 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

And then let's just roughly size the alpha.

5137.987 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

This alpha, it's going to be...

5140.372 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Uh, this is like the, it's maybe somewhere in the range of two to six, uh, two to six over six, um, from this term compared to this term.

5144.932 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Um, and then we've got an inefficiency term, which, uh, I would say is maybe in the range of like 30%, something like that.

5153.247 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Um, so, uh, so, so this alpha is going to be something like, um, one on 10, one over 10, let's say.

5159.839 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

And this beta here is actually the same.

5168.523 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

It's a third.

5172.088 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

It's one third times 30%.

5173.289 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

So it's also equals 1 in 10.

5174.871 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Something like that.

5178.616 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Yeah, okay.

5184.824 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

We can make this like 2 in 10.

5185.525 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Make it a bit bigger.

5186.787 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

So, yeah, just write it out once more.

5189.11 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

This is 2 over 10.

5191.332 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

This is 1 over 10.

5194.076 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment