Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Reiner Pope

๐Ÿ‘ค Speaker
1157 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

It's a little bit nicer actually.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

So data in pre-training plus this, oh, I didn't have the inefficiency over here either.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Inefficiency data in pre-training plus some multiple of like alpha times the data in RL

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

is just going to end up equal to the sum of beta times the data in inference.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

And then let's just roughly size the alpha.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

This alpha, it's going to be...

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Uh, this is like the, it's maybe somewhere in the range of two to six, uh, two to six over six, um, from this term compared to this term.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Um, and then we've got an inefficiency term, which, uh, I would say is maybe in the range of like 30%, something like that.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Um, so, uh, so, so this alpha is going to be something like, um, one on 10, one over 10, let's say.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

And this beta here is actually the same.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

It's a third.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

It's one third times 30%.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

So it's also equals 1 in 10.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Something like that.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Yeah, okay.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

We can make this like 2 in 10.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Make it a bit bigger.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

So, yeah, just write it out once more.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

This is 2 over 10.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

This is 1 over 10.