Dwarkesh Patel

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

So if you can, I would really recommend watching it on a video platform like YouTube.

56.189 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Okay, full disclosure, I am an angel investor in Maddox, but that's unrelated to this podcast.

61.439 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Reiner, maybe to kick us off, I'll ask this question.

68.131 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

So, we have a couple of companies like Claude and Codex and Cursor offering something like Fast Mode, where for 6x the price, they'll stream you tokens at 2.5x the speed.

70.676 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Mechanically, I'm curious what's going on here.

81.436 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Why is it the case that you can pay more to get faster latency?

83.618 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

And two, could you keep going?

87.282 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Could you pay 100x more and somehow get even faster speeds or much, much faster speeds?

89.404 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

And three, could you go the other way?

95.331 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Could you have something like cloud code slow mode where if you are willing to wait for minutes on end, you could get even cheaper prices?

96.832 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

So maybe this will help motivate the kind of analysis that you'll be doing through the lecture.

106.082 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Maybe I'll just interrupt from time to time to ask some very naive questions or to clarify some basic points, but...

252.957 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

just for the audience, you're not serving one user at a time.

258.185 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

The batch refers to the fact that you're serving many different users at the same time.

261.349 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

Yeah.

265.053 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

And that's a whole batch.

265.774 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

And maybe just back in, let's just explain what the KV cache is real quick.

356.801 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

It seems like the way you've drawn the slopes for...

644.126 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

compute time and how the kb grows and what implication the kb has on memory uh time that as yeah what if this were above or below or yeah or is that necessarily the case because if this is always true then this batch size grows compute always dominates uh kb and which which suggests that if you have big enough batch size maybe memory is never an issue

647.71 View full episode →

Dwarkesh Podcast

Reiner Pope – The math behind how LLMs are trained and served

And is there something especially significant about the slope being exactly the slope of the...

684.652 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment