Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing

Dwarkesh Patel

๐Ÿ‘ค Speaker
15267 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Or what does that tell us about...

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

What are we trying to learn here?

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

What does that actually tell us?

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

What variable does it help us clamp?

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Well, the compute has presumably gotten five... Like, the only thing that could have changed is the compute is 5x more expensive as a result.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Sorry, I'm not sure I understood.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

This is...

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

This is for processing the next token in prefix?

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

So this is 5x more expensive?

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Input is 5x more expensive?

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

No, output is more expensive.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Output is 5x more expensive.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Why don't we do this?

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Why don't we have... Why don't we just chart it with, like, len pass on the x-axis?

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Yep, yeah.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

T on the y-axis.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Mm-hmm.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Yeah, that'll be right.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Okay, so...

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Okay.