Dwarkesh Patel
Podcast Appearances
Or what does that tell us about...
What are we trying to learn here?
What does that actually tell us?
What variable does it help us clamp?
Well, the compute has presumably gotten five... Like, the only thing that could have changed is that the compute is 5x more expensive as a result.
Sorry, I'm not sure I understood.
This is...
This is for processing the next token in the prefix?
So this is 5x more expensive?
Input is 5x more expensive?
No, output is more expensive.
Output is 5x more expensive.
Why don't we do this?
Why don't we have... Why don't we just chart it with, like, len pass on the x-axis?
Yep, yeah.
T on the y-axis.
Mm-hmm.
Yeah, that'll be right.
Okay, so...
Okay.