Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing

Dwarkesh Patel

๐Ÿ‘ค Speaker
15267 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

What has changed that has allowed NVIDIA to go from Hopper was 8, then Blackwell is 72, and now...

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

the cost of figuring out which cable hops to which, or like which signal.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Excuse me, I have a question.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

But if you look at a physical data center,

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Seems like there's a lot of space within a rack.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

I don't know, just like the cables are like really big.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

It's literally the physical space to put a cable that's constraining it.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

I had no idea.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Interesting.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

That seems surprising.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

The rack is so big, and we can't just stuff more cables in there.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Deep work is by its nature quite aversive.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

So even things which seem like work, like Slack and email, can be easy ways to distract yourself.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

So I often wish that I could just turn the internet off.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

But if I'm prepping for an interview, even if I have the papers and books on hand, it's still super useful to be able to do a back and forth on the LLM so I can break down concepts and research follow ups.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

Google's new Gemma 4 is the first open model that allows me to have this kind of fully disconnected focus machine.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

It's small enough to run on my laptop, but good enough to actually be useful.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

So to prep for this episode, I downloaded Reiner's scaling book and shut off the internet.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

I was able to have Gemma help me understand the material and answer my questions.

Dwarkesh Podcast
Reiner Pope โ€“ The math behind how LLMs are trained and served

If you want an LLM that you can run locally on your laptop or even your phone, you should check out Gemma 4.