Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Noam Shazeer

๐Ÿ‘ค Speaker
See mentions of this person in podcasts
692 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Jeff Dean & Noam Shazeer โ€“ 25 years at Google: from PageRank to AGI

And, you know, that's part of kind of one of the things that PathWave was designed to support is

Dwarkesh Podcast
Jeff Dean & Noam Shazeer โ€“ 25 years at Google: from PageRank to AGI

you have these components and the components can be in variable cost.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer โ€“ 25 years at Google: from PageRank to AGI

And you kind of can say, for this particular example, I want to go through this subset of the model.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer โ€“ 25 years at Google: from PageRank to AGI

And for this example, I want to go through this subset of the model and have the system kind of orchestrate that.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer โ€“ 25 years at Google: from PageRank to AGI

But it also means, I think, you don't necessarily need to grow your entire model footprint to be the size of a data center.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer โ€“ 25 years at Google: from PageRank to AGI

You might want it to be a bit below that.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer โ€“ 25 years at Google: from PageRank to AGI

and then have, you know, potentially many replicated copies of one particular expert that is being used a lot so that you get better load balancing.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer โ€“ 25 years at Google: from PageRank to AGI

Right, interesting.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer โ€“ 25 years at Google: from PageRank to AGI

So, like, this one's being used a lot because we get a lot of math questions, and this one on, you know, maybe it's an expert on Tahitian dance, and it is called on really rarely.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer โ€“ 25 years at Google: from PageRank to AGI

That one maybe you even page out to DRAM rather than putting it in HBM.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer โ€“ 25 years at Google: from PageRank to AGI

But you want the system to kind of figure all this stuff out based on load characteristics.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer โ€“ 25 years at Google: from PageRank to AGI

Yeah.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer โ€“ 25 years at Google: from PageRank to AGI

I mean, you're starting to see some of this by having a lot of uses of Gemini models across Google that are not necessarily, you know, fine-tuned.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer โ€“ 25 years at Google: from PageRank to AGI

They're just sort of

Dwarkesh Podcast
Jeff Dean & Noam Shazeer โ€“ 25 years at Google: from PageRank to AGI

you know, given instructions for this particular use case and this feature in this product setting.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer โ€“ 25 years at Google: from PageRank to AGI

So I definitely see a lot more sharing of what the underlying models are capable of across more and more services.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer โ€“ 25 years at Google: from PageRank to AGI

You know, I do think that's a pretty interesting direction to go for sure.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer โ€“ 25 years at Google: from PageRank to AGI

Yeah, and I think you might see that might be a big base model, and then you might want customized versions of that model with different modules that are added onto it for different settings that maybe have access restrictions.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer โ€“ 25 years at Google: from PageRank to AGI

Like maybe we have an internal one for Google use for Google employees that we've trained some modules on internal data, and we don't allow anyone else to use those modules.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer โ€“ 25 years at Google: from PageRank to AGI

we can make use of it and maybe other companies you add on other modules that are useful for that company setting and serve it in our cloud APIs.