Noam Shazeer
And that's not to say we should exactly mimic that, because silicon and wetware have very different characteristics and strengths.
But I do think one thing we could draw more inspiration from is this notion of having different specialized portions, areas of a model, like areas of a brain, that are good at different things.
So we have a little bit of that in mixture-of-experts models, but it's still very structured. And I feel like you want a more organic growth of expertise: when you want more expertise of some kind, you add some more capacity to the model there and let it learn a bit more of that kind of thing.
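The structured version being contrasted with here can be sketched as a small top-2 gated mixture-of-experts layer. This is a minimal illustrative sketch in NumPy, not any particular production implementation; all names, shapes, and the choice of top-2 routing are assumptions for the example.

```python
import numpy as np

def top2_gate(x, w_gate):
    """Score every expert for this token and keep the two highest-scoring ones."""
    logits = x @ w_gate                        # one score per expert
    top = np.argsort(logits)[-2:][::-1]        # indices of the top-2 experts
    probs = np.exp(logits[top] - logits[top].max())
    probs /= probs.sum()                       # renormalized weights over the pair
    return top, probs

def moe_forward(x, w_gate, experts):
    """Combine the selected experts' outputs, weighted by their gate probability."""
    top, probs = top2_gate(x, w_gate)
    return sum(p * experts[e](x) for e, p in zip(top, probs))

# Toy setup: 4 experts, each a random linear map on an 8-dim hidden vector.
rng = np.random.default_rng(0)
d, n_experts = 8, 4
w_gate = rng.normal(size=(d, n_experts))
experts = [(lambda W: (lambda v: v @ W))(rng.normal(size=(d, d)))
           for _ in range(n_experts)]
y = moe_forward(rng.normal(size=d), w_gate, experts)
print(y.shape)  # (8,)
```

The "organic growth" idea would amount to appending new entries to `experts` (and new columns to `w_gate`) where routing demand is high, rather than fixing the expert count up front.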
And this notion of adapting the connectivity of the model to the connectivity of the hardware is also a good one.
So I think you want incredibly dense connections between artificial neurons on the same chip, in the same HBM, because that doesn't cost you much.
But then you want a smaller number of connections to nearby neurons.
So a chip away, you should have some number of connections. And many, many chips away, you should have a smaller number of connections, where you send, over a very limited, bottlenecked channel, the most important things this part of the model is learning, for other parts of the model to make use of.
And even across multiple TPU pods, you'd like to send even less information, but the most salient kind of representations.
And then across metro areas, you'd like to send even less.
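The tiers above can be summarized as channel widths that shrink with topological distance. The tier names and fractions below are purely hypothetical numbers chosen to illustrate the shape of the idea, not anything from a real system.

```python
# Illustrative only: the fraction of the hidden dimension exposed to other
# parts of the model shrinks as topological distance grows.
HIDDEN_DIM = 8192

BANDWIDTH_FRACTION = {
    "same_chip":   1.0,    # dense connections within one chip / its HBM
    "nearby_chip": 0.25,   # some connections one chip away
    "same_pod":    0.05,   # fewer across many chips in a pod
    "cross_pod":   0.01,   # only the most salient representations between pods
    "cross_metro": 0.002,  # even less between metro areas
}

def channel_width(tier):
    """Number of features sent to parts of the model at `tier` distance."""
    return max(1, int(HIDDEN_DIM * BANDWIDTH_FRACTION[tier]))

for tier in BANDWIDTH_FRACTION:
    print(tier, channel_width(tier))
```

The point of the structure is that communication cost, not parameter count, sets the budget: the narrow cross-pod and cross-metro channels force those links to carry only compressed, high-salience representations.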
Yeah, I'd like that to emerge organically.
Like, you could hand-specify these characteristics, but I think you don't know exactly what the right proportions of these kinds of connections are.
And so you should just let the hardware dictate things a little bit.
Like, if you're communicating over here and this data always shows up really early, you should add some more connections. Then it'll take longer and show up at just the right time.
And I think there's a notion of how much compute you want to spend on this particular inference.
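One common way to make per-inference compute adaptive is early exit: stop applying layers once an intermediate prediction is confident enough, so easy inputs use less compute than hard ones. This is a generic illustrative sketch of that technique, not a description of any specific model; the confidence threshold and toy layers are assumptions.

```python
import numpy as np

def adaptive_forward(x, layers, classifier, threshold=0.9):
    """Apply layers until the intermediate prediction is confident, then exit."""
    for depth, layer in enumerate(layers, start=1):
        x = np.tanh(layer @ x)
        probs = np.exp(classifier @ x)
        probs /= probs.sum()                 # softmax over class scores
        if probs.max() >= threshold:         # confident enough: skip the rest
            return probs, depth
    return probs, len(layers)                # used the full depth

# Toy setup: 8 random layers over a 16-dim vector, 4 output classes.
rng = np.random.default_rng(1)
d, n_classes = 16, 4
layers = [rng.normal(size=(d, d)) * 0.5 for _ in range(8)]
classifier = rng.normal(size=(n_classes, d))
probs, depth_used = adaptive_forward(rng.normal(size=d), layers, classifier)
print(depth_used)  # layers actually spent on this input, at most 8
```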