Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing

Noam Shazeer

👤 Person
692 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

So it's not like the company was going to do this one little thing and stay doing that.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

And also, you could see that what we were doing initially was in that direction, but you could do so much more in that direction.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

I mean, I think of it as actually changing quite a bit in the last couple of decades.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

So like the two decades ago to one decade ago, it was awesome because you just like wait and like 18 months later, you get much faster hardware and you don't have to do anything.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

And then more recently,

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

You know, I feel like the general purpose CPU-based machines scaling has not been as good.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

Like the fabrication processes improvements are now taking three years instead of every two years.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

The architectural improvements in, you know, multi-core processors and so on are, you know, not giving you the same boost that we were getting, you know, 20 to 10 years ago.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

But I think at the same time, we're seeing improvements

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

much more specialized computational devices like machine learning accelerators, TPUs, very ML-focused GPUs more recently, are making it so that we can actually get really high performance and good efficiency out of the more modern kinds of computations we want to run that are different than a twisty pile of C++ code trying to run Microsoft Office or something.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

Well, I would say that the pivot to hardware oriented around that was an important transition.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

Because before that, we had CPUs and GPUs that were not, you know, especially well suited for deep learning.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

And then, you know, we started to build, say, TPUs at Google that were really just reduced precision linear algebra machines.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

And then once you have that, then you want to... Right.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

I know, by the way, the arithmetic can be like really low precision, so then you can squeeze even more multiplier units in.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

You'd have a lot more lookups into very large memories.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

Yeah.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

I mean, I think one thing, one general trend is we're getting better at quantizing or having much more reduced precision models.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

You know, we started with TPU v1.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

We weren't even quite sure we could quantize a model for serving with 8-bit integers.