Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing

Noam Shazeer

👤 Person
692 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

It's very helpful to have a 1,000th scale problem and then vet 100,000 ideas on that and then scale up the ones that seem promising.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

I mean, I think one thing people should be aware of is the improvements from generation to generation of these models often are partially driven by hardware and larger scale, but equally and perhaps even more so driven by major algorithmic improvements and major changes in the model architecture and the training data mix and so on that really make the model better per flop that is applied to the model.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

So I think that's a good realization.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

And then I think if we have automated exploration of ideas, we'll be able to vet a lot more ideas and bring them into kind of the actual, you know, production training for next generations of these models.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

And that's going to be really helpful because that's sort of what we're currently doing with a lot of machine learning research.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

Brilliant machine learning researchers is looking at lots of ideas.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

you know, winnowing ones that seem to, to work well at small scale, seeing if they work well at medium scale, bringing them into larger scale experiments, and then like settling on like adding a whole bunch of new and interesting things to the, to the final model recipe.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

Um, and then I think if we can do that, you know, a hundred times faster through, uh,

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

those machine learning researchers just gently steering a more automated search process rather than sort of hand babysitting lots of experiments themselves, that's gonna be really, really good.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

For that, more hardware is a good solution and better hardware.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

Yeah, I mean, I've been pretty excited lately about how could we dramatically speed up the chip design process.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

Because as we were talking earlier, the current way in which you design a chip takes you roughly 18 months to go from we should build a chip to something that you then hand over to TSMC and then TSMC takes four months to fab it and then you get it back and you put it in your data centers.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

So that's a pretty lengthy cycle

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

And the fab time in there is a pretty small portion of it today.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

But if you could make that the dominant portion so that instead of taking 12 to 18 months to design the chip, and with 150 people, you could shrink that to a few people with a much more automated search process, exploring the whole design space of chips and getting feedback from all aspects of the chip design process

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

for the kind of choices that the system is trying to explore at the high level, then I think you could get, you know, perhaps much more exploration and more rapid design of something that you actually want to give to a fab.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

And that would be great because you can shrink that time, you can shrink the deployment time by kind of designing the hardware in the right way so that you just get the chips back and you just plug them in to some system.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

And that will then, I think, enable a lot more specialization.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

It will enable a shorter timeframe for the hardware design so that you don't have to look out quite as far into what kind of ML algorithms would be interesting.

Dwarkesh Podcast
Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

Instead, it's like you're looking at six to nine months from now, what should it be, rather than two, two and a half years.