Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing

Mark Graham

👤 Person
297 total appearances

Appearances Over Time

Podcast Appearances

Decoder with Nilay Patel
How the Wayback Machine is fighting linkrot

Where to start? I mean, those are some big questions. A very general statement that I can make is that about a third of the old web measured in, say, 10 or 15 years or something like that is gone. So about a third. In some cases, it's less, and in some cases, it's more. And certainly for an individual website that may have had millions of pages, like GeoCities, for example, It's 100% gone, right?

Decoder with Nilay Patel
How the Wayback Machine is fighting linkrot

Where to start? I mean, those are some big questions. A very general statement that I can make is that about a third of the old web measured in, say, 10 or 15 years or something like that is gone. So about a third. In some cases, it's less, and in some cases, it's more. And certainly for an individual website that may have had millions of pages, like GeoCities, for example, It's 100% gone, right?

Decoder with Nilay Patel
How the Wayback Machine is fighting linkrot

So it's just not there on the live web. But it turns out that in more than two-thirds of the cases that we've looked at where a given URL is no longer available, it is available through the Wayback Machine. So one way of looking at that is saying that instead of saying that maybe a third of the old web is gone, maybe a ninth of the old web is gone.

Decoder with Nilay Patel
How the Wayback Machine is fighting linkrot

So it's just not there on the live web. But it turns out that in more than two-thirds of the cases that we've looked at where a given URL is no longer available, it is available through the Wayback Machine. So one way of looking at that is saying that instead of saying that maybe a third of the old web is gone, maybe a ninth of the old web is gone.

Decoder with Nilay Patel
How the Wayback Machine is fighting linkrot

So it's just not there on the live web. But it turns out that in more than two-thirds of the cases that we've looked at where a given URL is no longer available, it is available through the Wayback Machine. So one way of looking at that is saying that instead of saying that maybe a third of the old web is gone, maybe a ninth of the old web is gone.

Decoder with Nilay Patel
How the Wayback Machine is fighting linkrot

And once again, these are very broad generalizations because much of that material was backed up and can be accessed through web.archive.org from the Wayback Machine. But you asked a different question.

Decoder with Nilay Patel
How the Wayback Machine is fighting linkrot

And once again, these are very broad generalizations because much of that material was backed up and can be accessed through web.archive.org from the Wayback Machine. But you asked a different question.

Decoder with Nilay Patel
How the Wayback Machine is fighting linkrot

And once again, these are very broad generalizations because much of that material was backed up and can be accessed through web.archive.org from the Wayback Machine. But you asked a different question.

Decoder with Nilay Patel
How the Wayback Machine is fighting linkrot

I don't know. They're getting different, right? So things are changing.

Decoder with Nilay Patel
How the Wayback Machine is fighting linkrot

I don't know. They're getting different, right? So things are changing.

Decoder with Nilay Patel
How the Wayback Machine is fighting linkrot

I don't know. They're getting different, right? So things are changing.

Decoder with Nilay Patel
How the Wayback Machine is fighting linkrot

So first of all, let's look a little bit like why things go away. There are very benign reasons why things go away. Maybe a company has simply gone out of business or a government has changed. And so there's a new administration. And so you would expect if a company goes out of business, what entity would want to keep that company's website alive, for example, or a publication, right?

Decoder with Nilay Patel
How the Wayback Machine is fighting linkrot

So first of all, let's look a little bit like why things go away. There are very benign reasons why things go away. Maybe a company has simply gone out of business or a government has changed. And so there's a new administration. And so you would expect if a company goes out of business, what entity would want to keep that company's website alive, for example, or a publication, right?

Decoder with Nilay Patel
How the Wayback Machine is fighting linkrot

So first of all, let's look a little bit like why things go away. There are very benign reasons why things go away. Maybe a company has simply gone out of business or a government has changed. And so there's a new administration. And so you would expect if a company goes out of business, what entity would want to keep that company's website alive, for example, or a publication, right?

Decoder with Nilay Patel
How the Wayback Machine is fighting linkrot

Thousands of local news organizations have shut down in the United States over the last 10 or 15 years, for example. News organizations, media organizations are shut down by governments when they go out of favor. When the failed coup happened in Turkey a few years ago, Wikipedia has documented about 150 media organizations were shut down.

Decoder with Nilay Patel
How the Wayback Machine is fighting linkrot

Thousands of local news organizations have shut down in the United States over the last 10 or 15 years, for example. News organizations, media organizations are shut down by governments when they go out of favor. When the failed coup happened in Turkey a few years ago, Wikipedia has documented about 150 media organizations were shut down.

Decoder with Nilay Patel
How the Wayback Machine is fighting linkrot

Thousands of local news organizations have shut down in the United States over the last 10 or 15 years, for example. News organizations, media organizations are shut down by governments when they go out of favor. When the failed coup happened in Turkey a few years ago, Wikipedia has documented about 150 media organizations were shut down.

Decoder with Nilay Patel
How the Wayback Machine is fighting linkrot

We have a collection of four websites, four news sites from Hong Kong, for example. Apple Daily was one that were shut down for political reasons. In all of those cases, we have really good archives of that material. We have, for example, a full text searchable index of about a million pages from Gawker.

Decoder with Nilay Patel
How the Wayback Machine is fighting linkrot

We have a collection of four websites, four news sites from Hong Kong, for example. Apple Daily was one that were shut down for political reasons. In all of those cases, we have really good archives of that material. We have, for example, a full text searchable index of about a million pages from Gawker.

Decoder with Nilay Patel
How the Wayback Machine is fighting linkrot

We have a collection of four websites, four news sites from Hong Kong, for example. Apple Daily was one that were shut down for political reasons. In all of those cases, we have really good archives of that material. We have, for example, a full text searchable index of about a million pages from Gawker.