Internet Archive and robots.txt

Antonio Carlini a.carlini at
Fri Jul 3 07:27:15 CDT 2020

When I try 
in I get the usual:

"This URL has been excluded from the Wayback Machine."

That's supposed to be because robots.txt prevents spidering, so the 
Internet Archive takes down the pages (even if they were previously 
available, it seems).

But is back and if you go far enough down you'll see that they know where the domain 
came from.

So if whoever now controls could be persuaded to ask, would 
the Internet Archive allow those pages back out into the 
open again?

(I'm asking here because I think there's at least one person on this 
list who might be able to provide a reasonably authoritative answer.)

I did happen to notice that is back too ...


Antonio Carlini
antonio at

More information about the cctalk mailing list