Al et al,
To be clear - the problem is that Google consumes bandwidth by
repeatedly downloading static documents, versus downloading dynamic
content whose index status might be new or dirty?
Lee Courtney
-----Original Message-----
From: cctalk-bounces at classiccmp.org
[mailto:cctalk-bounces at classiccmp.org] On Behalf Of Brad Parker
Sent: Sunday, May 13, 2007 5:12 PM
To: General Discussion: On-Topic and Off-Topic Posts
Subject: Re: Subject: Re: Manuals being scanned
Al Kossow wrote:
> Google is particularly bad about fetching documents over and over
> again.
mmm... any evidence they are using OCR to index pdf's?
of all the places I'd *like* them to OCR, it's bitsavers.
in fact, mmmmm, I'd like to connect the two dots. bitsavers
+ google (and any/all mit, stanford, cmu, ... software archives)
something to start mentioning at various fund raising
cocktail parties :-)
-brad