Al et al,
To be clear - the problem is that Google consumes bandwidth by
repeatedly downloading static documents, verses downloading dynamic
content whose index status might be new or dirty?
Lee Courtney
  -----Original Message-----
 From: cctalk-bounces at 
classiccmp.org
 [mailto:cctalk-bounces at 
classiccmp.org] On Behalf Of Brad Parker
 Sent: Sunday, May 13, 2007 5:12 PM
 To: General Discussion: On-Topic and Off-Topic Posts
 Subject: Re: Subject: Re: Manuals being scanned
 Al Kossow wrote:
Google is particularly bad about fetching documents over and over
again. 
 mmm...  any evidence they are using OCR to index pdf's?
 of all the places I'd *like* them to OCR, it's bitsavers.
 in fact, mmmmm, I'd like to connect the two dots.  bitsavers
 + google (and, and/all mit, standford, cmu, ... software archives)
 something to start mentioning at various fund raising
 cocktail parties :-)
 -brad