From: "Vintage Computer Festival" <vcf at
siconic.com>
On Thu, 19 May 2005, Dwight K. Elvey wrote:
In any case, these are all academic in
comparison to the problems
of indexing. I don't even have the beginings of how to deal
with that problem.
Google :)
Hi
It works surprisingly well but it still misses a lot.
Like when I was looking for the data sheets of the WD1100V-01.
The information was out there, it just wasn't indexed.
Most document writing programs today have that automatic
indexing by marking things as you go along to place in
the index. It requires that someone actually realizes
what needs to be indexed. Then comes the problem of cross
references. Add to that synonyms.
I was looking through the directories of one of the images
I'd captured from the Polymorphic stuff and found that
a disk labeled "GAMES" contained a version of Forth.
That may have been the persons personal feelings
about it but it was not good indexing.
My guess is that Google is missing 90 to 95% of the
relevant information out there. If you include site links
on individual pages that improves to about 85% at best.
Now, add to that the problem of something that exist
but gets somehow placed in the wrong place.
Indexing will be the biggest challenge!
Dwight