Antonio Carlini wrote:
Good idea, although keeping track of a large number of
documents
in a thread like that will not scale well. You want to scrape
the data and turn it into a web-accessible database.
Like
http://vt100.net/manx, but for all docs and not
just DEC ones.
Manx is intended to cover more than just DEC documents, but I've been
concentrating on populating those first; there are over 4000 manuals in
the catalogue so far, of which over 1500 are online.
Manx already has a database structure that can accommodate multiple
companies and plenty of information about each manual, including
abstracts, table of contents and full text index.
I suppose I really need a way for third parties to add items to Manx. It
can already import tab-separated-variable files, which makes it very
easy to do a load of searching for new manuals and then add the results.
I'll look into this.
- Paul