This isn't strictly on-topic, but I believe it's important to many
people here. I just learned about this on another list.
HP developed an OCR engine called Tesseract that is supposed to be
pretty good. They released it to the open-source world, and Google has
picked it up and started working on it. The code itself is available
via SourceForge. Here is the announcement:
http://google-code-updates.blogspot.com/2006/08/announcing-tesseract-
ocr.html
With all the document preservation activities going on these days, in
our circles as well as others, this may be a significant development.
-Dave
--
Dave McGuire
Cape Coral, FL