paper -> HTML (and The First PC)

29 Dec 1998

I struggled for a bit trying to convert paper to HTML, but found it an
awkward task.  I'm sure the state of the art has advanced beyond:
        1) do a color scan to grab images
        2) clean up images
        3) resize based on a guess at a good size and resolution for web pages
        4) scan again as B/W line art
        5) OCR
        6) clean up OCR
        7) create HTML combining OCR'd text and images
I don't much like PDF for web docs, so an HTML solution would be best.  It
looks like the "pro" version of Xerox's OCR software might automate the
task somewhat.  Any recommendations?
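
For what it's worth, step 7 is the easiest of the seven to script. Below is a
minimal sketch in Python, assuming the OCR pass has already produced plain
text with blank lines between paragraphs and the cleaned-up images are already
saved as files. The function name build_page and its arguments are made up
for illustration; this is not how any particular OCR package does it.

```python
from html import escape


def build_page(title, ocr_text, image_files):
    """Combine OCR'd text and image file names into one simple HTML page.

    Hypothetical helper: assumes blank lines in ocr_text mark paragraph
    breaks and that image_files are paths relative to the page.
    """
    parts = [
        "<html>",
        "<head><title>%s</title></head>" % escape(title),
        "<body>",
    ]
    # Images first, one per paragraph; the file name doubles as alt text.
    for name in image_files:
        safe = escape(name, quote=True)
        parts.append('<p><img src="%s" alt="%s"></p>' % (safe, safe))
    # Re-join lines the scanner wrapped, then escape &, <, > for HTML.
    for chunk in ocr_text.split("\n\n"):
        paragraph = " ".join(chunk.split())
        if paragraph:
            parts.append("<p>%s</p>" % escape(paragraph))
    parts += ["</body>", "</html>"]
    return "\n".join(parts)


if __name__ == "__main__":
    print(build_page("Simon", "First  line\nwrapped.\n\nSecond & last.",
                     ["simon.gif"]))
```

This leaves the hard parts (steps 1 through 6) untouched; it only shows that
once the text and images are clean, stitching them together is mechanical.
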
In any case, here's a picture of Simon, the first personal computer from
~1950:
        http://www.yowza.com/classiccmp/berkeley/simon.gif
More info will be made available as I get this scanning stuff down to a
science.
-- Doug
