If you've OCRed the data, HTML is probably fine
for pure text.
Ugh! Please, no! For pure text, use - pure text!
HTML is maybe appropriate for serving it up on the Web, but even there,
I'd much prefer text/plain for something that is indeed plain text.
I believe (but have not tried) that you can go from
PDF to text in
this case without any great difficulty
Well, _I_ can't. Or at least if I can I have no idea how. GhostScript
(the only open-source PDF reader I know of) doesn't seem to have a
plain-text output option.
/~\ The ASCII der Mouse
\ / Ribbon Campaign
X Against HTML mouse(a)rodents.montreal.qc.ca
/ \ Email! 7D C8 61 52 5D E7 2D 39 4E F1 31 3E E8 B3 27 4B