transcribing old docs

List overview All Threads
Download

newer

older

Mac 128K board

RGB & LCD...

dgriffi＠cs.csubak.edu

28 Jul 2006 28 Jul '06

1:12 a.m.

I'm transcribing the docs for a Radio Shack PT-210 printing terminal because I don't see it online anywhere and I just recently acquired a photocopy. This manual has a fair number of typos and a peculiar capitalization scheme which is typical of writing from the 1700s. Here's an example: [begin quote] If you set the PT-210 to Half Duplex and the Host is echoing the character, you will see two of each character on the Paper -- one character will be from the PT-210 and the other echoed from the Host. [end quote] So, is it a Good Idea to correct stuff like this? Should I be concerned about maintaining the page numbering? -- David Griffith dgriffi at cs.csubak.edu A: Because it fouls the order in which people normally read text. Q: Why is top-posting such a bad thing? A: Top-posting. Q: What is the most annoying thing in e-mail?

Show replies by date

nico＠farumdata.dk

28 Jul 28 Jul

1:48 a.m.

----- Original Message ----- From: "David Griffith"

...

I would say, that you want to preserve the style, but not the typos and the page numbering. If I saw e.g. a DOS/VS RPG II User's Guide with right-adjusted text, or Spec Syntax made up with full lines instead of hyphens, it wouldnt feel right. Just my 2c Nico

julesrichardsonuk＠yahoo.co.uk

7:13 a.m.

Nico de Jong wrote:

...

----- Original Message ----- From: "David Griffith"

I would say, that you want to preserve the style, but not the typos and the page numbering.

Storage availability and data transfer rates are so much better than they once were. I can't help but feel that at some point the best formats are going to encapsulate both scanned images and raw text - the images will be as faithful reproductions of the original as possible (greyscale, and including full colour where necessary) whilst each page also has a plain-text transcribed version attached. That way you get something that's as close to the original 'look and feel' of the document as possible (and allows you to cross-reference between the electronic version and a real copy), but you also get modern abilities that come with having text that can be searched, copied electronically etc. Lots of historians seem concerned about preserving the raw content - which is fantastic. But pulling that data out of its original context feels a little like running an emulator versus the real hardware. cheers J.

alexandre-listas＠e-secure.com.br

6:33 a.m.

...

Storage availability and data transfer rates are so much better than they

once

...

were. I can't help but feel that at some point the best formats are going

...

encapsulate both scanned images and raw text - the images will be as

faithful

...

reproductions of the original as possible (greyscale, and including full colour where necessary) whilst each page also has a plain-text transcribed version attached.

Adobe Acrobat does that for ages...

julesrichardsonuk＠yahoo.co.uk

7:48 a.m.

Alexandre Souza wrote:

...

Storage availability and data transfer rates are so much better than they

once

were. I can't help but feel that at some point the best formats are going

encapsulate both scanned images and raw text - the images will be as

faithful

reproductions of the original as possible (greyscale, and including full colour where necessary) whilst each page also has a plain-text transcribed version attached.

Adobe Acrobat does that for ages...

It does, but hardly anybody makes use of it - you either get a document containing only scanned pages (and possibly some form of contents menu), or a document containing "plain text" where the original scans have been thrown away. It's very rare that the two are seen combined, presumably because outside of the context of historical preservation there's no real justification to do so; in those cases the raw content itself is more important than how it's actually arranged. -- (\__/) (='.'=) This is Bunny. Copy and paste bunny into your (")_(") signature to help him gain world domination.

alexandre-listas＠e-secure.com.br

7:57 a.m.

...

It does, but hardly anybody makes use of it - you either get a document containing only scanned pages (and possibly some form of contents menu),

or a

...

document containing "plain text" where the original scans have been thrown away. It's very rare that the two are seen combined, presumably because outside of the context of historical preservation there's no real justification to do so; in those cases the raw content itself is more important than how it's actually arranged.

It depends of who uses it. I use the adobe acrobat suite and try to scan AND ocr every document, to preserve formatting and text search abilities. Most (well?) scanned books I see on the net uses this feature, it is just a matter of standarization. Just like rippers try to rip tv shows into a standard format/resolution/bitrate, there should be any way of creating a standard for scanning docs and books. There are many things that needs to be shared on the net and still are in paper.

trixter＠oldskool.org

30 Jul 30 Jul

11:11 p.m.

Alexandre Souza wrote:

...

Just like rippers try to rip tv shows into a standard format/resolution/bitrate, there should be any way of creating a

Bad example, BTW; most ripped TV shows have thrown away resolution, framerate, or both. -- Jim Leonard (trixter at oldskool.org) http://www.oldskool.org/ Help our electronic games project: http://www.mobygames.com/ Or check out some trippy MindCandy at http://www.mindcandydvd.com/ A child borne of the home computer wars: http://trixter.wordpress.com/

7297

days inactive

7300

days old

test-drb@ccmp.vtda.org

Manage subscription

6 comments

5 participants

tags (0)

participants (5)

alexandre-listas＠e-secure.com.br
dgriffi＠cs.csubak.edu
julesrichardsonuk＠yahoo.co.uk
nico＠farumdata.dk
trixter＠oldskool.org