[a bunch of replies all in one]
On Wed, 16 Mar 2005, Vintage Computer Festival wrote:
However, I find that media is much more sturdy than
these
discussions would indicate. We tend to gripe very loudly (and rightly so)
when poor quality backup media takes our data with it. But what about all
those times we go back to our old backups and thankfully find what we
need?
That's true. It's not like 100% of all media rots to nothing. But
"finding" old media that's still readable is different from
"archiving".
I consistently read lots of data with nary a problem:
20-30 year old
floppy disks (5.25" and 8"), 20-30 year old mag tapes, even 20 year old
VHS tapes. And of course the punch cards don't count ;)
No offense, but you're in the business of doing so :-) Only a
minority of even us can read and convert old formats, even when
readable.
On Thu, 17 Mar 2005, Jim Leonard wrote:
If you somehow
think that tapes stored in a controlled vault is
more reliable, or less susceptible to bit rot than rotating
spindles, I believe you are wrong (and my every experience and
observation says otherwise). Only fiche and paper are statically
reliable.
You are omitting cost. Tapes in a vault cost significantly less over 10
years than the cost of 4 live systems + replacement hard drives + network
bandwidth + electricity for your method.
My assertion is that it's a false economy; besides bit-rot,
there's future (near-guarenteed) incompatibilities, and if enough
time passes in multiple media (papertape... 1/2" mag tape...
DC300... 3740... QIC... etc)
In fact, with Moore's Law and related continuous hard disk is *far
cheaper*. It's just that you're not deferring the cost to your
future inlaws!
Your method, unless I am misunderstanding you, has no
revisioning. It
protects against hardware failure, but how many revisions do you keep? What
protects against you mistakenly deleting the wrong directory, etc.?
That's what filesystems are for.
I also extended a rotating backup scheme I found on the net, that
uses hard links and an intentional feature of rsync; it fully
saves N copies of the data with true incremental changes with only
about 10% more diskspace than the original files themselves, for
data that changes at a "reasonable" rate of change (per week,
say). (Example: I have a big RAID server that has 5 weeks of 100%
snapshots; the original data is about 360GB, 5 weeks worth is
about 500GB.
http://wps.com/temp/rotate-backup
I agree completely that archival data should be
monitored in some fashion
instead of being locked away forever. And, cost permitting, transferred to
new media.
On rotating spindles, this is continuous...
On Thu, 17 Mar 2005, Dwight K. Elvey wrote:
One other thought is that one should have a way to
transfer the raw data from what ever media you have.
This allows one to have redundent information stored
on the same media. If you depend on normal file systems,
you run the risk of the file system not allowing you
to access the data simply because a small part is
damaged.
Hence multiple, ordinary servers, physically separate. The chances
of all of them developing a bad disk simultaneously is exceedingly
low.
If one of my servers craps a disk, that machine is repaired; data
from the other server(s) is automatically (cron) copied
to it.
Self-healing.
On Thu, 17 Mar 2005, Doc Shipley wrote:
>>> This reminds me that I read not too long
ago that many of the super
>>> computer labs ship PCs between sites because it's *faster* to ship a
>>> working PC with 1TB of disk containing data than it is to transfer it
Yikes, brings
new meaning to the term "Our System Crashed." Wonder
what the insurance is on something like that....
The insurance in that case was basically irrelevant. Their insurance would
cover the hardware replacement costs, period.
No point in insuring the data contained in the disk drives in the
computer in the back seat; it's just a copy.
On Thu, 17 Mar 2005, Doc Shipley wrote:
Then the other issue was the archived files
themselves. Some of it was in
PageMaker, some in Wordstar, some in MS-DOS .prn files, and a lot of it in
AutoCAD r12 and a freeware DOS GIS tool whose name I've mercifully forgotten.
THen there's that! Added to the top of media recovery ("who's got
a 200bpi UNIVAC drive?") it generally means that the data is
simply "lost". (Few think to "SAVE-AS" plain ASCII for posterity;
I rarely do.)
I spent several hundred hours sorting backup sets,
restoring them to disk,
migrating files into current formats, and then archiving both current and
original copies to 4mm tape. It made for some very soothing afternoons,
hidden in my office.
And they only did it because it was required by law!
I'm seriously planning on printing out my website on paper. Laugh,
but it might be one of the few copies that survives long term.
That is, if I can find a printer that produces real output.