[SCC_Active_Members] Software Archive Solution
Richard P. Gabriel
rpg at dreamsongs.com
Wed Apr 11 08:44:19 PDT 2007
At 23:35 -0700 4/10/07, Bob Fraley wrote:
>There is a difference between my proposal and the examples given
>here. In the examples some media has been written and left
>unattended for years. In my proposal, the master archive is active,
>kept in an operating computer. So if storage technology changes,
>the archive will need to be copied to the new media. Of course, if
>the archive were ever to be taken off line, the backup media would
>become the archive. By doing periodic full backups, the technology
>for storing the backups will be recent technology. The older
>artifacts will be available on recording media that is contemporary
>with this event, rather than media contemporary with the artifact
>itself.
I was assuming the archives were on live machines and constantly and
instantaneously available. Why would anyone assume otherwise?
>When new file formats are created, an automatic process could go
>through the archive to generate the new file formats. Similarly,
>audio or graphics files could be converted to new formats while the
>conversion programs are available. The archives that you reference
>were allowed to sit idle until the formats were no longer in use,
>and any conversion programs that may have existed were no longer
>available.
Well, you make a rosy assumption. There never were any conversion
programs for SAIL. "Standards" grew up separately (essentially in
silos), and the silo that was SAIL was merely abandoned. The work to
make the SAIL files available started very soon after the machine was
decommissioned, by what were then called system programmers (today
maybe called senior sys admins - there is no parallel because the
people back then actually were usually the OS implementors or near
kin; imagine a Linux committer running a Linux farm), and they wrote
the first conversion programs.
Step by step conversion is not always a reasonable alternative.
Consider the SAIL case. The SAIL character set included several
mathematical symbols, alpha, beta, for each, there exists, union,
intersection, etc, etc. When the conversion was done, unicode was not
a real alternative, so they used an html encoding that captured these
characters, but they were custom encodings. Now one could convert to
unicode, but the archive that is on a live machine is the transcoded
one, and so another custom conversion program needs to be written.
You might argue that SAIL is just a backwater and who cares, but for
20-30 years it was among the top half-dozen research labs in the
world.
What I'm saying is that it is unlikely to be the case that a routine
process can be followed.
> Clearly, the curators will need to remain alert to apply the
>appropriate translations in a timely manner. Formats that are no
>longer in use can be removed from the archive, but the core data (in
>the 1620 case the binary record of all holes in the card) should
>never be removed.
I sure wouldn't delete these intermediate steps. Otherwise you might
not be able to fix mistakes with reasonable effort.
My sense is that the OAIS model has examined most of these concerns
I've brought up.
-rpg-
More information about the SCC_active
mailing list