1 / 29

Long-Term Preservation

Long-Term Preservation. Technical Approaches to Long-Term Preservation the challenge is to interpret formats a similar development: sound carriers From phonograph to MP3. Those that do not keep up with this development soon will lack support:

arista
Download Presentation

Long-Term Preservation

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Long-Term Preservation

  2. Technical Approaches to Long-Term Preservation • the challenge is to interpret formats • a similar development: sound carriers • From phonograph to MP3

  3. Those that do not keep up with this development soon will lack support: • new audio documents are only produced in current formats • out-of-date equipment spare parts are hard to come by.

  4. technical approaches to long-term preservation of digital documents fall into two categories: • aim to preserve the original state of documents along with systems that are suitable for rendering the documents in their original format • aim to continually transform digital documents into the formats of state-of-the-art rendition systems and at the same time to retain their original “look and feel.”

  5. Migration • advantages: • well known • documents available all the time • Possibly improved quality • disadvantages: • reduced authenticity • hard to automate

  6. Hardware Museums • The mission of a hardware museum is to collect (and keep operational) all relevant computing systems so that future generations may view our documents in their original environments.

  7. Hardware museums are not feasible in practice : • too many items • Additional software and hardware required • hard to maintain Hardware museum at the UniversitätderBundeswehrMünchen, Germany

  8. Emulation • Emulators allow the function of processors and other hardware components to be simulated by software. • When using emulation, for each digital document the following items have to be preserved (using, e.g., migration): • The character stream and the metadata • A specification of the hardware that can be interpreted by the emulator • The complete software of the rendition system (in the form of binary data streams).

  9. If interested persons would like to access a document conserved that way in, say, 100 years from now, they would have to proceed as follows: • create an emulator, • Load the hardware specification into an emulator to obtain a software implementation which is functionally equivalent to the original hardware. • install software • On the emulated computer install the systems software and the application programs needed for rendering the document

  10. and renderdocuments • Load the character stream of the digital document into the emulated . . . and render computer and start the rendition software to access the document.

  11. advantages ofemulation: • relatively small cost per document • cost proportionalto actual use one emulator suffices for many documents • high authenticity • Whenever an old format becomes obsolete emulation (while new ones become popular), new conversion techniques and tools have to be developed that achieve the required transformation.

  12. Standard Formats • costs proportional to number of formats • standards for simple character sequences and For complex document types

  13. Legal and Social Concerns • long-term preservation of digital documents involves legal and social concerns: • “Digital Rights Management” (DRM) and copy protection • reserved software right • Should hardware manufacturers provide emulators? • criteria for selection • costs as a limiting factor • make costs affordable • balance of interests between shareholders

  14. OAIS Models • Open Archival Information System Reference Model • an ISO standard on the long-term preservation of digital documents. • two complementary points of view: both, an information model and a process model

  15. The Information Model • Data Object and Information Object • The knowledge which is required to understand data is called Knowledge Base • In order to understand the data one needs additional information. • Ex, Along with the source code of the Java program, a book about the programming language Java must be available (Representation Information)

  16. The Content Information is the information object proper which contains all the information necessary to interpret data • Preservation Description Information (PDI) denotes all the information required to suitably preserve the corresponding Content Information.

  17. Content Information and PDI are combined into one logical entity, the Information Package. • Packaging Information. It specifies how Content Information and PDI are actually related to each other e.g., by describing the directory structure of a CD-ROM.

  18. Descriptive Information which yields Information about the content of the Information Package and thus allows the Information Package to be found in the archive.

  19. Modeling Context and Processes • In order to define the processes that are going on in the archive in more detail, the OAIS Reference Model starts by considering the context of the archive. • An archive’s purpose is to maintain documents, which are submitted to it and which are to be made available to future users.

  20. Producers, i.e., authors, institutions, etc. that deliver documents to the archive. • Management. defines the specific purpose of the archive, e.g., which documents are to be collected and which are not

  21. The OAIS Reference Model differentiates three different kinds of Information Packages in their relation to the environment of the archive: • Submission Information Packages (SIP) are sent to the archive by Producers • Archive Information Packages (AIP) are preserved in the archive • Dissemination Information Packages (DIP) are passed from the archive to Consumers.

  22. The Ingest process receives an SIP from the Producer and prepares it Ingest for storage and administration within the archive. • SIPs must be transformed into AIPs, and Descriptive Information corresponding to the AIPs has to be created. • AIP is passed on to the Archival Storage process, and the corresponding Descriptive Information to the Data Management process.

  23. Data Management process manages the Descriptive Information and also the data that are necessary to run the system • Administration process handles routine work in the archive: negotiates with producers the prerequisites for sending documents to the archive.

  24. DSEP Model • Deposit System for Electronic Publications • The business routine of library can be subdivided into four domains: – Acquisition of stock – Capturing metadata – Preservation and maintenance – Providing access

  25. The process Delivery & Capture transforms documents into SIPs conforming to the DSEP standards. • The process Packaging & Delivery unpacks the DIP and transforms it into a format that can be used by the library system.

More Related