html5-img
1 / 25

The PREMIS Working Group: Preservation Metadata for Digital Repositories

The PREMIS Working Group: Preservation Metadata for Digital Repositories. DLF Fall Forum October 26, 2004 Rebecca Guenther LC/NDMSO rgue@loc.gov . Preservation Metadata Functions. Information that supports and documents the digital preservation process:

rane
Download Presentation

The PREMIS Working Group: Preservation Metadata for Digital Repositories

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. The PREMIS Working Group:Preservation Metadatafor Digital Repositories DLF Fall Forum October 26, 2004 Rebecca Guenther LC/NDMSO rgue@loc.gov

  2. Preservation Metadata Functions • Information that supports and documents the digital preservation process: • Establish provenance: track chain of custody and alterations over time • Details authenticity • Documents technical processes object has undergone • Describes technical details of object • Describes the environment from which it originated • Specify rights management information DLF-PREMIS

  3. Preservation Metadata Functions (cont.) • Provide information to maintain resources over the long term: • viability: object’s bitstream is intact • renderability: object can be translated to a form that can be viewed or used • understandability: rendered content can be interpreted and understood DLF-PREMIS

  4. Background • March 2000: OCLC and RLG jointly sponsor international working group on preservation metadata • Identify key issues/challenges • Seek consensus on recommendations and best practice • White paper (January 2001) • Defined preservation metadata; role in preservation process • Reviewed/synthesized existing preservation metadata schemes • Preservation metadata framework (June 2002) • Comprehensive description of types of information constituting preservation metadata • Based on OAIS information model • Set of “prototype” preservation metadata elements DLF-PREMIS

  5. Aftermath … • Framework … • Consolidated expertise • Provided foundation for developing formal preservation metadata specifications • Common departure point for different schema implementations • But ... further scope for collaboration in preservation metadata • Needed best practices/recommendations for implementing preservation metadata in real world digital archiving systems DLF-PREMIS

  6. Issues unresolved in WG • How minimal is a core preservation metadata element set? • How much metadata can be generated automatically? • Is it useful to apply metadata elements by object type or object behavior? • Levels of granularity not addressed • Need to provide less abstract view of preservation metadata for implementation DLF-PREMIS

  7. PREMIS • June 2003: OCLC and RLG sponsored new working group: PREMIS • Preservation Metadata: Implementation Strategies • Objectives • Define “core” set of preservation metadata elements, with supporting data dictionary, applicable to broad range of digital preservation activities • Identify and evaluate alternative strategies for encoding, storing, managing, and exchanging preservation metadata DLF-PREMIS

  8. Priscilla Caplan, FCLA (Chair) Rebecca Guenther, LC (Chair) Michael Alexander, British Library George Barnum, GPO Charles Blair, U. of Chicago Olaf Brandt, U. of Gottingen Adam Farquhar, British Library David Gewirtz, Yale Kevin Glavash, MIT/Dspace Cathy Hartman, U. of N. Texas Helen Hodgart, British Library Nancy Hoebelheinrich, Stanford Roger Howard/Sally Hubbard, Getty Museum Pam Kircher, OCLC John Kunze, Calif. Digital Library Brian Lavoie, OCLC liaison Robin Dale, RLG liaison Vicky McCarger, LA Times Jerry McDonough, NYU/METS Evan Owens, JSTOR Erin Rhodes, NARA Madi Solomon, Walt Disney Co. Angela Spinazze, ATSPIN Stefan Strathmann, U. of Gottingen Gunter Waibel, RLG Lisa Weber, NARA Robin Wendler, Harvard Hilde van Wijngaarden, KB Andrew Wilson, NAA Membership DLF-PREMIS

  9. Howard Besser, UCLA Liz Bishoff, OCLC (via Colorado Digitization Program) Gerard Clifton, National Library of Australia Gail Hodge, CENDI Steve Knight, National Library of New Zealand Maggie Jones, Digital Preservation Coalition Nancy McGovern, Cornell Cliff Morgan, Wiley UK Richard Rinehart, U. of California, Berkeley Advisory Committee DLF-PREMIS

  10. PREMIS Subgroups • Core elements • Establish core metadata elements and data dictionary • Developed a data model • Has had 2 face-to-face meetings • Weekly conference calls • Implementation • Examine alternative strategies for encoding, storage and management of preservation metadata • Conducted a survey of practices • Monthly conference call • Expect to complete activities by end of 2004 DLF-PREMIS

  11. Core elements subgroup • Development of data model • Objects • Events • Agents • Intellectual entities • Rights • Data dictionary structured according to entities DLF-PREMIS

  12. Core Elements • Conducting element-by-element review of prototype elements from metadata framework • Is the element “core”? • How is it being used at WG members’ institutions? • How should it be implemented/populated? • Elements not covered by the framework? DLF-PREMIS

  13. Objects • Identifiers • Location • Descriptive metadata out of scope • Technical metadata not specific to particular file format • Levels of objects: representation, file, filestream, bitstream DLF-PREMIS

  14. Objects:Technical metadata • Object characteristics • Fixity • Size • Format (including link to format registry) • Inhibitors • Significant properties • Creating application information • Environment (software, hardware) • Externally defined technical metadata (e.g. Z39.87/MIX) DLF-PREMIS

  15. Events • Digital provenance/process information • Actions that involve one or more objects • May be related to one or more agents • Semantic units • Event identifier • Event type • Event outcome • Event detail • Event date/time DLF-PREMIS

  16. Agents • Agent descriptions out of scope • Attributes of agents associated with preservation events and rights management • May carry-out, authorize, or compel one or more events • may create or act upon one or more objects • may hold or grant one or more rights • Semantic units • Agent identifier • Agent name DLF-PREMIS

  17. Rights and relationships • Rights • Only in context of right to preserve • Collecting rights use cases • Relationships • Data model expresses relationships between entities • Relationships between objects • Derivative, dependency, structural DLF-PREMIS

  18. Implementation Strategies subgroup • Conducted survey of preservation repositories to explore the state of the art • Questions about policies, governance, funding, system architecture, preservation strategies, metadata implementation • 70 surveys sent • Responses from 28 libraries, 7 archives, 14 other in 13 different countries • 10 national libraries, 6 national archives • Survey published Oct. 2004 DLF-PREMIS

  19. Survey findings • Little experience with digital preservation • Most didn’t have active preservation strategy • Many not yet in production • Cannot assess adequacy of metadata • Lack of common vocabulary and conceptual framework • Informed by OAIS reference model • Difference of opinion as to meaning of OAIS compliance DLF-PREMIS

  20. Survey findings (cont.) • Metadata • Many recording rights, provenance, technical, administrative, descriptive and structural • Consistent roles in preservation scope and policies (academic libraries, archives, national libraries) • Substantial use of METS, Z39.87/MIX, OCLC sets • Most repositories serve goals of both preservation and access DLF-PREMIS

  21. Trends • Store metadata redundantly in XML or relational database and with content data objects • Use METS for structural metadata and as container for descriptive and administrative; MIX for images • Use OAIS as framework and starting point • Maintain multiple versions (originals, some normalized or migrated) in repository with complete metadata for all versions • Choose multiple strategies for digital preservation DLF-PREMIS

  22. Looking ahead • Finalize core preservation metadata elements set • Complete data dictionary • XML schemas to support exchange of core elements for digital provenance/process and technical metadata • Final PREMIS report by end of 2004 • Community outreach: opportunities for public comment • Follow-on activities? DLF-PREMIS

  23. More information… • PREMIS Web site: http://www.oclc.org/research/projects/pmwg/ • “Implementing Metadata in Digital Preservation Systems: The PREMIS Activity” D-Lib (April ‘04) http://www.dlib.org/dlib/april04/lavoie/04lavoie.html • Rebecca Guenther: rgue@loc.gov • Priscilla Caplan: pcaplan@ufl.edu DLF-PREMIS

More Related