OAIS - PowerPoint PPT Presentation

Presentation Transcript

  1. OAIS Rathachai Chawuthai rathachai.chawuthai@live.com Information Management CSIM / AIT

  2. Agenda • Preface • Overview • Data Model • Function Model • Architecture Model • OAIS in use

  3. Preface

  4. Starter I want to build my own restaurant. What should I do?

  5. Starter What you should have

  6. Starter What you should plan

  7. Starter How you should run

  8. Starter I am not giving you a blueprint or a concrete model for running a restaurant. Instead, I am guiding you on WHAT and HOW you have to consider when you plan to run a restaurant business.

  9. Introduction I want to build an archival information system. What should I do?

  10. Introduction Understand the OAIS reference model

  11. Overview

  12. Overview • Open Archival Information System • In 2000, the Research Libraries Group (RLG) and the Online Computer Library Center (OCLC) discussed how both organizations could build an infrastructure for archiving digital objects. • It guides you in building an archival information system

  13. Overview • Purpose • Model a system for archival information, represented in digital format, for long-term preservation • Scope • Framework for long-term preservation and access • Terminology • Architectures and operation • Preservation strategies and techniques • Data model

  14. Overview • Primary functions • To preserve digital resources over an extended period of time • To provide user access to the information in the archive

  15. High Level Concept • Consumer: person(s) or client systems who interact with the OAIS system and services • Producer: person(s) or client systems who provide the information to be preserved • Management: person(s) who set the overall policy of the OAIS. Management is separate from administrative functions

  16. Archive External Data Workflow

  17. Archive External Data Workflow • Producer • Ingests digital resources into the system • Management • Monitors and verifies digital resources, does preservation planning, migrates digital resources, etc. • Consumer • Searches and accesses digital resources in the repository

  18. Data Model

  19. Package model • 4 simple information concepts • Content Info • A digital resource that needs to be preserved, e.g. text, image, video, sound, … • Preservation Description Info (PDI) • Contains preservation metadata that tells humans or machines what they must take into account when they access, render, or otherwise act on the digital resource. • Archive Packaging Info • A package that wraps both Content Info (1) and PDI (2) so that they are stored as one object • Info of Archive Package • It acts as metadata for the Archive Packaging Info (3). • It lets a search engine answer queries without the costly step of unpacking the archive package to reach the Content Info or PDI directly
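
The four concepts above can be sketched as plain data structures. This is only an illustration under stated assumptions, not something defined by the OAIS reference model: the class names, field names, the `describe` helper, and the `urn:example:` identifier are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class ContentInfo:
    """(1) The digital resource to preserve (hypothetical fields)."""
    data: bytes
    media_type: str  # e.g. "image/jpeg"


@dataclass
class PreservationDescriptionInfo:
    """(2) PDI: metadata needed to preserve and use the content."""
    reference: str   # identifier pointing outside the system
    provenance: str  # source and history of the content
    context: str     # environment needed to render it
    fixity: str      # checksum of the content


@dataclass
class ArchivePackage:
    """(3) Wraps Content Info and PDI so they are stored as one object."""
    content: ContentInfo
    pdi: PreservationDescriptionInfo


def describe(package: ArchivePackage, title: str) -> dict:
    """(4) Build searchable info so a query need not open the package."""
    return {
        "title": title,
        "media_type": package.content.media_type,
        "reference": package.pdi.reference,
    }


package = ArchivePackage(
    ContentInfo(data=b"\x00\x01\x02", media_type="image/jpeg"),
    PreservationDescriptionInfo(
        reference="urn:example:thailand-map",  # made-up identifier
        provenance="scanned from a paper map",
        context="any JPEG viewer",
        fixity="(checksum goes here)",
    ),
)
info = describe(package, title="Thailand Map")
print(info["title"])  # Thailand Map
```

A catalog can index `info` records on their own, which is the point of the fourth concept: searching never touches the packaged content.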

  20. Package Model • Content Information: • The original object targeted for preservation. • A physical or digital object and its Representation Information. [Figure labels: Content Information, Preservation Description Information (PDI), Archive Packaging Information, Package 1, Descriptive Information about Package 1]

  21. Package Model • Preservation Description Information (PDI): • What is needed to preserve the Content Information • Provenance • For reliability • Source of content • History • Context • Environment to render • Reference • Refers to things outside, e.g. ISBN • Fixity • Checksum, MD5, … [Figure labels: Content Information, Preservation Description Information (PDI), Archive Packaging Information, Package 1, Descriptive Information about Package 1]

  22. Package Model • Descriptive Information: • Information used to discover which package has the Content Information of interest • The full set of attributes that are searchable in a catalog service • Indexing this information may improve search performance [Figure labels: Content Information, Preservation Description Information (PDI), Archive Packaging Information, Package 1, Descriptive Information about Package 1]

  23. Package Model [Figure labels: Package 1, Descriptive Information about Package 1]

  24. Information Model • Content Information • A basic unit of information that contains data and its Representation Information. • For example, it can be a “Thailand Map”

  25. Information Model • Data Object • It is the object that needs to be preserved. • It can be either a physical thing in the real world or a digital object whose content is a bit string. • In this case, it can be the content (1010101010101110101 000101010101001010…..) of an image file of the Thailand Map • In fact, it is just a string of bits that has no meaning if no one can interpret it.

  26. Information Model • Representation Information • A bit string (100111010…) may be useless if no one knows its meaning. Representation Information tells what the structure of “100111010…” is and how to interpret it. • There may be Representation Information about the Representation Information itself if the data object content has a complex structure or is encoded at many levels [Figure labels: raw bit string; format in byte form; construct JPEG format structure; interpret as pixel colors to form a picture]
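
The point that a bit string means nothing without Representation Information can be shown with a small sketch. The four bytes and the two interpretations below are assumptions chosen for illustration; they are not taken from the slides.

```python
import struct

# The Data Object: four raw bytes with no inherent meaning.
raw = bytes([0x4F, 0x41, 0x49, 0x53])

# Representation Information A (assumed): "the bytes are ASCII text".
as_text = raw.decode("ascii")

# Representation Information B (assumed): "the bytes are one big-endian
# unsigned 32-bit integer".
(as_number,) = struct.unpack(">I", raw)

print(as_text)    # OAIS
print(as_number)  # the same bytes read as a single integer
```

The same bytes yield a word or a number depending on which description is applied, so an archive that keeps only the bytes has not preserved the information.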

  27. Information Model • PDI contains • Reference Info • Identifier that links to a thing outside the system or to a real-world resource, such as an ISBN • Provenance Info • To inform how reliable the digital resource is • To inform the origin or source of the content • To inform the history of changes • To inform the migration process • Context Info • To record why the digital resource was created, where, and how • Including the software and environment that created it • Fixity Info • To provide the information needed to verify that the digital resource is unaltered • E.g. checksum, MD5, etc.
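
Fixity Info is the easiest PDI component to illustrate in code. The sketch below uses Python's standard `hashlib`; the function names `compute_fixity` and `verify_fixity` are hypothetical, and SHA-256 is an assumed default rather than anything the slides prescribe (passing `algorithm="md5"` would give the MD5 digest mentioned above).

```python
import hashlib


def compute_fixity(data: bytes, algorithm: str = "sha256") -> str:
    """Return the hex digest to record as Fixity Info at ingest time."""
    return hashlib.new(algorithm, data).hexdigest()


def verify_fixity(data: bytes, expected: str, algorithm: str = "sha256") -> bool:
    """Recompute the digest later and compare it with the recorded one."""
    return compute_fixity(data, algorithm) == expected


original = b"Thailand Map image bytes"
recorded = compute_fixity(original)

print(verify_fixity(original, recorded))         # True
print(verify_fixity(original + b"!", recorded))  # False
```

Any change to the stored bytes, even a single appended byte, changes the digest, which is what lets the archive detect silent corruption.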

  28. Information Model Example of PDI components

  29. Information Model Example of PDI components

  30. Information Model Example of PDI components

  31. Information Model Big picture of the information model of a package after combining all components

  32. Information Package • SIP (Submission Information Package) • A form of package suitable for the producer to ingest into the system. • Mainly, a SIP contains Content Info and PDI. • Multiple SIPs may be associated with the same PDI. • AIP (Archival Information Package) • A form of package suitable for storage in the system. • One or more SIPs are transformed into an AIP that has a complete set of PDI for the associated Content Info. • An AIP may be a collection of AIPs. • DIP (Dissemination Information Package) • A form of package suitable for dissemination to the consumer. • An AIP is transformed into a DIP for sharing purposes. • A DIP may contain one or more AIPs and may not have a complete set of PDI.
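
The SIP to AIP to DIP transformations can be sketched with plain dictionaries. Everything here is an assumption made for illustration: the dictionary layout, the function names `build_aip` and `build_dip`, and the rule that a complete PDI means having all four components.

```python
from __future__ import annotations

REQUIRED_PDI = {"reference", "provenance", "context", "fixity"}


def build_aip(sips: list[dict], archive_pdi: dict) -> dict:
    """Merge one or more SIPs into an AIP that has a complete PDI."""
    pdi: dict = {}
    for sip in sips:
        pdi.update(sip.get("pdi", {}))
    pdi.update(archive_pdi)  # the archive fills in what producers omitted
    missing = REQUIRED_PDI - set(pdi)
    if missing:
        raise ValueError(f"incomplete PDI, missing: {sorted(missing)}")
    return {"content": [sip["content"] for sip in sips], "pdi": pdi}


def build_dip(aip: dict, pdi_fields: tuple = ("reference",)) -> dict:
    """Derive a DIP for a consumer; it may carry only part of the PDI."""
    return {
        "content": list(aip["content"]),
        "pdi": {name: aip["pdi"][name] for name in pdi_fields},
    }


sip_a = {"content": b"map-north", "pdi": {"reference": "urn:example:map"}}
sip_b = {"content": b"map-south", "pdi": {"provenance": "scanned"}}
aip = build_aip([sip_a, sip_b], {"context": "JPEG viewer", "fixity": "n/a"})
dip = build_dip(aip)

print(len(aip["content"]))  # 2
print(dip["pdi"])           # {'reference': 'urn:example:map'}
```

The AIP is the only form that is required to be complete; the SIPs arrive partial and the DIP leaves partial, which matches the three bullets above.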

  33. Information Package [Figure labels: Producer, Ingest, SIP, Store, AIP, Administrator, Query/Access, Disseminate, DIP, Consumer]

  34. Function Model

  35. Functional Models

  36. 1) Ingest • Accept SIPs from Producers • or from internal elements under Administration control • Prepare the AIPs for archive storage

  37. 2) Archival Storage • Storage of AIPs • Maintenance of AIPs • Retrieval of AIPs

  38. 3) Data Management • Populate • Descriptive Information • Administrative Data • Maintain • Descriptive Information • Administrative Data • Access • Descriptive Information • Administrative Data

  39. 4) Administration • Solicit and negotiate submission agreements • With producers • Audit submissions • To ensure that they meet standards • Maintain configuration management of • System hardware • Software • Day-to-day governance of the other OAIS functional entities

  40. 5) Preservation Planning • Monitor the environment of the OAIS • Provide recommendations • Is the information still accessible? • Over the long term? • Even if the original computing environment becomes obsolete?

  41. 6) Access • Determine the existence, description, location, and availability of information in the OAIS • Allow Consumers to request and retrieve information products
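
Four of the six functional entities (Ingest, Archival Storage, Data Management, Access) can be wired together in a toy in-memory archive. This is a rough sketch under stated assumptions, not a real OAIS implementation: the class `TinyArchive`, its method names, and the SIP dictionary layout are hypothetical, and Administration and Preservation Planning are left out.

```python
import hashlib


class TinyArchive:
    def __init__(self) -> None:
        self._storage: dict = {}  # Archival Storage: AIP id -> AIP
        self._catalog: dict = {}  # Data Management: AIP id -> descriptive info

    def ingest(self, sip: dict) -> str:
        """Ingest: accept a SIP, prepare an AIP, and record its description."""
        content = sip["content"]
        digest = hashlib.sha256(content).hexdigest()
        aip_id = digest[:12]  # assumed id scheme for this sketch
        self._storage[aip_id] = {"content": content, "fixity": digest}
        self._catalog[aip_id] = {"title": sip["title"]}
        return aip_id

    def search(self, keyword: str) -> list:
        """Access: locate packages through descriptive info only."""
        return [
            aip_id
            for aip_id, info in self._catalog.items()
            if keyword.lower() in info["title"].lower()
        ]

    def retrieve(self, aip_id: str) -> dict:
        """Access: check fixity, then hand the content out as a DIP."""
        aip = self._storage[aip_id]
        if hashlib.sha256(aip["content"]).hexdigest() != aip["fixity"]:
            raise ValueError("fixity check failed")
        return {"title": self._catalog[aip_id]["title"], "content": aip["content"]}


archive = TinyArchive()
map_id = archive.ingest({"title": "Thailand Map", "content": b"map bytes"})
print(archive.search("map") == [map_id])    # True
print(archive.retrieve(map_id)["content"])  # b'map bytes'
```

Keeping the catalog separate from storage mirrors the split between Data Management and Archival Storage: searches run against descriptions, and stored packages are only read on retrieval.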

  42. OAIS in use

  43. Summary What you should have

  44. Summary What you should plan

  45. Summary How you should run

  46. ?

  47. References • http://www.oclc.org • http://www.oclc.org/research/pmwg/pm_framework.pdf