Pedals persistent digital archives library system
This presentation is the property of its rightful owner.
Sponsored Links
1 / 20

PeDALS Persistent Digital Archives & Library System PowerPoint PPT Presentation


  • 138 Views
  • Uploaded on
  • Presentation posted in: General

PeDALS Persistent Digital Archives & Library System. GladysAnn Wells, Director and State Librarian Lisa Maxwell, Division Director, Records Management Division Arizona State Library, Archives and Public Records. A Word from Our Sponsors.

Download Presentation

PeDALS Persistent Digital Archives & Library System

An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -

Presentation Transcript


Pedals persistent digital archives library system

PeDALSPersistent Digital Archives & Library System

GladysAnn Wells, Director and State Librarian

Lisa Maxwell, Division Director, Records Management Division

Arizona State Library, Archives and Public Records


A word from our sponsors

A Word from Our Sponsors

  • Library of CongressNational Digital Information and Infrastructure Preservation Program(NDIIPP)

  • Institute for Museum and Library ServicesLibrary Services and Technology Act


Project partners

Project Partners

  • Arizona

  • Florida

  • New York

  • South Carolina

  • Wisconsin

  • Two states to be named

  • Kudos to the Washington State Archives


Technical goals

Technical Goals

  • To develop a OAIS compliant curatorial rationale that can be implemented in software to support anautomated, integrated workflow to process collections of digital records and publications

  • Best suited for collections of records that grow out of routine process and have associated metadata

  • May be adapted to publications

  • Not immediately appropriate for ad hoc records


Technical goals1

Technical Goals

  • To build “digital stacks” using LOCKSS as the basis of an inexpensive storage network that can preserve the authenticity and integrity of the materials.


Additional goals

Additional Goals

  • To build a community of shared practice that meets the needs of a wide range of repositories

    • For best practices ~ what works, what’s practical

    • For resource sharing ~ avoid redundant work

  • To remove barriers to preservation by keeping costs as low as possible


Curatorial rationale

Curatorial Rationale

  • Transformation of traditional, paper-based practices into the digital arena that focuses on explaining why we do things in a particular manner

    • Appraisal

    • Acquisition

    • Arrangement and description

    • Housing and storage

    • Reference and access

    • Preservation


Architecture

Architecture


Automated processing

Automated Processing

  • Curators work with rules, not records

    • Describe business processes (rules)

    • Monitor the process for quality assurance

  • Rules expressed in software

    • A “pipeline” that transforms records as they move through the system

  • Based on Microsoft BizTalk middleware


Preparatory work with provenance

Preparatory Work with Provenance

  • For each series of records selected for transfer

    • Negotiate metadata you will receive

    • Negotiate format of the records

    • Negotiate format of the submission information package(records, metadata, shipping manifest)

    • Negotiate frequency and manner of transfer

  • Provenance develops procedures to create SIPs

    • Exploring the use of LC’s BagIt

  • Archives describes business rules in middleware


Preliminary processing

Preliminary Processing

  • Describe provenance

  • Describe series-level

    • Creator, Provenance, Source of Records

    • Series title, Date ranges

    • Scope note

    • Access points: activities, topics

  • Describe accession

    • Assign accession number, date

    • Assign unique system number

    • Record source, transfer authority, restrictions


Submission

Submission

  • Provenance transfers submission information packages (SIPs) to a drop box

    • FTP, sFTP, scp

    • Tape

    • CD, DVD

  • Each Provenance has a directory

    • Each series has a subdirectory

  • Isolated for virus scanning

  • Option: simple Linux box


Data wrangling

Data Wrangling

  • Before business rules are launched, the curator will validate the SIPs to ensure they conform to the negotiated specification.

  • Includes running New Zealand Metadata Extractor to create

    • Software format

    • Version

    • File size

    • Mime type

  • Some problems we’ve encountered during this step

    • License number did not link to the file containing the record

    • XML contained invalid control characters

    • TIFFs were delivered instead of PDFs


Automated item level description pedals core metadata schema

Administration

Accession number

Transfer authority

Restrictions

Discovery

Provenance

Series

Title

Date

Summary (abstract, scope)

Access points

Preservation

File format, version

Hash

Single schema used for all records, contains elements common to most records

http://pedalspreservation.org/Metadata/Default.aspx

Version 2.0 out mid August

Automated Item-Level Description:PeDALS Core Metadata Schema


Mapping and creating metadata

Mapping and Creating Metadata

<?xml version="1.0" ?>

<keywords>

<keyword>

  <name>Bride First Name</name>

  <value>ROBIN</value>

  </keyword>

<keyword>

  <name>Bride Last Name</name>

  <value>STERNBERGER</value>

  </keyword>

<keyword>

<name>Groom First Name</name>

  <value>MANUEL</value>

  </keyword>

<keyword>

  <name>Groom Last Name</name>

  <value>AVILES</value>

  </keyword>

<?xml version="1.0" ?>

<keywords>

<keyword>

  <name>Bride First Name</name>

  <value>ROBIN</value>

  </keyword>

<keyword>

  <name>Bride Last Name</name>

  <value>STERNBERGER</value>

  </keyword>

<keyword>

<name>Groom First Name</name>

  <value>MANUEL</value>

  </keyword>

<keyword>

  <name>Groom Last Name</name>

  <value>AVILES</value>

  </keyword>

 <keyword>

<name>License #</name>

  <value>411811</value>

  </keyword>

<keyword>

  <name>Marriage Date</name>

  <value>12-17-2005</value>

  </keyword>

<keyword>

  <name>Recording Date</name>

  <value>1-19-2006</value>

  </keyword>

<keyword>

  <name>Batch Name</name>

  <value>ML01 SAD 01/13/06</value>

  </keyword>

  </keywords>


Mapping and creating metadata1

Mapping and Creating Metadata

  • Title

    • MARY ALICE CICERALE and ROBERT PORTER : marriage certificate, 2008

  • Extent

    • 1 Adobe Acrobat PDF (34379 bytes)

  • Date

    • 3 Jan 2008 : Marriage date

    • 12 Jan 2008 : Recording date

  • Access

    • CICERALE, MARY ALICE : Bride

    • PORTER, ROBERT : Groom


Create archival information package

Create Archival Information Package

  • Update Accessions Register database

  • Create AIP

    • <AIP><Normalized Metadata> </Normalized Metadata ><Received Metadata> </Received Metadata><Record> </Record> <Transformed record> </Transformed record></AIP>


Ingest into the digital stacks

Ingest into the Digital Stacks

  • AIPs transferred to a LOCKSS Cluster

  • LOCKSS

    • Redundant Array of Inexpensive Servers

    • Automatic integrity checking

    • Automatic error-correction

    • Geographically dispersed copies

    • Bitstream preservation

    • See LOCKSS.ORG


Public access

Public Access

  • For records that are not confidential, the middleware creates dissemination information packages, puts them on public webserver

    • Doesn’t include administrative, preservation metadata that users will not likely want

    • In format easily supported by common browsers

  • Middleware updates public catalog for website

    • We assume most partners will integrate these search pages into their existing website.


For more information

For more information

  • http://www.pedalspreservation.org/

  • Principal Investigator

    • Richard [email protected]

  • Project Coordinator

    • Sara [email protected]


  • Login