
Stanford Digital Repository

Extending the Implementation of PREMIS to Geospatial Resources in the Stanford Digital Repository: An Exploration. By Nancy J. Hoebelheinrich, Metadata Coordinator, Digital Library Systems & Services. PREMIS Tutorial, San Diego, CA, 11 Feb 2008.


Presentation Transcript


  1. Stanford Digital Repository. Extending the Implementation of PREMIS to Geospatial Resources in the Stanford Digital Repository: An Exploration. By Nancy J. Hoebelheinrich, Metadata Coordinator, Digital Library Systems & Services. PREMIS Tutorial, San Diego, CA, 11 Feb 2008

  2. To Be Discussed
     • Context for SDR (Stanford Digital Repository)
     • What PREMIS data elements are being used currently
     • How & why
     • PREMIS & Geospatial Resources - a fit?
     Implementing the PREMIS Data Dictionary at Stanford

  3. SDR Context
     • Bit-level preservation environment
     • Designed to facilitate an automated production environment for digitization & receipt of digital materials
     • Use of a “target manifest” (TM)
       • = core metadata, structure & file inventory expressed as a METS document
       • = SIP w/o content files

  4. Scenario 1: David Rumsey Historical Maps Collection
     • Comprises historical maps digitized as single, still-image TIFFs
     • METS records for
       • Rumsey Deposit Agreement
       • Rumsey “Collection Level” & Auxiliary Files
       • Each Item

  5. METS Documents for Rumsey Collection
     [Diagram: relationships among METS docs: Deposit Agreement; Individual Map; Collection Level / Auxiliary Files]

  6. PREMIS Records contained w/in METS Documents
     • PREMIS OBJECT: aspects of digital provenance
     • PREMIS RIGHTS: succinct link to full rights statement
     • PREMIS EVENTS: important lifecycle events

  7. Use of PREMIS Object Data Elements
     • Used in each METS Document referencing files
       • Item, Agreement, “Collection Level” & Auxiliary Files
     • Located in the METS <amdSec><techMD> section
     • Automatic insertion by Ingest code to retain important provenance info for each file:
       • Original file name from data provider
       • Original checksum
       • Original file size
     • Some information redundant, but prefer to retain in case METS sections need to be pulled out separately for action
     • Examples shown: Rumsey Item TM; Rumsey PREMIS_Object excerpt
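The three provenance values that slide 7 says the ingest code records (original file name, checksum, file size) could be captured along the following lines. This is only a sketch, not the SDR ingest code: the function names `premis_object` and `wrap_in_techmd` are hypothetical, the element names follow PREMIS semantic units but are emitted without namespaces and are not schema-validated, and MD5 is an assumed algorithm since the slides do not name one.

```python
import hashlib
import xml.etree.ElementTree as ET


def premis_object(original_name, content):
    """Build a PREMIS-style <object> for one file (namespaces omitted)."""
    obj = ET.Element("object")
    chars = ET.SubElement(obj, "objectCharacteristics")
    fixity = ET.SubElement(chars, "fixity")
    # Assumption: MD5; the slides do not say which checksum algorithm is used.
    ET.SubElement(fixity, "messageDigestAlgorithm").text = "MD5"
    ET.SubElement(fixity, "messageDigest").text = hashlib.md5(content).hexdigest()
    # Original file size in bytes.
    ET.SubElement(chars, "size").text = str(len(content))
    # Original file name as supplied by the data provider.
    ET.SubElement(obj, "originalName").text = original_name
    return obj


def wrap_in_techmd(obj, md_id):
    """Wrap the object in a METS-style <techMD><mdWrap><xmlData> container."""
    tech = ET.Element("techMD", {"ID": md_id})
    wrap = ET.SubElement(tech, "mdWrap", {"MDTYPE": "PREMIS"})
    ET.SubElement(wrap, "xmlData").append(obj)
    return tech


if __name__ == "__main__":
    # Hypothetical file name and content, for illustration only.
    record = wrap_in_techmd(premis_object("map0001.tif", b"example bytes"), "TMD1")
    print(ET.tostring(record, encoding="unicode"))
```

Keeping the object record inside its own `techMD` wrapper matches the slide's point that a METS section may need to be pulled out and acted on separately.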

  8. Use of PREMIS Rights data elements
     • Rumsey Deposit Agreement TM
       • Represents the ingested draft Agreement with its own TM
     • Placeholder for:
       • XML or other REL instance of full agreement, or
       • Use of METSRights once final agreement template is vetted & agreed upon by University Counsel

  9. Use of PREMIS Rights data elements
     • How?
       • <amdSec><rightsMD><mdWrap><xmlData>
       • Examples shown: Agreement TM; Rumsey Rights excerpt
     • Why?
       • Succinct summary of key information for quick access from METS Document itself
       • Locator for more complete expression of terms, conditions

  10. Use of PREMIS Event Data Elements
      • Event 1: Transform of descriptive MD from MS Access db => XML => MODS
      • Inserted into METS <amdSec><digiprovMD>
      • Examples shown: Rumsey SimpleFile TM; Rumsey Event excerpt
      • Why this event?
        • In case of questions from outside data provider
        • Retain singular scripts & transform mechanisms
        • Test practicability of recording such events in production environment
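A transform event like the one on slide 10 might be recorded roughly as follows. This is an illustrative sketch, not the Rumsey event excerpt itself: the helper names, identifier values, identifier type "local", and timestamp are all made-up placeholders, and the PREMIS-style elements are emitted without namespaces or schema validation.

```python
import xml.etree.ElementTree as ET


def premis_event(event_id, event_type, when, detail, object_id):
    """Build a PREMIS-style <event> linked to one object (namespaces omitted)."""
    event = ET.Element("event")
    ident = ET.SubElement(event, "eventIdentifier")
    ET.SubElement(ident, "eventIdentifierType").text = "local"  # assumed type
    ET.SubElement(ident, "eventIdentifierValue").text = event_id
    ET.SubElement(event, "eventType").text = event_type
    ET.SubElement(event, "eventDateTime").text = when
    ET.SubElement(event, "eventDetail").text = detail
    # Link the event back to the object it acted on.
    link = ET.SubElement(event, "linkingObjectIdentifier")
    ET.SubElement(link, "linkingObjectIdentifierType").text = "local"
    ET.SubElement(link, "linkingObjectIdentifierValue").text = object_id
    return event


def wrap_in_digiprovmd(event, md_id):
    """Wrap the event in a METS-style <digiprovMD><mdWrap><xmlData> container."""
    prov = ET.Element("digiprovMD", {"ID": md_id})
    wrap = ET.SubElement(prov, "mdWrap", {"MDTYPE": "PREMIS"})
    ET.SubElement(wrap, "xmlData").append(event)
    return prov


if __name__ == "__main__":
    # All values below are hypothetical placeholders.
    event = premis_event(
        "event-0001",
        "transformation",
        "2008-01-15T00:00:00",
        "Descriptive MD transformed: MS Access db => XML => MODS",
        "object-0001",
    )
    print(ET.tostring(wrap_in_digiprovmd(event, "DP1"), encoding="unicode"))
```

Putting the transform chain in `eventDetail` is one simple way to retain the information needed to answer a data provider's questions later; a fuller record could also point to the scripts used.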

  11. Scenario 2: Geospatial Files & PREMIS – is it a fit?
      • Shapefiles
      • Digital Raster Graphics (DRG) files
      • Digital Ortho Quarter Quads (DOQQs)
      • Factors:
        • Existence of extant domain-specific MD, e.g., FGDC for descriptive and technical MD
        • Number of layers of the resource, e.g., representation & file?
        • Point in the resource lifecycle one wishes to document

  12. Use of PREMIS Object Data Elements – Scenario 2: GIS Dataset
      • Domain-specific needs for Object:
        • Context, especially for semantic underpinnings, e.g., abstract, description of purpose, intended use of data
          • No place for this in PREMIS(?)
          • Perhaps <object><relationship><relatedObjectIdentification> for an explanatory website?
        • Environment
          • HW/SW info pertinent at time of data creation (?)
        • “Significant properties”
          • Data Quality – describing completeness, logical consistency, attribute accuracy
          • Data Trustworthiness – data creator / provider reliable? = “authentic”
          • Data Provenance – processes & sources for dataset = “understandable”
      • Better understanding of what’s contained in a “format registry” - & their existence!

  13. Use of PREMIS Event Data Elements
      • Event: Would prefer the option to describe process of data creation
        • Merge c:\temp\states1;c:\temp\states2;c:\temp\USA (includes process = “merge” and data sources)
      • Why this event?
        • Important to describe processes during different phases of lifecycle, even prior to ingestion
        • Not being able to do so is problematic for geospatial resources
        • Advantage – can describe events once in repository, unlike FGDC
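The merge string on slide 13 bundles a process name with its data sources. One possible way to split it and carry both parts into a PREMIS-style event is sketched below. The functions `parse_process_step` and `creation_event` are hypothetical, the "process name, then semicolon-separated sources" layout is assumed from the single example on the slide, and the event element is un-namespaced and not schema-validated.

```python
import xml.etree.ElementTree as ET


def parse_process_step(step):
    """Split 'Process src1;src2;...' into (process, [sources]).

    Assumes the first whitespace-delimited token is the process name and
    the remainder is a semicolon-separated list of data sources.
    """
    process, _, rest = step.strip().partition(" ")
    sources = [part.strip() for part in rest.split(";") if part.strip()]
    return process.lower(), sources


def creation_event(step):
    """Record a pre-ingest data-creation step as a PREMIS-style <event>."""
    process, sources = parse_process_step(step)
    event = ET.Element("event")
    ET.SubElement(event, "eventType").text = process
    ET.SubElement(event, "eventDetail").text = "sources: " + "; ".join(sources)
    return event


if __name__ == "__main__":
    # Raw string so the Windows-style backslashes are kept literally.
    step = r"Merge c:\temp\states1;c:\temp\states2;c:\temp\USA"
    print(ET.tostring(creation_event(step), encoding="unicode"))
```

Recording the sources as free text in `eventDetail` is the least structured option; linking each source as its own related object would be closer to the provenance needs raised on slide 12.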

  14. Issues & Challenges
      • Getting domain-specific MD would help!
      • If not, getting important prez info from data creators -- uh huh -- uh huh!!
      • How to determine what is truly necessary for dataset use?
      • Is this level of documentation still bit preservation?
      • Getting buy-in from domains, e.g., geospatial

  15. Questions? / comments?
      Nancy J. Hoebelheinrich, nhoebel@stanford.edu
