
Project Prism

Virtual Remote Control: Preservation Risk Management for Web Resources. Nancy Y. McGovern, ECURE 2002. Project Prism is part of a 4-year NSF-funded project supported by the Digital Libraries Initiative, Phase 2 (Grant No. IIS-9905955, the Prism Project).


Presentation Transcript


  1. Project Prism
     Virtual Remote Control: Preservation Risk Management for Web Resources
     Nancy Y. McGovern, ECURE 2002

  2. The Project
     • Part of a 4-year NSF-funded project supported by the Digital Libraries Initiative, Phase 2 (Grant No. IIS-9905955, the Prism Project)
     • An umbrella project that includes:
       • Digital Libraries research team (Computer Science)
       • Human Computer Interface (HCI)
       • Cornell University Library (CUL)
     • For updates: http://www.library.cornell.edu/iris/research/prism/index.html

  3. The Team
     Anne R. Kenney, Nancy Y. McGovern, Peter Botticelli, Richard Entlich, William R. Kehoe, Carl Lagoze, Sandra Payette

  4. Preservation Risk Management
     • Increased reliance by research libraries on Web resources not owned or controlled
     • Need to monitor and evaluate resources
     • Identify risks to resources and appropriate responses
     • Technology introduces new threats, enables new solutions

  5. The Research Agenda
     See "Preservation Risk Management for Web Resources: Virtual Remote Control in Cornell's Project Prism," by Anne R. Kenney, Nancy Y. McGovern, Peter Botticelli, Richard Entlich, Carl Lagoze, and Sandra Payette, in D-Lib Magazine, January 2002: http://www.dlib.org/dlib/january02/kenney/01kenney.html

  6. The Approach
     • Process
     • Identification
     • Analysis
     • Appraisal
     • Strategy
     • Detection
     • Response

  7. Process
     Adapt the Risk Management Model stages:

  8. Identification
     Establish boundary; characterize content.
     Example: parse the URL
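As an illustration of the "parse the URL" example on this slide, the following is a minimal sketch using Python's standard `urllib.parse`. The function name `characterize_url` and the particular fields it returns are assumptions made for illustration; the slides do not say which URL features Prism actually extracts.

```python
from urllib.parse import urlparse


def characterize_url(url: str) -> dict:
    """Derive simple descriptive features from a URL (hypothetical feature set)."""
    parts = urlparse(url)
    host = parts.hostname or ""
    # Non-empty path segments, e.g. ["iris", "research", "prism", "index.html"]
    segments = [s for s in parts.path.split("/") if s]
    last = segments[-1] if segments else ""
    extension = last.rsplit(".", 1)[1].lower() if "." in last else ""
    return {
        "scheme": parts.scheme,
        "host": host,
        # The last host label hints at the kind of organization (edu, gov, org, ...)
        "top_level_domain": host.rsplit(".", 1)[-1] if "." in host else "",
        # Path depth is a rough proxy for where the page sits within the site
        "depth": len(segments),
        "extension": extension,
        # A query string suggests dynamically generated content
        "has_query": bool(parts.query),
    }


if __name__ == "__main__":
    info = characterize_url(
        "http://www.library.cornell.edu/iris/research/prism/index.html"
    )
    for key, value in info.items():
        print(f"{key}: {value}")
```

Features like the host and path depth could help establish the boundary of a resource, while the extension and query string give a first hint about its content.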

  9. Analysis
     Define risks associated with:
     • A Web page:
       • as a stand-alone object, ignoring its hyperlinks
       • in local context, considering the internal and external links
     • A Web site:
       • as a semantically coherent set of linked Web pages
       • as an entity in a broader technical and organizational context

  10. Contextual Layers

  11. Page-level Monitoring
      • Formatting: TIDY; standards compliance; document structure
      • Metadata: HTTP headers; HTML headers
      • Changes: content; location
      • Links: out-link structure; in-link structure; intra-site; hub; volatility
      • Page provenance: URL parsing; log analysis
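The out-link and intra-site indicators listed on this slide can be sketched with Python's standard library alone. This is only an illustration: the names `LinkCollector` and `classify_out_links` are hypothetical, and treating "same hostname" as "intra-site" is a simplifying assumption, not a rule stated in the slides.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def classify_out_links(page_url: str, html: str) -> dict:
    """Split a page's out-links into intra-site and external links."""
    collector = LinkCollector()
    collector.feed(html)
    site = urlparse(page_url).hostname
    result = {"intra_site": [], "external": []}
    for href in collector.hrefs:
        absolute = urljoin(page_url, href)  # resolve relative links
        parsed = urlparse(absolute)
        if parsed.scheme not in ("http", "https"):
            continue  # skip mailto:, javascript:, and similar
        # Assumption: links to the same hostname count as intra-site
        key = "intra_site" if parsed.hostname == site else "external"
        result[key].append(absolute)
    return result


if __name__ == "__main__":
    sample = (
        '<a href="b.html">local</a> '
        '<a href="http://other.example.com/">remote</a> '
        '<a href="mailto:someone@example.org">mail</a>'
    )
    print(classify_out_links("http://example.org/a/index.html", sample))
```

Running the same classification on successive captures of a page would give a rough view of link volatility over time.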

  12. Site-level Monitoring
      • Graph analysis
      • Static site analysis and longitudinal study
      • Aggregate page analyses
      • Site maintenance indicators
      • Backup and archiving policies and procedures
      • Hardware and software environment
      • Network configuration and maintenance

  13. Appraisal
      Enable portfolio management.
      Hypothetical appraisal of a Web resource:
      • Scope: highly relevant
      • Value: high value, not essential; numerous links to page
      • Relationship: secondary archives; informal agreement
      • Maintenance: key indicators of good management
      • Redundancy: captured by more than one archive
      • Risk response: very responsive to risk notifications
      • Capture: complex structure; cyclical updates; formats
      • Size: medium-sized; 3-level crawl

  14. Portfolio Management

  15. Strategy
      Develop an organization-specific program:

  16. Detection
      Monitor change; initiate response.
      Track indicators of management practices:
      - markup language: version, formatting, compliance
      - HTTP: status codes, header content
      - changes: content, location
      - links: internal, external, volatility
      - server: security, version, upgrades, responsiveness
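A few of the indicators above (HTTP status, header content, content and location changes) can be compared across two captures of the same page. The sketch below is a hedged illustration: the `Snapshot` record, its fields, and the wording of the findings are all assumptions, and the fetching step is deliberately left out so the example runs without network access.

```python
import hashlib
from dataclasses import dataclass


@dataclass(frozen=True)
class Snapshot:
    """One capture of a page (hypothetical record layout)."""
    status: int      # HTTP status code
    location: str    # final URL after any redirects
    server: str      # value of the Server response header
    body: bytes      # raw page content


def content_digest(body: bytes) -> str:
    """Fingerprint the content so captures can be compared cheaply."""
    return hashlib.sha256(body).hexdigest()


def detect_changes(previous: Snapshot, current: Snapshot) -> list:
    """Return a list of human-readable findings between two captures."""
    findings = []
    if current.status >= 400:
        findings.append(f"unavailable: HTTP {current.status}")
    if current.location != previous.location:
        findings.append("location changed")
    if current.server != previous.server:
        findings.append("server header changed")
    # Only compare content when the page was actually retrieved
    if current.status < 400 and content_digest(current.body) != content_digest(previous.body):
        findings.append("content changed")
    return findings


if __name__ == "__main__":
    old = Snapshot(200, "http://example.org/a", "Apache/1.3.20", b"<p>v1</p>")
    new = Snapshot(200, "http://example.org/a", "Apache/1.3.26", b"<p>v2</p>")
    print(detect_changes(old, new))
```

A change in the server header, for example, may indicate an upgrade, which is one of the management-practice indicators the slide lists.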

  17. Detection (cont.)
      Monitor change; initiate response.
      Identify potential risks:
      - probable occurrence
      - frequency of occurrence
      - degree of impact
      Correlate to program-defined response levels
      Identify appropriate risk/response scenario(s)
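One way to picture correlating the three risk dimensions above to program-defined response levels is a simple scoring function. Everything specific here is an assumption for illustration: the multiplicative score, the thresholds, and the level names "monitor", "notify", and "intervene" are hypothetical and would be set by each organization's own program.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Risk:
    name: str
    probability: float  # probable occurrence, from 0.0 to 1.0
    frequency: float    # expected occurrences per year
    impact: int         # degree of impact, from 1 (minor) to 5 (severe)


def risk_score(risk: Risk) -> float:
    """Combine the three dimensions into one number (assumed formula)."""
    return risk.probability * risk.frequency * risk.impact


def response_level(risk: Risk, notify_at: float = 1.0, intervene_at: float = 5.0) -> str:
    """Map a score to a response level using hypothetical thresholds."""
    score = risk_score(risk)
    if score >= intervene_at:
        return "intervene"
    if score >= notify_at:
        return "notify"
    return "monitor"


if __name__ == "__main__":
    risks = [
        Risk("broken internal links", probability=0.5, frequency=4, impact=3),
        Risk("markup validation errors", probability=0.5, frequency=2, impact=2),
        Risk("server software out of date", probability=0.1, frequency=1, impact=2),
    ]
    for risk in risks:
        print(f"{risk.name}: {response_level(risk)}")
```

Keeping the thresholds as parameters reflects the slide's point that response levels are defined by the program rather than fixed in the tooling.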

  18. Response
      Develop a toolkit:
      - Inventory and evaluate existing tools
      - Assess functionality for Prism stages
      - Adopt/adapt existing tools
      - Develop new tools
      - Apply to appropriate contextual layers
      - Integrate tools into customizable toolkit

  19. Types of Tools
      • link analyzers
      • log analyzers
      • Web crawlers
      • Web visualization programs
      • Web site management utilities

  20. Future Directions
      • Preservation Risk Management Program:
        • Develop program using Prism framework
        • Provide organizational scenarios
      • Toolkit:
        • Complete inventory of tools
        • Build toolkit demonstrator
      • Applications:
        • Develop presentation techniques for stored resources
        • Enable risk/response scenario development
