PANDAS PANDORA Digital Archivin


Presentation Transcript


    1. PANDAS
        PANDORA Digital Archiving System
        Archiving Web Resources Conference Information Day
        Canberra, 12 November 2004
        Paul Koerbin
        Digital Archiving Branch
        National Library of Australia
        pkoerbin@nla.gov.au

    2. PANDAS Description and purpose
        - PANDORA Digital Archiving System
        - Web-based workflow management system
        - Developed to manage the web archiving processes at the National Library of Australia
        - Written in Java on Apple WebObjects application development platform
        - First version released in June 2001
        - Second (current) version released August 2002
        Notes: Partners – used by partners from Perth to Sydney and from Darwin to Melbourne

    3. PANDAS Description and purpose
        - Record administrative metadata about titles selected (or rejected or monitored) for national preservation
        - Schedule and initiate harvesting
        - Manage the quality assurance process and associated problem reporting and fixing
        - Prepare items for public display through the PANDORA home page
        - Manage access restrictions
        - Generate management reports

    4. PANDAS

    5. PANDAS How it works
        - Connects with and utilises other software and protocols for specific functions
        - Provides an interface to the harvesting software – currently this is HTTrack (http://www.httrack.com)
        - Uses WebDAV protocol to provide content managers with remote access to the harvested files
        - Uses Z39.50 protocol to access the National Bibliographic Database to extract metadata from the MARC record
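
As a rough illustration of what an interface to the harvesting software might do, the sketch below assembles an HTTrack command line for a single title. The `httrack <URL> -O <path>` form is HTTrack's basic documented usage; the function name `build_httrack_command`, the per-title directory layout, and the example URL are hypothetical, and nothing here reflects how PANDAS itself invokes HTTrack.

```python
import shlex
from pathlib import PurePosixPath


def build_httrack_command(seed_url: str, archive_root: str, title_id: int) -> list:
    """Return an argv list that mirrors seed_url into a per-title directory.

    Hypothetical helper: the <archive_root>/<title_id> layout is an assumption
    made for this sketch, not the PANDAS storage layout.
    """
    if not seed_url.startswith(("http://", "https://")):
        raise ValueError(f"not an HTTP(S) URL: {seed_url!r}")
    output_dir = PurePosixPath(archive_root) / str(title_id)
    # `httrack <URL> -O <path>` mirrors the site into <path>.
    return ["httrack", seed_url, "-O", str(output_dir)]


# Build (but do not run) the command for an example title.
command = build_httrack_command("http://www.example.org/", "/tmp/harvest", 21220)
print(shlex.join(command))
```

Returning an argument list rather than a shell string means the command could later be passed to `subprocess.run` without shell quoting concerns.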

    6. PANDAS How it works
        - Title and subject listings and title entry pages are generated ‘on-the-fly’ from PANDAS metadata
        - Some static web pages (documents, information)
        - Search engine
        - Unique identifying number generated by PANDAS
        - Persistent URL applied to title entry page: http://nla.gov.au/nla.arc-21220
        Notes: I mentioned that PANDORA has a specific meaning referring to the public interface to the archive … Testing UltraSeek search engine
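
The persistent URL on this slide pairs a fixed prefix with the identifying number. A minimal sketch of that mapping follows, assuming the `http://nla.gov.au/nla.arc-<number>` pattern generalises from the single example given; the function names are hypothetical and this is not the PANDAS implementation.

```python
import re

# Prefix taken from the one example on the slide; assumed to generalise.
PERSISTENT_PREFIX = "http://nla.gov.au/nla.arc-"
_PERSISTENT_PATTERN = re.compile(r"^http://nla\.gov\.au/nla\.arc-(\d+)$")


def persistent_url(title_id: int) -> str:
    """Build the persistent URL for a title's identifying number."""
    if title_id <= 0:
        raise ValueError("title_id must be a positive integer")
    return f"{PERSISTENT_PREFIX}{title_id}"


def title_id_from_url(url: str) -> int:
    """Recover the identifying number from a persistent URL."""
    match = _PERSISTENT_PATTERN.match(url)
    if match is None:
        raise ValueError(f"not a recognised persistent URL: {url!r}")
    return int(match.group(1))


print(persistent_url(21220))  # prints http://nla.gov.au/nla.arc-21220
```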

    7. PANDAS Demonstration

    8. PANDAS Planned developments and future directions
        - Ongoing development and enhancement of PANDAS
        - Improve robustness of system
        - Re-engineer PANDAS software
        - Need to achieve greater efficiencies and increase scale of web archiving activity

    9. PANDAS Planned developments and future directions
        - Automatically ingest and process larger volume of online publications and associated metadata – batches
        - Comply with international standards and adopt standard tools – IIPC
        - Incorporate other collection methods – domain harvesting, deep web, deposit
        Notes: Our roadmap for the medium term future

    10. PANDAS Planned developments and future directions
        - Automate collection of more preservation metadata and develop metadata management interface
        - Improve access and discovery paths to the Archive’s resources as it continues to grow

    11. PANDAS Availability
        - PANDORA partner agencies
        - Authenticated users
        - Public access to archived resources
        - PANDAS evaluation system
        - Documentation (manuals, data model etc) available online at http://pandora.nla.gov.au

    12. PANDAS Questions?
