E N D
1. PANDASPANDORA Digital Archiving SystemArchiving Web Resources Conference Information DayCanberra, 12 November 2004Paul KoerbinDigital Archiving BranchNational Library of Australiapkoerbin@nla.gov.au
2. PANDAS Description and purpose
PANDORA Digital Archiving System
Web-based workflow management system
Developed to manage the web archiving processes at the National Library of Australia
Written in Java on Apple WebObjects application development platform
First version released in June 2001
Second (current) version released August 2002 Partners – used by partners from Perth to Sydney and from Darwin to MelbournePartners – used by partners from Perth to Sydney and from Darwin to Melbourne
3. PANDAS Description and purpose
Record administrative metadata about titles selected (or rejected or monitored) for national preservation
Schedule and initiate harvesting
Manage the quality assurance process and associated problem reporting and fixing
Prepare items for public display through the PANDORA home page
Manage access restrictions
Generate management reports
4. PANDAS
5. PANDAS How it works
Connects with and utilises other software and protocols for specific functions
Provides an interface to the harvesting software – currently this is HTTrack (http://www.httrack.com)
Uses WebDAV protocol to provide content managers with remote access to the harvested files
Uses Z39.50 protocol to access the National Bibliographic Database to extract metadata from the MARC record
6. PANDAS How it works
Title and subject listings and title entry pages are generated ‘on-the-fly’ from PANDAS metadata
Some static web pages (documents, information)
Search engine
Unique identifying number generated by PANDAS
Persistent URL applied to title entry page
http://nla.gov.au/nla.arc-21220 I mentioned that PANDORA has a specific meaning referring to the public interface to the archive …
Testing UltraSeek search engineI mentioned that PANDORA has a specific meaning referring to the public interface to the archive …
Testing UltraSeek search engine
7. PANDAS
Demonstration
8. PANDAS Planned developments and future directions
Ongoing development and enhancement of PANDAS
Improve robustness of system
Re-engineer PANDAS software
Need to achieve greater efficiencies and increase scale of web archiving activity
9. PANDAS Planned developments and future directions
Automatically ingest and process larger volume of online publications and associated metadata – batches
Comply with international standards and adopt standard tools – IIPC
Incorporate other collection methods – domain harvesting, deep web, deposit Our roadmap for the medium term futureOur roadmap for the medium term future
10. PANDAS Planned developments and future directions
Automate collection of more preservation metadata and develop metadata management interface
Improve access and discovery paths to the Archive’s resources as it continues to grow
11. PANDAS Availability
PANDORA partner agencies
Authenticated users
Public access to archived resources
PANDAS evaluation system
Documentation (manuals, data model etc) available online at http://pandora.nla.gov.au
12. PANDAS
Questions?