1 / 23

Web-based METS Creation for High Quality Mass Digitization

Learn about the benefits of using METS for creating high-quality mass digitization projects, including easy-to-use web-based METS metadata editors, flexible metadata sets, workflow orchestration, access roles and permissions, long-term preservation, and more. Join us to support the development of this open-source collaborative project!

terrycurtis
Download Presentation

Web-based METS Creation for High Quality Mass Digitization

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. case study web based METS creationRalf Stockmann (stockmann@sub.uni-goettingen.de)

  2. Why METS?The new paradigm: connecting content Present Portal Websites Federated Search Past Project Websites Repositories

  3. Future • Decentralized Web services • Relying on • Personalization • Social / Scientific Communities • Semantic Relations • Grid Computing • Offering: • Dynamic Services (private bookshelf, …) • Tools for Analysis, Annotation, Linking, Rating, Tagging • Collaborative Workspaces • Referencing single digital objects, or even parts of them • “Scientific Mashups” • Online / Offline • Interfaces and Protocols

  4. Consequences • Shift of Relevance • Less: • Originator / host of content • Low quality images • “Black Box” software architecture with “vanilla” features • More: • Metadata • Fulltext • Addressable sub-parts of an object • High resolution images • Interfaces • Specialized, encapsulated, connectable tools • METS • “Self-Awareness” of every document/file

  5. Web bases METS creation for high quality mass digitisation • Easy to use, collaborative web based METS metadata editor • Flexible metadata sets • Workflow orchestration • Access roles and permissions • Presentation and usage • Long term preservation • “Scan to EDL / WDL / …” • Open Source / Collaborative Development

  6. Create volume metadata based on catalog data

  7. Document model with two structures Logical structure Phys. structure Content files Monograph Bound Book 00000001.tif Page 00000002.tif Chapter Page 00000003.tif Chapter Page 00000004.tif Page 00000005.tif Chapter page area 00000006.tif Chapter Page 00000007.tif Chapter Page 00000008.tif Page HiRes01.jpg Thumb01.jpg Page Fulltext.xml

  8. Building logical and physical structures

  9. Exporting METS

  10. Controlling

  11. Workflow Orchestration

  12. Visualisation

  13. Full Text Search

  14. Image Highlighting

  15. Table of Content

  16. Metadata

  17. PDF Download

  18. Presenting (TEI) Full Text

  19. Handling Metadata and METS • Fulltext is referenced, not embedded in METS file due to file sizes. • METS file is about 2 – 3 MB • Fulltext is about 20 MB • Use MODS for descriptive metadata for logical structure entities • PREMIS preservation metadata • Own descriptive metadata schema for physical structure entities – storing page numbers

  20. Availability • Offering a full-flavored framework for digital libraries • Open Source • Components • LINUX / UNIX Filesystem • JAVA (min 1.5) • Tomcat & Apache • MYSQL • TYPO3 (PHP) • WebDAV • LDAP • Subversion Server • Work in progress: support model

  21. Join us!

More Related