Cezary Mazurek (mazurek@man.poznan.pl)
Sponsored Links
This presentation is the property of its rightful owner.
1 / 21

PIONIER Network Digital Libraries Federation Experiences of a large scale metadata aggregator PowerPoint PPT Presentation


  • 83 Views
  • Uploaded on
  • Presentation posted in: General

Cezary Mazurek ([email protected]) Marcin Werla ([email protected]) Poznań Supercomputing and Networking Center (Poznań, Poland). PIONIER Network Digital Libraries Federation Experiences of a large scale metadata aggregator. Polish Optical Internet PIONIER.

Download Presentation

PIONIER Network Digital Libraries Federation Experiences of a large scale metadata aggregator

An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -

Presentation Transcript


Cezary Mazurek ([email protected])

Marcin Werla ([email protected])

Poznań Supercomputing and Networking Center (Poznań, Poland)

PIONIER Network Digital Libraries FederationExperiences of a large scale metadata aggregator

ECDL 2009, Corfu, Greece


Polish Optical InternetPIONIER

ECDL 2009, Corfu, Greece


Digital libraries in the PIONIER Network – organizational models

  • Main organizational models

    • Regional digital libraries

      • Created and maintained by several institutions from particular region

      • Gather mostly resources related to the region, its history and culture but also academic educational materials and national cultural heritage

    • Institutional digital libraries

      • Created and maintained by single institutions (like universities)

      • Gather mostly resources related to present activities (like institutional repositories) and history of the institution

  • In many cases the technical base and support for digital libraries is provided by local computing or networking centres (like PSNC)

ECDL 2009, Corfu, Greece


Digital Libraries in Poland

  • Overall number of digital objects

  • 285 thousands

  • Number of active digital libraries:

  • 19 regional

  • 21 institutional

  • Number of cooperating

  • institutions:

  • Several hundreds of libraries, museums and archives

+ several other digital libraries in the phase of planning, configuration

or initial content uploading

Regional digital libraries

Institutional digital libraries

ECDL 2009, Corfu, Greece


Digital Libraries Federation

  • Main aims

    • To facilitate the use of resources from Polish digital libraries

    • To increase the visibility of these resources in the Internet

    • To create new, advanced network services both for end-users and digital libraries creators on the base of these resources

ECDL 2009, Corfu, Greece


Digital Libraries Federation

  • Basic assumptions

    • No need nor requirement to move resources to the DLF

    • No fees for the use of the DLF and for being a part of it

    • Open standards are the basis for cooperation

      • Particular digital libraries can use different technological platforms

ECDL 2009, Corfu, Greece


Digital Libraries Federation

  • Basic functions

    • Search in the available publications

      • Simple

      • Advanced

    • Digitization plans

      • Searchable

      • Report

      • API for the prevention of duplicted digitization

    • Location of digital objects on the basis of their OAI Identifiers

    • Database of Polish digital libraries

    • Statistics and reports

  • Information in the DLF is updated on the daily (nightly) basis

ECDL 2009, Corfu, Greece


Digital Libraries Federation

  • See it:

    http://fbc.pionier.net.pl/

ECDL 2009, Corfu, Greece


Digital LibrariesFederationsearchplugin

ECDL 2009, Corfu, Greece


Digital Libraries Federation as a metadata aggregator for Europeana

Metadata aggregator

Digital libraries

Institutions

ECDL 2009, Corfu, Greece


Digital Libraries Federation as a metadata aggregator for Europeana

  • We gather the information about content providers and their information systems

  • Database of Polish Digital Libraries in the DLF

ECDL 2009, Corfu, Greece


Digital Libraries Federation as a metadata aggregator for Europeana

  • We gather the metadata of objects that should be visible in Europeana

  • Done with the OAI-PMH

    • In most cases we require the OAI-PMH interface

    • In really special cases we can do it in different way (eg. Polish Internet Library)

  • Now we harvest only Dublin Core Simple

    • Works on new national metadata schema started in September 2009

    • Approximate time of development: 3 months

    • Approximate time of deployment: ???

ECDL 2009, Corfu, Greece


Digital Libraries Federation as a metadata aggregator for Europeana

  • We will try to clean-up the metadata, normalize it and enrich

    • On the DLF level there are automatically built dictionaries on the basis of aggregated metadata

      • Separately for each metadata element

      • Separately for each metadata language

    • Differences between the metadata from various digital libraries have negative impact for the searching possibilities of the end-users

    • That is why the metadata normalization is so important

    • The basic analysis shows which elements are crucial and which should be easy to clean-up

      • The analysis was done in April 2009 on the metadata of 214 254 aggregated objects

ECDL 2009, Corfu, Greece


Digital Libraries Federation as a metadata aggregator for Europeana

ECDL 2009, Corfu, Greece


Digital Libraries Federation as a metadata aggregator for Europeana

  • Format

    • In 99% of descriptions: MIME type(eg. text/html, image/x.djvu)

  • Language

    • In most cases: ISO 639-2 (pol, ger, lat, fre etc.)

    • Sometimes one value „pol, ger” instead of „pol”, „ger”

  • Rights

    • Name of the institution which holds the original object

  • Type

ECDL 2009, Corfu, Greece


Digital Libraries Federation as a metadata aggregator for Europeana

ECDL 2009, Corfu, Greece


Digital Libraries Federation as a metadata aggregator for Europeana

ECDL 2009, Corfu, Greece


Subject - Most frequent values

(Polish version of objects’ description)

Confused with coverage:

temporal

spatial

ECDL 2009, Corfu, Greece


Publisher – Most frequent values

(Polish version of objects’ description)

Geographical location…

ECDL 2009, Corfu, Greece


Summary

  • We have over 40 digital libraries in Poland which are filled with content and metadata coming from hundreds of institutions from different domains

  • We harvest the metadata and provide a single point of access to it

    • The PIONIER Network Digital Libraries Federation (http://fbc.pionier.net.pl/)

    • The software used for this service will be released as an open-source by the end of this year

  • Cooperation with Europeana (but not only this) requires cleaning-up and normalization of metadata

  • This is currently our biggest challenge

    • But we do not want to solve it only by technical means on the level of our aggregator

    • Close cooperation with content providers and some organizational changes prepared by them should effect in more efficient and sustainable metadata improvement process than a purely technical solution

ECDL 2009, Corfu, Greece


Cezary Mazurek ([email protected])

Marcin Werla ([email protected])

Poznań Supercomputing and Networking Center (Poznań, Poland)

PIONIER Network Digital Libraries FederationExperiences of a large scale metadata aggregator

Thank you for your attention. Any questions?

ECDL 2009, Corfu, Greece


  • Login