Cezary Mazurek (mazurek@man.poznan.pl)
This presentation is the property of its rightful owner.
Sponsored Links
1 / 21

PIONIER Network Digital Libraries Federation Experiences of a large scale metadata aggregator PowerPoint PPT Presentation


  • 73 Views
  • Uploaded on
  • Presentation posted in: General

Cezary Mazurek ([email protected]) Marcin Werla ([email protected]) Poznań Supercomputing and Networking Center (Poznań, Poland). PIONIER Network Digital Libraries Federation Experiences of a large scale metadata aggregator. Polish Optical Internet PIONIER.

Download Presentation

PIONIER Network Digital Libraries Federation Experiences of a large scale metadata aggregator

An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -

Presentation Transcript


Pionier network digital libraries federation experiences of a large scale metadata aggregator

Cezary Mazurek ([email protected])

Marcin Werla ([email protected])

Poznań Supercomputing and Networking Center (Poznań, Poland)

PIONIER Network Digital Libraries FederationExperiences of a large scale metadata aggregator

ECDL 2009, Corfu, Greece


Polish optical internet pionier

Polish Optical InternetPIONIER

ECDL 2009, Corfu, Greece


Digital libraries in the pionier network organizational models

Digital libraries in the PIONIER Network – organizational models

  • Main organizational models

    • Regional digital libraries

      • Created and maintained by several institutions from particular region

      • Gather mostly resources related to the region, its history and culture but also academic educational materials and national cultural heritage

    • Institutional digital libraries

      • Created and maintained by single institutions (like universities)

      • Gather mostly resources related to present activities (like institutional repositories) and history of the institution

  • In many cases the technical base and support for digital libraries is provided by local computing or networking centres (like PSNC)

ECDL 2009, Corfu, Greece


Digital libraries in poland

Digital Libraries in Poland

  • Overall number of digital objects

  • 285 thousands

  • Number of active digital libraries:

  • 19 regional

  • 21 institutional

  • Number of cooperating

  • institutions:

  • Several hundreds of libraries, museums and archives

+ several other digital libraries in the phase of planning, configuration

or initial content uploading

Regional digital libraries

Institutional digital libraries

ECDL 2009, Corfu, Greece


Digital libraries federation

Digital Libraries Federation

  • Main aims

    • To facilitate the use of resources from Polish digital libraries

    • To increase the visibility of these resources in the Internet

    • To create new, advanced network services both for end-users and digital libraries creators on the base of these resources

ECDL 2009, Corfu, Greece


Digital libraries federation1

Digital Libraries Federation

  • Basic assumptions

    • No need nor requirement to move resources to the DLF

    • No fees for the use of the DLF and for being a part of it

    • Open standards are the basis for cooperation

      • Particular digital libraries can use different technological platforms

ECDL 2009, Corfu, Greece


Digital libraries federation2

Digital Libraries Federation

  • Basic functions

    • Search in the available publications

      • Simple

      • Advanced

    • Digitization plans

      • Searchable

      • Report

      • API for the prevention of duplicted digitization

    • Location of digital objects on the basis of their OAI Identifiers

    • Database of Polish digital libraries

    • Statistics and reports

  • Information in the DLF is updated on the daily (nightly) basis

ECDL 2009, Corfu, Greece


Digital libraries federation3

Digital Libraries Federation

  • See it:

    http://fbc.pionier.net.pl/

ECDL 2009, Corfu, Greece


Pionier network digital libraries federation experiences of a large scale metadata aggregator

Digital LibrariesFederationsearchplugin

ECDL 2009, Corfu, Greece


Digital libraries federation as a metadata aggregator for europeana

Digital Libraries Federation as a metadata aggregator for Europeana

Metadata aggregator

Digital libraries

Institutions

ECDL 2009, Corfu, Greece


Digital libraries federation as a metadata aggregator for europeana1

Digital Libraries Federation as a metadata aggregator for Europeana

  • We gather the information about content providers and their information systems

  • Database of Polish Digital Libraries in the DLF

ECDL 2009, Corfu, Greece


Digital libraries federation as a metadata aggregator for europeana2

Digital Libraries Federation as a metadata aggregator for Europeana

  • We gather the metadata of objects that should be visible in Europeana

  • Done with the OAI-PMH

    • In most cases we require the OAI-PMH interface

    • In really special cases we can do it in different way (eg. Polish Internet Library)

  • Now we harvest only Dublin Core Simple

    • Works on new national metadata schema started in September 2009

    • Approximate time of development: 3 months

    • Approximate time of deployment: ???

ECDL 2009, Corfu, Greece


Digital libraries federation as a metadata aggregator for europeana3

Digital Libraries Federation as a metadata aggregator for Europeana

  • We will try to clean-up the metadata, normalize it and enrich

    • On the DLF level there are automatically built dictionaries on the basis of aggregated metadata

      • Separately for each metadata element

      • Separately for each metadata language

    • Differences between the metadata from various digital libraries have negative impact for the searching possibilities of the end-users

    • That is why the metadata normalization is so important

    • The basic analysis shows which elements are crucial and which should be easy to clean-up

      • The analysis was done in April 2009 on the metadata of 214 254 aggregated objects

ECDL 2009, Corfu, Greece


Digital libraries federation as a metadata aggregator for europeana4

Digital Libraries Federation as a metadata aggregator for Europeana

ECDL 2009, Corfu, Greece


Digital libraries federation as a metadata aggregator for europeana5

Digital Libraries Federation as a metadata aggregator for Europeana

  • Format

    • In 99% of descriptions: MIME type(eg. text/html, image/x.djvu)

  • Language

    • In most cases: ISO 639-2 (pol, ger, lat, fre etc.)

    • Sometimes one value „pol, ger” instead of „pol”, „ger”

  • Rights

    • Name of the institution which holds the original object

  • Type

ECDL 2009, Corfu, Greece


Digital libraries federation as a metadata aggregator for europeana6

Digital Libraries Federation as a metadata aggregator for Europeana

ECDL 2009, Corfu, Greece


Digital libraries federation as a metadata aggregator for europeana7

Digital Libraries Federation as a metadata aggregator for Europeana

ECDL 2009, Corfu, Greece


Subject most frequent values

Subject - Most frequent values

(Polish version of objects’ description)

Confused with coverage:

temporal

spatial

ECDL 2009, Corfu, Greece


Publisher most frequent values

Publisher – Most frequent values

(Polish version of objects’ description)

Geographical location…

ECDL 2009, Corfu, Greece


Summary

Summary

  • We have over 40 digital libraries in Poland which are filled with content and metadata coming from hundreds of institutions from different domains

  • We harvest the metadata and provide a single point of access to it

    • The PIONIER Network Digital Libraries Federation (http://fbc.pionier.net.pl/)

    • The software used for this service will be released as an open-source by the end of this year

  • Cooperation with Europeana (but not only this) requires cleaning-up and normalization of metadata

  • This is currently our biggest challenge

    • But we do not want to solve it only by technical means on the level of our aggregator

    • Close cooperation with content providers and some organizational changes prepared by them should effect in more efficient and sustainable metadata improvement process than a purely technical solution

ECDL 2009, Corfu, Greece


Pionier network digital libraries federation experiences of a large scale metadata aggregator

Cezary Mazurek ([email protected])

Marcin Werla ([email protected])

Poznań Supercomputing and Networking Center (Poznań, Poland)

PIONIER Network Digital Libraries FederationExperiences of a large scale metadata aggregator

Thank you for your attention. Any questions?

ECDL 2009, Corfu, Greece


  • Login