Cezary Mazurek (mazurek@man.poznan.pl)
Download
1 / 21

PIONIER Network Digital Libraries Federation Experiences of a large scale metadata aggregator - PowerPoint PPT Presentation


  • 113 Views
  • Uploaded on

Cezary Mazurek ([email protected]) Marcin Werla ([email protected]) Poznań Supercomputing and Networking Center (Poznań, Poland). PIONIER Network Digital Libraries Federation Experiences of a large scale metadata aggregator. Polish Optical Internet PIONIER.

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about ' PIONIER Network Digital Libraries Federation Experiences of a large scale metadata aggregator' - oliana


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript

Cezary Mazurek ([email protected])

Marcin Werla ([email protected])

Poznań Supercomputing and Networking Center (Poznań, Poland)

PIONIER Network Digital Libraries FederationExperiences of a large scale metadata aggregator

ECDL 2009, Corfu, Greece


Polish optical internet pionier
Polish Optical InternetPIONIER

ECDL 2009, Corfu, Greece


Digital libraries in the pionier network organizational models
Digital libraries in the PIONIER Network – organizational models

  • Main organizational models

    • Regional digital libraries

      • Created and maintained by several institutions from particular region

      • Gather mostly resources related to the region, its history and culture but also academic educational materials and national cultural heritage

    • Institutional digital libraries

      • Created and maintained by single institutions (like universities)

      • Gather mostly resources related to present activities (like institutional repositories) and history of the institution

  • In many cases the technical base and support for digital libraries is provided by local computing or networking centres (like PSNC)

ECDL 2009, Corfu, Greece


Digital libraries in poland
Digital Libraries in Poland models

  • Overall number of digital objects

  • 285 thousands

  • Number of active digital libraries:

  • 19 regional

  • 21 institutional

  • Number of cooperating

  • institutions:

  • Several hundreds of libraries, museums and archives

+ several other digital libraries in the phase of planning, configuration

or initial content uploading

Regional digital libraries

Institutional digital libraries

ECDL 2009, Corfu, Greece


Digital libraries federation
Digital Libraries Federation models

  • Main aims

    • To facilitate the use of resources from Polish digital libraries

    • To increase the visibility of these resources in the Internet

    • To create new, advanced network services both for end-users and digital libraries creators on the base of these resources

ECDL 2009, Corfu, Greece


Digital libraries federation1
Digital Libraries Federation models

  • Basic assumptions

    • No need nor requirement to move resources to the DLF

    • No fees for the use of the DLF and for being a part of it

    • Open standards are the basis for cooperation

      • Particular digital libraries can use different technological platforms

ECDL 2009, Corfu, Greece


Digital libraries federation2
Digital Libraries Federation models

  • Basic functions

    • Search in the available publications

      • Simple

      • Advanced

    • Digitization plans

      • Searchable

      • Report

      • API for the prevention of duplicted digitization

    • Location of digital objects on the basis of their OAI Identifiers

    • Database of Polish digital libraries

    • Statistics and reports

  • Information in the DLF is updated on the daily (nightly) basis

ECDL 2009, Corfu, Greece


Digital libraries federation3
Digital Libraries Federation models

  • See it:

    http://fbc.pionier.net.pl/

ECDL 2009, Corfu, Greece


Digital modelsLibrariesFederationsearchplugin

ECDL 2009, Corfu, Greece


Digital libraries federation as a metadata aggregator for europeana
Digital Libraries Federation as a metadata aggregator for Europeana

Metadata aggregator

Digital libraries

Institutions

ECDL 2009, Corfu, Greece


Digital libraries federation as a metadata aggregator for europeana1
Digital Libraries Federation as a metadata aggregator for Europeana

  • We gather the information about content providers and their information systems

  • Database of Polish Digital Libraries in the DLF

ECDL 2009, Corfu, Greece


Digital libraries federation as a metadata aggregator for europeana2
Digital Libraries Federation as a metadata aggregator for Europeana

  • We gather the metadata of objects that should be visible in Europeana

  • Done with the OAI-PMH

    • In most cases we require the OAI-PMH interface

    • In really special cases we can do it in different way (eg. Polish Internet Library)

  • Now we harvest only Dublin Core Simple

    • Works on new national metadata schema started in September 2009

    • Approximate time of development: 3 months

    • Approximate time of deployment: ???

ECDL 2009, Corfu, Greece


Digital libraries federation as a metadata aggregator for europeana3
Digital Libraries Federation as a metadata aggregator for Europeana

  • We will try to clean-up the metadata, normalize it and enrich

    • On the DLF level there are automatically built dictionaries on the basis of aggregated metadata

      • Separately for each metadata element

      • Separately for each metadata language

    • Differences between the metadata from various digital libraries have negative impact for the searching possibilities of the end-users

    • That is why the metadata normalization is so important

    • The basic analysis shows which elements are crucial and which should be easy to clean-up

      • The analysis was done in April 2009 on the metadata of 214 254 aggregated objects

ECDL 2009, Corfu, Greece



Digital libraries federation as a metadata aggregator for europeana5
Digital Libraries Federation as a metadata aggregator for Europeana

  • Format

    • In 99% of descriptions: MIME type(eg. text/html, image/x.djvu)

  • Language

    • In most cases: ISO 639-2 (pol, ger, lat, fre etc.)

    • Sometimes one value „pol, ger” instead of „pol”, „ger”

  • Rights

    • Name of the institution which holds the original object

  • Type

ECDL 2009, Corfu, Greece




Subject most frequent values
Subject - Most frequent values Europeana

(Polish version of objects’ description)

Confused with coverage:

temporal

spatial

ECDL 2009, Corfu, Greece


Publisher most frequent values
Publisher – Most frequent values Europeana

(Polish version of objects’ description)

Geographical location…

ECDL 2009, Corfu, Greece


Summary
Summary Europeana

  • We have over 40 digital libraries in Poland which are filled with content and metadata coming from hundreds of institutions from different domains

  • We harvest the metadata and provide a single point of access to it

    • The PIONIER Network Digital Libraries Federation (http://fbc.pionier.net.pl/)

    • The software used for this service will be released as an open-source by the end of this year

  • Cooperation with Europeana (but not only this) requires cleaning-up and normalization of metadata

  • This is currently our biggest challenge

    • But we do not want to solve it only by technical means on the level of our aggregator

    • Close cooperation with content providers and some organizational changes prepared by them should effect in more efficient and sustainable metadata improvement process than a purely technical solution

ECDL 2009, Corfu, Greece


Cezary Mazurek ([email protected]) Europeana

Marcin Werla ([email protected])

Poznań Supercomputing and Networking Center (Poznań, Poland)

PIONIER Network Digital Libraries FederationExperiences of a large scale metadata aggregator

Thank you for your attention. Any questions?

ECDL 2009, Corfu, Greece


ad