CLARIN Metadata Infrastructure Component Metadata and intermediate solutions

CLARIN Metadata Infrastructure: Component Metadata and intermediate solutions. Daan Broeder, Claus Zinn, Dieter van Uytvanck - Max-Planck Institute for Psycholinguistics. CLARIN NL Info session, 1-7-2009. Content: Component metadata, Infrastructure, Intermediate solutions, CMD Toolkit


Presentation Transcript


  1. CLARIN Metadata Infrastructure: Component Metadata and intermediate solutions • Daan Broeder, Claus Zinn, Dieter van Uytvanck - Max-Planck Institute for Psycholinguistics • CLARIN NL Info session, 1-7-2009

  2. Content • Component metadata • Infrastructure • Intermediate solutions • CMD Toolkit • Create CMD components now • Virtual Language Observatory • What can we do with metadata

  3. Context • Other Metadata Infrastructures in our domain: • IMDI, OLAC/DC, TEI • Problems: • Inflexible: too many (IMDI) or too few (OLAC) fields • Limited interoperability • Problematic (unfamiliar) terminology for some sub-communities. • etc.

  4. CLARIN Project - CMDI • Metadata infrastructure based on a “Component Metadata Model” • Aims • Flexibility • Researchers should decide for themselves which metadata fits their needs • Offer ready-made metadata components • Allow creation of new metadata components where needed • Interoperability built in • Complete infrastructure: software for editing, harvesting, exploitation • Compatibility with existing frameworks: OLAC, IMDI

  5. CMDI history • Berlin WP2 workshop, Oct. 2008 • Oxford WP2 workshop, Feb. 2009 • Documents: • Metadata Infrastructure for Language Resources and Technology v3, Dec. 2008 • Metadata Infra Work Document, Feb. 2009 • Requirements for Virtual Collections, Mar. 2009 (limited circulation) • CMDI developers wiki • Nijmegen Developers Workshop, May 2009

  6. Metadata Components • Let's describe a sound recording • Technical Metadata: Sample frequency, Format, Size, …

  7. Metadata Components • Let's describe a sound recording • Language: Name, Id, … • Technical Metadata

  8. Metadata Components • Let's describe a sound recording • Actor: Name, Age, Sex, Language, … • Language • Technical Metadata

  9. Metadata Components • Let's describe a sound recording • Location: Continent, Country, Address, … • Actor • Language • Technical Metadata

  10. Metadata Components • Let's describe a sound recording • Project: Name, Contact, … • Location • Actor • Language • Technical Metadata

  11. Metadata Components • Let's describe a sound recording • Components: Project, Location, Actor, Language, Technical Metadata • Metadata profile • Metadata schema

  12. Metadata Components • Let's describe a sound recording • Components: Project, Location, Actor, Language, Technical Metadata • Metadata schema • Metadata description
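The build-up in slides 6 to 12 (components combined into a profile, from which a metadata description is produced) can be sketched roughly as follows. This is only an illustration in Python: the class `Component`, the function `describe`, the profile name `SoundRecording` and the XML element names are hypothetical and do not reproduce the actual CMDI schema or toolkit.

```python
from dataclasses import dataclass, field
import xml.etree.ElementTree as ET


@dataclass
class Component:
    """A reusable metadata component: plain fields plus nested components."""
    name: str
    fields: list[str] = field(default_factory=list)
    children: list["Component"] = field(default_factory=list)


def describe(component, values):
    """Build an XML element for `component` from a nested dict of values."""
    elem = ET.Element(component.name)
    for f in component.fields:
        if f in values:  # fields without a value are simply left out
            ET.SubElement(elem, f).text = str(values[f])
    for child in component.children:
        if child.name in values:
            elem.append(describe(child, values[child.name]))
    return elem


# Components as on the slides (names adapted to be valid XML element names).
language = Component("Language", ["Name", "Id"])
technical = Component("TechnicalMetadata", ["SampleFrequency", "Format", "Size"])
actor = Component("Actor", ["Name", "Age", "Sex"], [language])

# A profile is just a selection of components (hypothetical profile name).
profile = Component("SoundRecording", [], [actor, language, technical])

# A metadata description fills the profile with values (made-up sample data).
record = describe(profile, {
    "Actor": {"Name": "Speaker A", "Age": 34,
              "Language": {"Name": "Dutch", "Id": "nld"}},
    "Language": {"Name": "Dutch", "Id": "nld"},
    "TechnicalMetadata": {"SampleFrequency": 44100, "Format": "audio/x-wav"},
})
xml_text = ET.tostring(record, encoding="unicode")
```

Note that the `Language` component is reused twice, once inside `Actor` and once at the top level, which is the kind of reuse the component model is meant to allow.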

  13. Metadata Components: selecting metadata components from the registry • Component registry; components and fields shown in the diagram: Location, Text, Actor, Recording, Dance, CreationDate, Name, Country, BirthDate, Language, Type, MotherTongue, Title, Coordinates • User selects appropriate components to create a metadata description • Semantic interoperability partly solved via references to the ISOcat concept registry • ISOcat concept registry: Country dcr:1001, Language dcr:1002, BirthDate dcr:1000 • DCMI concept registry: Title dc:title
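The idea of concept references can be illustrated with a small lookup table. The concept identifiers below are the ones on the slide; the assignment of fields to components and the second, differently named field `("Address", "Nation")` are assumptions made for this sketch only.

```python
# (component, field) -> concept reference. Identifiers are taken from the slide;
# which component each field belongs to is an assumption for this sketch.
CONCEPT_LINKS = {
    ("Location", "Country"): "dcr:1001",
    ("Text", "Language"): "dcr:1002",
    ("Actor", "BirthDate"): "dcr:1000",
    ("Text", "Title"): "dc:title",
    # Hypothetical field from another component that uses a different name
    # for the same concept.
    ("Address", "Nation"): "dcr:1001",
}


def fields_for_concept(concept, links=CONCEPT_LINKS):
    """Return all (component, field) pairs that reference `concept`."""
    return sorted(f for f, c in links.items() if c == concept)


def same_concept(field_a, field_b, links=CONCEPT_LINKS):
    """True if both fields are linked to the same concept."""
    a = links.get(field_a)
    b = links.get(field_b)
    return a is not None and a == b


country_fields = fields_for_concept("dcr:1001")
```

Two fields with different names are treated as equivalent as long as they point to the same concept, which is what makes components from different authors comparable.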

  14. CLARIN MD Life-cycle • Infrastructure parts: Search Service, Semantic Mapping, Relation Registry, ISOcat Concept Registry, DCMI Concept Registry, other Concept Registry, CLARIN Component Registry, Metadata Repositories, Joint Metadata Repository • A metadata component profile is selected from the metadata component registry • A metadata schema is created from a selection of existing components; new components may be created if they have references to ISOcat • Metadata descriptions are created • Metadata is harvested by the OAI protocol • Search/browsing is performed on the metadata catalog using the ISO DCR and other concept registries and the CLARIN relation registry
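How a search service might use a relation registry to search across concept registries can be sketched as below. The relation between `dcr:1002` and `dc:language`, the record layout and all function names are hypothetical; the slides do not specify how the relation registry is structured.

```python
# Hypothetical relation registry: pairs of concepts treated as equivalent.
RELATIONS = [("dcr:1002", "dc:language")]

# Hypothetical harvested records, with fields keyed by concept reference.
RECORDS = [
    {"id": "rec-1", "fields": {"dcr:1002": "Dutch", "dcr:1001": "Netherlands"}},
    {"id": "rec-2", "fields": {"dc:language": "Dutch", "dc:title": "Interview"}},
    {"id": "rec-3", "fields": {"dc:language": "German"}},
]


def equivalent_concepts(concept, relations):
    """Collect every concept reachable from `concept` via the relations."""
    found = {concept}
    changed = True
    while changed:
        changed = False
        for a, b in relations:
            if a in found and b not in found:
                found.add(b)
                changed = True
            elif b in found and a not in found:
                found.add(a)
                changed = True
    return found


def search(records, concept, value, relations):
    """Return ids of records holding `value` under any equivalent concept."""
    concepts = equivalent_concepts(concept, relations)
    return [r["id"] for r in records
            if any(r["fields"].get(c) == value for c in concepts)]


hits = search(RECORDS, "dcr:1002", "Dutch", RELATIONS)
```

A query phrased in terms of one concept registry then also finds records described with another, provided a relation between the two concepts has been registered.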

  15. Current solution • What if you want to contribute metadata now? • The CLARIN ad-hoc registry (800+ resources, 130+ tools) • Provide IMDI or OLAC metadata • Harvesting (metadata transport) via: • OAI protocol for OLAC records or provide static records • XML harvesting for IMDI • Harvested metadata will be shown in a special CLARIN catalog. • Using the standard MPI/LAT catalog software • and integrated in VLO specializations
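As an illustration of harvesting via the OAI protocol (OAI-PMH), the request URLs a harvester sends could be built as in this sketch. The base URL is a placeholder and the function name is hypothetical; `olac` is assumed here as the metadata prefix offered by the repository for OLAC records.

```python
from urllib.parse import urlencode


def list_records_url(base_url, metadata_prefix="olac", resumption_token=None):
    """Build an OAI-PMH ListRecords request URL.

    In OAI-PMH, resumptionToken is an exclusive argument: a follow-up
    request carries only the verb and the token.
    """
    if resumption_token is not None:
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return base_url + "?" + urlencode(params)


# Placeholder endpoint, not a real CLARIN or MPI address.
first_request = list_records_url("https://example.org/oai")
next_request = list_records_url("https://example.org/oai",
                                resumption_token="abc123")
```

A harvester would fetch the first URL, read any resumption token from the response, and repeat with follow-up requests until no token is returned.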

  16. Use & Create CMD components now • What if you are adventurous? • The CLARIN metadata toolkit lets you start creating metadata components or use existing ones. • We have an existing set of components derived from: • IMDI metadata for sessions • IMDI catalog metadata • A small CLARIN NL project is planned to test and report on this • But you can try it too!

  17. The End
