
A web-based repository service for vocabularies and alignments in the Cultural Heritage domain

Lourens van der Meij, Antoine Isaac, Claus Zinn


Presentation Transcript


  1. A web-based repository service for vocabularies and alignments in the Cultural Heritage domain. Lourens van der Meij, Antoine Isaac, Claus Zinn

  2. Authors not here • Projects

  3. Using Semantic Web (SW) techniques for Cultural Heritage (CH) data

  4. Focus on vocabularies and alignments
     • Knowledge Organization Systems (KOS) like thesauri are used to describe cultural objects
     • Many different KOSs are used in different institutions
     • Merging them into one global vocabulary is neither realistic nor desirable

  5. Semantic matching as a solution to tackle semantic heterogeneity

  6. Eliciting needs for a repository
     Application cases
     • Semantic search and browsing
     • (Re-)Indexing
     Overall functions
     • Uniform access to vocabularies
     • Access & management of alignments
     Experiment idea: test SW techniques for flexibility, ease of re-use and linking models & data

  7. Existing RDF best practices: SKOS

     animals
       NT cats
     cats
       UF domestic cats
       RT wildcats
       BT animals
       SN used only for domestic cats
     domestic cats
       USE cats
     wildcats
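As a rough illustration of how the thesaurus entries on this slide translate to SKOS, the sketch below maps the relation codes (NT, BT, RT, UF, SN) to the corresponding SKOS properties and emits plain triples. The `example.org` URIs and the helper names are made up for the example. The "domestic cats USE cats" entry is not a separate concept: it is covered by the `skos:altLabel` on "cats".

```python
# Hypothetical illustration: URIs and helper names are assumptions,
# not part of the repository presented in the slides.
SKOS = "http://www.w3.org/2004/02/skos/core#"

# Thesaurus relation codes and the SKOS properties that express them.
RELATION_TO_SKOS = {
    "NT": SKOS + "narrower",
    "BT": SKOS + "broader",
    "RT": SKOS + "related",
    "UF": SKOS + "altLabel",
    "SN": SKOS + "scopeNote",
}

# The slide's example, keyed by preferred term.
THESAURUS = {
    "animals": [("NT", "cats")],
    "cats": [
        ("UF", "domestic cats"),
        ("RT", "wildcats"),
        ("BT", "animals"),
        ("SN", "used only for domestic cats"),
    ],
    "wildcats": [],
}


def concept_uri(term):
    """Mint a made-up concept URI for a preferred term."""
    return "http://example.org/thesaurus/" + term.replace(" ", "_")


def to_skos_triples(thesaurus):
    """Return (subject, predicate, object) triples for the thesaurus."""
    triples = []
    for term, relations in thesaurus.items():
        subject = concept_uri(term)
        triples.append((subject, SKOS + "prefLabel", term))
        for code, value in relations:
            # NT/BT/RT link two concepts; UF/SN carry literal text.
            obj = concept_uri(value) if code in ("NT", "BT", "RT") else value
            triples.append((subject, RELATION_TO_SKOS[code], obj))
    return triples


triples = to_skos_triples(THESAURUS)
for triple in triples:
    print(triple)
```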

  8. Existing RDF best practices: SKOS
     Crucial features for a repository
     • Vocabulary membership
     • Cross-vocabulary mapping properties
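The two features named on this slide can be sketched over plain triples: `skos:inScheme` records vocabulary membership, and the SKOS mapping properties (`skos:exactMatch`, `skos:closeMatch`, `skos:broadMatch`, `skos:narrowMatch`, `skos:relatedMatch`) link concepts across vocabularies. The vocabularies, URIs and function names below are invented for illustration only.

```python
# Hypothetical data and helper names; only the SKOS property names are real.
SKOS = "http://www.w3.org/2004/02/skos/core#"

MAPPING_PROPERTIES = {
    SKOS + "exactMatch",
    SKOS + "closeMatch",
    SKOS + "broadMatch",
    SKOS + "narrowMatch",
    SKOS + "relatedMatch",
}

TRIPLES = [
    ("http://example.org/vocA/cats", SKOS + "inScheme", "http://example.org/vocA"),
    ("http://example.org/vocA/cats", SKOS + "prefLabel", "cats"),
    ("http://example.org/vocB/felines", SKOS + "inScheme", "http://example.org/vocB"),
    ("http://example.org/vocA/cats", SKOS + "closeMatch", "http://example.org/vocB/felines"),
]


def concepts_in_scheme(triples, scheme):
    """Vocabulary membership: concepts declared skos:inScheme the given scheme."""
    return sorted(s for s, p, o in triples if p == SKOS + "inScheme" and o == scheme)


def cross_vocabulary_mappings(triples):
    """Mapping triples whose subject and object belong to different schemes."""
    # Simplification: assumes each concept belongs to a single scheme.
    scheme_of = {s: o for s, p, o in triples if p == SKOS + "inScheme"}
    return [
        (s, p, o)
        for s, p, o in triples
        if p in MAPPING_PROPERTIES and scheme_of.get(s) != scheme_of.get(o)
    ]


print(concepts_in_scheme(TRIPLES, "http://example.org/vocA"))
print(cross_vocabulary_mappings(TRIPLES))
```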

  9. Existing RDF best practices: OAEI
     From the Ontology Alignment Evaluation Initiative
     • Mapping cells
       • 2 entities being matched
       • 1 relation type (any!)
       • 1 measure
       • Provide hook for annotations
     • Alignments between ontologies as set of cells
       • Can also be annotated
     http://oaei.ontologymatching.org
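The cell and alignment model described on this slide could be captured with a data structure along these lines. This is only a sketch: the class and field names, the `[0, 1]` range check on the measure, and the sample URIs are assumptions made for the example, not the format's normative definition.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Cell:
    """One mapping cell: two entities, one relation, one measure."""
    entity1: str
    entity2: str
    relation: str            # any relation type, e.g. "=" or "<"
    measure: float           # confidence, assumed here to lie in [0, 1]
    annotations: tuple = ()  # hook for annotations, as (key, value) pairs


@dataclass
class Alignment:
    """An alignment between two ontologies, held as a set of cells."""
    onto1: str
    onto2: str
    cells: set = field(default_factory=set)
    annotations: dict = field(default_factory=dict)  # alignment-level annotations

    def add(self, cell):
        if not 0.0 <= cell.measure <= 1.0:
            raise ValueError("measure must be in [0, 1]")
        self.cells.add(cell)

    def above(self, threshold):
        """Cells whose measure reaches the threshold, strongest first."""
        selected = [c for c in self.cells if c.measure >= threshold]
        return sorted(selected, key=lambda c: (-c.measure, c.entity1, c.entity2))


alignment = Alignment(
    "http://example.org/vocA",
    "http://example.org/vocB",
    annotations={"method": "manual"},
)
alignment.add(Cell("http://example.org/vocA/cats", "http://example.org/vocB/chats",
                   "=", 1.0, (("creator", "example"),)))
alignment.add(Cell("http://example.org/vocA/cats", "http://example.org/vocB/felins",
                   "<", 0.6))

for cell in alignment.above(0.8):
    print(cell.entity1, cell.relation, cell.entity2, cell.measure)
```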

  10. Existing RDF best practices: OAEI

  11. Need for a service API?
     • Need for dedicated middleware: some requirements beyond basic data access are not met by standard SPARQL
       • Full-text search on labels
       • Ranking of results
       • Access control/authentication
       • Query complexity control
       • Linked Open Data (LOD) publication strategy
       • Other data exchange formats (JSON)
     • APIs are also a good way to structure practices in a domain
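To make three of these middleware requirements concrete (full-text search on labels, ranking of results, JSON output), here is a deliberately naive sketch. The label index, the scoring scheme and the function names are all invented for illustration. A real service would sit on a proper text index.

```python
import json

# Hypothetical label index: concept URI -> labels (preferred and alternative).
LABELS = {
    "http://example.org/voc/cats": ["cats", "domestic cats"],
    "http://example.org/voc/wildcats": ["wildcats"],
    "http://example.org/voc/animals": ["animals"],
}


def score(query, label):
    """Toy ranking: exact match > prefix match > substring match > no match."""
    q, text = query.lower(), label.lower()
    if text == q:
        return 3
    if text.startswith(q):
        return 2
    if q in text:
        return 1
    return 0


def search(query, labels=LABELS, limit=10):
    """Return a JSON array of matching concepts, best-ranked first."""
    hits = []
    for uri, concept_labels in labels.items():
        best = max((score(query, label) for label in concept_labels), default=0)
        if best > 0:
            hits.append({"uri": uri, "score": best})
    # Sort by descending score, then by URI for a stable order.
    hits.sort(key=lambda hit: (-hit["score"], hit["uri"]))
    # The limit caps the result size, a crude form of complexity control.
    return json.dumps(hits[:limit])


print(search("cats"))
```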

  12. API design
     • API is inspired by both SKOS and OAEI APIs
     • But dedicated to simple vocabularies: not fully-fledged ontologies
     • Dedicated to vocabularies and alignments: more than usual terminology repositories
     • Alignments are for simple vocabularies: restricting OAEI-based functions to SKOS mappings

  13. Distributed service architecture
     • A service can serve vocabularies, alignments, or both: fitting different stakeholder missions/interests
     • One service can sit on several others: distribution thought of as a scalability enabler; sends a reassuring message regarding access control
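The idea of one service sitting on several others can be sketched as a thin facade that delegates to underlying vocabulary and alignment services. Everything here is hypothetical: the classes are in-process stand-ins for what would be remote HTTP services, and none of the names come from the actual implementation.

```python
class VocabularyService:
    """Stand-in for a service that only serves vocabularies."""

    def __init__(self, labels):
        self._labels = dict(labels)

    def get_label(self, uri):
        return self._labels.get(uri)


class AlignmentService:
    """Stand-in for a service that only serves alignments."""

    def __init__(self, mappings):
        self._mappings = list(mappings)  # (source, relation, target) tuples

    def mappings_for(self, uri):
        return [m for m in self._mappings if m[0] == uri]


class CombinedService:
    """Sits on top of several services and delegates requests to them."""

    def __init__(self, vocabulary_services, alignment_services):
        self._vocabularies = list(vocabulary_services)
        self._alignments = list(alignment_services)

    def get_label(self, uri):
        # Ask each underlying vocabulary service until one knows the concept.
        for service in self._vocabularies:
            label = service.get_label(uri)
            if label is not None:
                return label
        return None

    def labelled_mappings(self, uri):
        # Combine alignment data with labels held by other services.
        results = []
        for service in self._alignments:
            for source, relation, target in service.mappings_for(uri):
                results.append(
                    (self.get_label(source), relation, self.get_label(target))
                )
        return results


combined = CombinedService(
    [VocabularyService({"a:cats": "cats"}), VocabularyService({"b:chats": "chats"})],
    [AlignmentService([("a:cats", "exactMatch", "b:chats")])],
)
print(combined.labelled_mappings("a:cats"))
```

Each institution keeps control of its own service and data, while the combined service only exposes what the underlying services choose to return.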

  14. Distributed service architecture

  15. CATCH service implementation

  16. CATCH service implementation
     Plus: many alignments automatically created in the STITCH project

  17. Driven by “business” interests
     E.g., KB has a list of relevant KOSs in its context
     Johan Stapel

  18. Deployment (1) Vocabulary and alignment browser

  19. Deployment (2) RAMEAU (French NL) as linked data
     • Interlinked with LCSH (Library of Congress)
     • Soon to SWD (German NL)
     • Using manual mappings from the MACS project
     http://stitch.cs.vu.nl/repository

  20. Deployment (3) STITCH re-indexing prototype (ISWC 2009) • Plugged onto KB cataloguing system

  21. Lessons learnt
     • Middleware is still useful
       • To match real application requirements
       • To gather communities of practice around new usages
     • But SW tools really help building it
     • Relevance of existing models like SKOS
       • Only one part of SKOS unused (collections) and one extension required (concept scheme groups)
       • Disclaimer: we were involved in SKOS :-)
     • Interest from the Cultural Heritage domain

  22. (Changing landscape of) Issues
     • Some basic middleware functions like full-text search are now tackled by vendor-specific SPARQL extensions. We prefer it that way :-)
     • Working out the distributed architecture is difficult. Progress on federated RDF repositories can be useful
     • Versioning/changes MUST be addressed at a fine-grained level (concepts). Maybe the issue with the least mature solutions!

  23. Future work
     Already started!
     CATCHplus: continuing CATCH efforts, bringing them even closer to production
     New repository and interface

  24. Current work
     • Refinement of HTTP API
       • E.g., possibility to search for pairs of related concepts, with constraints
       • Closer to SPARQL, but still limiting complexity
     • Based on OpenLink Virtuoso
       • Disk-based implementation can handle huge datasets
       • Built-in LOD function & full-text features
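A search for pairs of related concepts with constraints, kept deliberately less expressive than SPARQL, might look like the sketch below. The parameter names, the in-memory mapping list and the result cap are assumptions chosen for illustration. The slides do not specify the actual HTTP API parameters.

```python
# Hypothetical mapping store: (source, relation, target, measure).
MAPPINGS = [
    ("vocA:cats", "exactMatch", "vocB:chats", 1.0),
    ("vocA:cats", "closeMatch", "vocB:felins", 0.7),
    ("vocA:dogs", "exactMatch", "vocB:chiens", 0.9),
]


def find_pairs(mappings, source_prefix=None, relation=None,
               min_measure=0.0, limit=100):
    """Return (source, target) pairs satisfying a fixed set of constraints.

    Only a few simple filters are offered, and the number of results is
    capped, so a client cannot submit arbitrarily complex queries.
    """
    results = []
    for source, rel, target, measure in mappings:
        if source_prefix is not None and not source.startswith(source_prefix):
            continue
        if relation is not None and rel != relation:
            continue
        if measure < min_measure:
            continue
        results.append((source, target))
        if len(results) >= limit:
            break
    return results


print(find_pairs(MAPPINGS, relation="exactMatch"))
print(find_pairs(MAPPINGS, source_prefix="vocA:cats", min_measure=0.8))
```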

  25. Current work
     • Architecture is no longer distributed, for now! Difficult conflict between requirements
       • Some clients had requirements for SPARQL
       • Federated SPARQL query is (was?) not yet mature
     • Named graphs are being experimented with
       • For representing KOS data bundles (file upload)
       • For contextualizing triples (one shortcoming of SKOS/RDF)
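The two uses of named graphs mentioned here can be illustrated with quads, i.e. triples extended with a graph name. In this sketch each uploaded file becomes one named graph, so a bundle can be dropped as a unit, and the graph name records the context in which a triple was asserted. The class, method names and URIs are invented for the example and do not reflect the Virtuoso-based implementation.

```python
class QuadStore:
    """Minimal in-memory store of (subject, predicate, object, graph) quads."""

    def __init__(self):
        self._quads = set()

    def upload_bundle(self, graph, triples):
        """Store one uploaded KOS file as a single named graph."""
        for s, p, o in triples:
            self._quads.add((s, p, o, graph))

    def drop_bundle(self, graph):
        """Remove a whole bundle by dropping its named graph."""
        self._quads = {q for q in self._quads if q[3] != graph}

    def contexts_of(self, s, p, o):
        """Graphs in which a given triple is asserted (its contexts)."""
        return sorted(g for qs, qp, qo, g in self._quads if (qs, qp, qo) == (s, p, o))


store = QuadStore()
mapping = ("vocA:cats", "skos:exactMatch", "vocB:chats")
store.upload_bundle("http://example.org/graphs/manual-upload", [mapping])
store.upload_bundle("http://example.org/graphs/automatic-upload", [mapping])

# The same triple is known in two contexts.
print(store.contexts_of(*mapping))

# Dropping one bundle leaves the other context untouched.
store.drop_bundle("http://example.org/graphs/automatic-upload")
print(store.contexts_of(*mapping))
```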

  26. Thanks! http://stitch.cs.vu.nl/repository
