1 / 55

Semantic Web and Digital Library Management

Semantic Web and Digital Library Management. Ludovic Deravet Software Architect @ I.R.I.S. S&E. using Fedora-Commons. Semantic Web and Digital Library Management. PART 1: INTRODUCTION. Evolution of the Web. WEBOS. volume of data. SEMANTIC WEB. WEB. SPARQL. RDF.

melina
Download Presentation

Semantic Web and Digital Library Management

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Semantic Web and Digital Library Management Ludovic Deravet Software Architect @ I.R.I.S.S&E using Fedora-Commons

  2. Semantic Web and Digital Library Management PART 1: INTRODUCTION

  3. Evolution of the Web WEBOS volume of data SEMANTIC WEB WEB SPARQL RDF evolution of web technologies Flash OWL Distributed Search XML RSS DESKTOP Java Intelligent personal agents Semantic databases HTTP HTML Semantic Search MacOS SQL SGML SaaS Social Networking Websites Windows Wikis Lightweight collaboration Weblogs Email Groupware FTP File Servers Keyword Search File Systems Databases 1990-2000 2000-2010 1980-1990 2010-2020

  4. Digital Library Management

  5. Managing and Searching Information Search Result(s)

  6. What is Semantic Web?

  7. Semantic Web Foundations http://www.irislink.com/#company SELECT ?subject ?label WHERE { ?subject rdfs:subClassOf ?object . OPTIONAL { ?subject rdfs:label ?label } I.R.I.S. D.M. experts RDF triples RDFS OWL <rdf:RDF … xmlns:contact=http://.../contact#> <contact:Personrdf:about=http://.../contact#me> <contact:fullName>…</contact:fullName> <contact:mailBoxrdf:resource=mailto:xxx@yyy/> </contact:Person </rdf:RDF>

  8. How does it look like? (example)

  9. Fedora-Commons Features Fedora Repository Modules Dissemination Validation Security Resource Index Storage Management Registry CMA RDF Files RDBMS

  10. What can you do with Fedora-Commons?

  11. How can we help you? I.R.I.S. S&E – International Organisations

  12. Questions?

  13. Semantic Web and Digital Library Management PART 2: ADVANCED

  14. What topics?

  15. Semantic Web and Digital Library Management DIGITAL LIBRARY

  16. What is Digital Library Management? A solution to meet the needs for: • Bulk load of digital assets • Cataloguing • Editing • Storing • Searching

  17. Evolution of the Web volume of data WebOS Semantic Web SPARQL RDF evolution of web technologies WWW Flash OWL Distributed Search XML RSS Java Intelligent personal agents Semantic databases HTTP HTML Semantic Search MacOS SQL SGML SaaS Social Networking Desktop Websites Windows Wikis Lightweight collaboration Weblogs Email Groupware FTP File Servers Keyword Search File Systems Databases 1990-2000 WEB 1.0 2000-2010 WEB 2.0 1980-1990 PC ERA 2010-2020 WEB 3.0

  18. Problem – Searching and Managing Information • Synonyms • have a different spelling but have the same (or quite) meaning • Homonyms • sound alike but have different meaning • most of the time, they have a different spelling • Languages • might require lot of maintenance • not always the same level of quality in each language • Parametric Search • It’s difficult to find things, especially something specific • Too few = too many search results • Too much = no search result

  19. Problem – Searching and Managing Information • Time spent • users spend too much time searching for what they are looking for • Data reusability • Limited ability to reuse data • Managing the information is complex • Within the same company, each department often manages its own information • Each department might have its own way of solving the problem • Try to use technologies to solve the original problem (e.g. MDM) • High volume of information requires human management of the information • Using hierarchical solutions by classifying information • Using horizontal solutions with tags

  20. Semantic Web and Digital Library Management SEMANTIC WEB

  21. What is Semantic Web? The idea behind is “quite” simple: • electronic information will become unambiguous • data will become findable • data will be reusable • data will be interoperable • systems will be flexible • real time information

  22. Foundations of Semantic Web • URIs for everything • Triples: <subject> <predicate> <object> • Models and technologies (e.g. RDF) • Data exchange formats (e.g. RDF/XML, N-Triples) • Notations (e.g. RDFS, OWL) • SPARQL

  23. Foundations of Semantic Web(example) Albert is the father of Philippe SUBJECT PREDICATE OBJECT http://www.belgium.be/person albert/profile.html http://www.belgium.be/person philippe/profile.html http://www.belgium.be/rdf/ relationship#fatherof in RDF notation <rdf:RDFxmlns:be=http://www.belgium.be/rdf/relationship#>

  24. Foundations of Semantic Web(example) be:Kingrdfs:subClassOfbe:Person be:Princerdfs:subClassOfbe:Person RDFS dc:subjectrdf:typerdf:Property PREFIX be: <http://www.belgium.be/ontology> SELECT ?firstname ?lastname WHERE { ?person a be:Person ?person be:firstname ?person be:lastname } SPARQL

  25. How does it look like? (example)

  26. Semantic Web and Digital Library Management FEDORA-COMMONS OVERVIEW

  27. What is Fedora-Commons?

  28. What is Fedora-Commons?

  29. Semantic Web and Digital Library Management FEDORA-COMMONS IN DETAILS

  30. Fedora-Commons Features Fedora Repository Modules Dissemination Validation Security Resource Index Storage Management Registry CMA RDF Files RDBMS

  31. Semantic Web and Digital Library Management Fedora Repository Modules Dissemination Validation Security RI Store Management Registry CMA

  32. CMA – Content Model Architecture Content Model Service Definition fedora-model: hasService fedora-model: hasModel fedora-model: isContractorOf fedora-model: isDeploymentOf Service Deployment Data (Digital Object)

  33. Digital Object

  34. Digital Objects Relationships - Example ns:hasPhotoLocation Windows Operating System Address dc:title Rights ns:isRunningOn ns:hasAddress ns:hasLicense ns:hasText Document Server IRIS Corporate ns:hasLogo ns:hasName ns:supportFormats ns:hasCompression Compression Documents iHQC dc:title I.R.I.S. Group ns:hasLogo

  35. Semantic Web and Digital Library Management Fedora Repository Modules Dissemination Validation Security RI Store Management Registry CMA

  36. Dissemination (Example) Title: The ‘Great Migrations’ Owner: NGC Date: 06/11/2010 1) http://website/pid/pdf THUMBNAIL 2) Calls service with PID and format WSDL Transformation Service VIDEO XML 3) Returns PDF representation ( dissemination) of the requested resource High Speed Videos Streaming platform Archive notice

  37. Semantic Web and Digital Library Management Fedora Repository Modules Dissemination Validation Security RI Store Management Registry CMA

  38. Stores Fedora Repository Modules Storage Default Store File-System Amazon Scalable (no limitation of files) Reliable (SLA 99.99%) No file-system limitation Cost Management (pay for what you use) iRODS is handling the digital objects Fedora-Commons is handling the metadata / management Distributed Management System Stores can be located at different places (geographically) StorageTek 5800 System Distributed Management Storage Manages datasets stored in a wide range of data stores (file-system, network, databases…) Large datasets

  39. Semantic Web and Digital Library Management Fedora Repository Modules Dissemination Validation Security RI Store Management Registry CMA

  40. Resource Index Fedora Repository Modules RI Triples Store Mulgara

  41. Resource Index (RI) - Example Stephen Hawking’s Universe Library dc:title Video L1 dc:language English V1 ns:isMemberOf dc:description Explores the greatest mysteries of the cosmos. Category dc:author C1 Stephen Hawking ns:isCategoryOf ns:isCollectionOf The Story of Everything ns:isCollectionOf ns:isCollectionOf dc:title Episode E3 Episode Episode ns:type E1 E2 Science ns:format dc:title dc:title Blue-Ray ns:format Time Travel Blue-Ray ns:format Aliens Blue-Ray

  42. Resource Index (cont’d) - Triples

  43. Resource Index (cont’d) ITQL Queries (http://docs.mulgara.org/itqlcommands/index.html)

  44. Semantic Web and Digital Library Management Fedora Repository Modules Dissemination Validation Security RI Store Management Registry CMA

  45. Validation • Applied when managing digital objects: • foxml 1.0 • foxml 1.1 • mets 1.0 • mets 1.1 • atom • Use schematron • rule-based validation language • structural language expressed in XML <sch:pattern name="Preliminary Object Checks" id="preliminary"> <sch:rule context="foxml:datastream[@ID='AUDIT']"> <sch:assert test="count(foxml:datastreamVersion) = 1">The AUDIT Datastream can only have ONE version since it is a non-versionabledatastream. (foxml: datastreamVersion)</sch:assert> </sch:rule> </sch:pattern>

  46. Security • Legacy Authentication and Authorization • Authorization: XACML (from Sun) • Authentication: using server filters • FeSL • will replace XACML in a future release of Fedora-Commons • based on JAAS (Java Authentication and Authorization Service

  47. Management • Primary APIs • REST API (HTTP) • API-A and API-M (SOAP) • Secondary APIs • Resource Index with iTQL and SPARQL (HTTP) • OAI-PMH for metadata harvesting across repositories (HTTP) • Third-Party APIs • MediaShelf with a Java client APIs

  48. Semantic Web and Digital Library Management WHO’S GONE FEDORA-COMMONS and USER COMMUNITY

  49. Users Community

  50. Users Community

More Related