Sigma ee reaping low hanging fruits in rdf based data integration
Download
1 / 21

Sigma EE: Reaping low-hanging fruits in RDF-based data integration - PowerPoint PPT Presentation


  • 107 Views
  • Uploaded on

Sigma EE: Reaping low-hanging fruits in RDF-based data integration. Richard Cyganiak I-Semantics 2010, Graz. Intro. Semantic Technologies conferences In-use Tracks Applications session D2RQ Expose contents of relational databases as RDF/SPARQL

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about ' Sigma EE: Reaping low-hanging fruits in RDF-based data integration' - ulla


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
Sigma ee reaping low hanging fruits in rdf based data integration

Sigma EE: Reaping low-hanging fruits in RDF-based data integration

Richard Cyganiak

I-Semantics 2010, Graz


Intro
Intro integration

  • Semantic Technologies conferences

    • In-use Tracks

    • Applications session

  • D2RQ

    • Expose contents of relational databases as RDF/SPARQL

    • Just a format converter; what do people use it for?


The common theme
The common theme … integration

Integration of data across the organization/project



The rdf based data integration project1
The RDF-based data integration project integration

  • Probably limited budget …

  • Otherwise would buy from SAP or Oracle



Sigma ee
Sigma EE integration

  • Originally not built for enterprise data but for web data

  • Sindice, search engine for the Web of Data

    • Microformats, RDFa, Linked Data on the Web

    • For building apps on top of data search API

    • http://sindice.com/

  • How to show the richness of all that data?

    • http://sig.ma/


sig.ma integration demo



Background
Background integration

  • The problem: How to provide uniform access to heterogeneous data sources?

    • Value-added services:

      • Search

      • Browsing

      • Recommendations of related items

      • Reporting

      • Dashboarding

      • Notifications


Solutions
Solutions? integration

  • Data Warehousing

  • Enterprise Information Integration

  • Enterprise Search

  • A middle ground in-between?


Data warehousing eii
Data Warehousing, EII integration

  • Integrate enterprise data sources into a new data source

    • Data Warehouse: materialized (new DB)

    • Enterprise Information Integration: virtual (distrib. queries)

  • Focus on data

  • Tight integration

  • High up-front cost


Enterprise search
Enterprise Search integration

  • Provides the most sought-after service (search)

  • Focus on documents

    • full-text search

  • Lower up-front cost (no schema alignment)

  • Providing value-added services on top is difficult


A middle ground
A middle ground integration

  • Start by providing access to data on a per-business-object basis without prior schema alignment

  • Services: Browsing of the catalog of objects; search

  • Align, link and reconciliate as required to enable more services, e.g., expressive queries


A middle ground1
A middle ground integration

  • No accepted term yet

    • Data Spaces?

    • Pay-as-you-go Data Integration?

    • Linked Enterprise Data?


The rdf technology stack
The RDF technology stack integration

  • A standards-based “data-first” approach

    • RDF, SPARQL, OWL – W3C standards

  • Off-the-shelf components

  • Integrates well with web data sources


The rdf bus
The “RDF Bus” integration

  • Various implementation strategies

    • ETL + One Big Triple Store with SPARQL endpoint

    • Several SPARQL endpoint (SPARQL 1.1 SERVICE feature?)

    • Linked Data style (resolvable URIs)

  • Bus details determine what services can be provided

    • Can you do high-performance SPARQL?

    • Can you do full-text search?

    • Real-time up-to-date information or significant delay?

    • Where is alignment handled?

    • Who can hook in new data sources?


Sigma ee1
Sigma EE integration

  • Services: search, browsing

  • Strengths

    • Minimal requirements for the RDF bus

    • Strong support for provenance

    • Dynamic UI

  • Bus has to provide Search and Entity descriptions

    • E.g., SPARQL endpoint with full-text search

    • E.g., Solr

    • E.g., Sindice + (part of) the Web

    • E.g., custom Java classes

    • Or multiple of the above


Architecture
Architecture integration


Sigma ui
Sigma UI integration

  • Full-text search

  • On-the-fly fuzzy merge of data sources

  • Empower user to evaluate provenance, reject and accept data sources

  • Show/hide/rearrange properties and values

  • Browse to related entities

  • Permalinks, embeddable widgets


Summary
Summary integration

  • Sigma EE: front-end for your RDF Bus

    • E.g., for your triple store

  • Off-the-shelf UI with minimum configuration

  • Available under GPL or other licenses on request

  • Running at http://sig.ma/


ad