
Sigma EE: Reaping low-hanging fruits in RDF-based data integration



Richard Cyganiak

I-Semantics 2010, Graz


Intro

  • Semantic Technologies conferences

    • In-use Tracks

    • Applications session

  • D2RQ

    • Expose contents of relational databases as RDF/SPARQL

    • Just a format converter; what do people use it for?
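
To make the "format converter" idea concrete, here is a minimal sketch of turning relational rows into RDF triples. It is not D2RQ's mapping language or API: the base URI, the table, and the `rows_to_ntriples` helper are hypothetical, and literal escaping is simplified (only backslashes and double quotes are handled).

```python
import sqlite3

BASE = "http://example.org/"  # hypothetical base URI, not from the slides


def rows_to_ntriples(conn, table, key):
    """Yield one N-Triples line per non-NULL, non-key cell of `table`."""
    cur = conn.execute(f"SELECT * FROM {table}")
    columns = [d[0] for d in cur.description]
    for row in cur:
        record = dict(zip(columns, row))
        # One subject URI per row, built from the primary key.
        subject = f"<{BASE}{table}/{record[key]}>"
        for column, value in record.items():
            if column == key or value is None:
                continue
            # One predicate URI per column.
            predicate = f"<{BASE}{table}#{column}>"
            literal = str(value).replace("\\", "\\\\").replace('"', '\\"')
            yield f'{subject} {predicate} "{literal}" .'


# Hypothetical example table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employee (id INTEGER PRIMARY KEY, name TEXT, dept TEXT)")
conn.execute("INSERT INTO employee VALUES (1, 'Alice', 'Sales')")
triples = list(rows_to_ntriples(conn, "employee", "id"))
for line in triples:
    print(line)
```

Each row becomes a subject, each column a predicate, and each cell a literal; a real mapping would also turn foreign keys into links between subjects.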


The common theme …

Integration of data across the organization/project


The RDF-based data integration project



  • Probably limited budget …

  • Otherwise would buy from SAP or Oracle


Where next after “Hello World”?


Sigma EE

  • Originally not built for enterprise data but for web data

  • Sindice, search engine for the Web of Data

    • Microformats, RDFa, Linked Data on the Web

    • For building apps on top of a data search API

    • http://sindice.com/

  • How to show the richness of all that data?

    • http://sig.ma/


sig.ma demo


Off-the-shelf UI for the RDF Bus


Background

  • The problem: How to provide uniform access to heterogeneous data sources?

    • Value-added services:

      • Search

      • Browsing

      • Recommendations of related items

      • Reporting

      • Dashboarding

      • Notifications


Solutions?

  • Data Warehousing

  • Enterprise Information Integration

  • Enterprise Search

  • A middle ground in-between?


Data Warehousing, EII

  • Integrate enterprise data sources into a new data source

    • Data Warehouse: materialized (new DB)

    • Enterprise Information Integration: virtual (distributed queries)

  • Focus on data

  • Tight integration

  • High up-front cost
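
The materialized/virtual distinction above can be illustrated with a small sketch. The class names and the toy source are hypothetical and stand in for real databases; the point is only where the data lives at query time.

```python
class MaterializedView:
    """Warehouse style: copy all source rows once into a new store."""

    def __init__(self, sources):
        self.rows = [row for fetch in sources for row in fetch()]

    def query(self, predicate):
        return [row for row in self.rows if predicate(row)]


class VirtualView:
    """EII style: keep no copy, ask every source at query time."""

    def __init__(self, sources):
        self.sources = list(sources)

    def query(self, predicate):
        return [row for fetch in self.sources for row in fetch() if predicate(row)]


# Hypothetical source: a list standing in for a CRM database.
crm = [{"name": "Alice"}]
sources = [lambda: list(crm)]

warehouse = MaterializedView(sources)
eii = VirtualView(sources)

crm.append({"name": "Bob"})  # the source changes after integration

everything = lambda row: True
print(len(warehouse.query(everything)))  # the copy does not see Bob
print(len(eii.query(everything)))        # the live query does
```

The materialized copy answers without touching the sources but goes stale until it is reloaded; the virtual view is always current but depends on every source at query time.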


Enterprise Search

  • Provides the most sought-after service (search)

  • Focus on documents

    • full-text search

  • Lower up-front cost (no schema alignment)

  • Providing value-added services on top is difficult


A middle ground

  • Start by providing access to data on a per-business-object basis without prior schema alignment

  • Services: Browsing of the catalog of objects; search

  • Align, link and reconcile as required to enable more services, e.g., expressive queries


A middle ground

  • No accepted term yet

    • Data Spaces?

    • Pay-as-you-go Data Integration?

    • Linked Enterprise Data?


The RDF technology stack

  • A standards-based “data-first” approach

    • RDF, SPARQL, OWL – W3C standards

  • Off-the-shelf components

  • Integrates well with web data sources


The “RDF Bus”

  • Various implementation strategies

    • ETL + One Big Triple Store with SPARQL endpoint

    • Several SPARQL endpoints (SPARQL 1.1 SERVICE feature?)

    • Linked Data style (resolvable URIs)

  • Bus details determine what services can be provided

    • Can you do high-performance SPARQL?

    • Can you do full-text search?

    • Real-time up-to-date information or significant delay?

    • Where is alignment handled?

    • Who can hook in new data sources?
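
As an illustration of the "several SPARQL endpoints" strategy, the sketch below builds a SPARQL 1.1 query that uses the SERVICE keyword to ask each endpoint for labels. The endpoint URLs and the `federated_label_query` helper are hypothetical, and the code only constructs the query text; sending it to a federation-capable endpoint is left out.

```python
def federated_label_query(endpoints):
    """Build a SPARQL 1.1 query that asks every endpoint for rdfs:label values."""
    blocks = [
        f"  {{ SERVICE <{url}> {{ ?s rdfs:label ?label }} }}"
        for url in endpoints
    ]
    body = "\n  UNION\n".join(blocks)
    return (
        "PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>\n"
        "SELECT ?s ?label WHERE {\n"
        f"{body}\n"
        "}"
    )


# Hypothetical endpoints for two enterprise systems.
query = federated_label_query([
    "http://crm.example.org/sparql",
    "http://erp.example.org/sparql",
])
print(query)
```

With this strategy each source keeps its own store, so data stays current, but query performance depends on the slowest endpoint.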


Sigma EE

  • Services: search, browsing

  • Strengths

    • Minimal requirements for the RDF bus

    • Strong support for provenance

    • Dynamic UI

  • Bus has to provide Search and Entity descriptions

    • E.g., SPARQL endpoint with full-text search

    • E.g., Solr

    • E.g., Sindice + (part of) the Web

    • E.g., custom Java classes

    • Or multiple of the above
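
The two requirements above, search and entity descriptions, can be pictured as a two-method interface. The sketch below is an assumption for illustration, not Sigma EE's actual API: `BusBackend`, `InMemoryBackend`, and `CombinedBackend` are hypothetical names, and the in-memory store stands in for a triple store, Solr, or Sindice.

```python
from typing import Protocol

Triple = tuple[str, str, str]  # (subject, predicate, object)


class BusBackend(Protocol):
    """Hypothetical minimal contract a bus would have to fulfil."""

    def search(self, text: str) -> list[str]: ...
    def describe(self, uri: str) -> list[Triple]: ...


class InMemoryBackend:
    """Toy backend over a list of triples."""

    def __init__(self, triples):
        self.triples = list(triples)

    def search(self, text):
        # Naive full-text search: substring match on object values.
        needle = text.lower()
        hits = []
        for subject, _, obj in self.triples:
            if needle in obj.lower() and subject not in hits:
                hits.append(subject)
        return hits

    def describe(self, uri):
        return [t for t in self.triples if t[0] == uri]


class CombinedBackend:
    """Several backends behind the same two methods."""

    def __init__(self, backends):
        self.backends = list(backends)

    def search(self, text):
        hits = []
        for backend in self.backends:
            for uri in backend.search(text):
                if uri not in hits:
                    hits.append(uri)
        return hits

    def describe(self, uri):
        return [t for backend in self.backends for t in backend.describe(uri)]


crm = InMemoryBackend([("urn:person:1", "name", "Alice Smith")])
hr = InMemoryBackend([("urn:person:1", "office", "Graz")])
bus = CombinedBackend([crm, hr])
print(bus.search("alice"))
print(bus.describe("urn:person:1"))
```

Keeping the contract this small is what would let very different sources sit behind the same UI.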


Architecture


Sigma UI

  • Full-text search

  • On-the-fly fuzzy merge of data sources

  • Empower user to evaluate provenance, reject and accept data sources

  • Show/hide/rearrange properties and values

  • Browse to related entities

  • Permalinks, embeddable widgets
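
The slides do not describe how the fuzzy merge works, so the following is only a sketch of one possible approach: property names are normalized so that near-identical labels from different sources fall together, every value remembers which sources supplied it (provenance), and sources the user has rejected are skipped. All names and data are hypothetical.

```python
import re
from collections import defaultdict


def normalize(label):
    """Crude fuzzy key: lowercase and drop everything except letters and digits."""
    return re.sub(r"[^a-z0-9]", "", label.lower())


def merge_sources(descriptions, rejected=frozenset()):
    """Merge {source: [(property, value), ...]} into {property: {value: [sources]}}."""
    merged = defaultdict(lambda: defaultdict(list))
    for source, pairs in descriptions.items():
        if source in rejected:
            continue  # the user rejected this source
        for prop, value in pairs:
            merged[normalize(prop)][value.strip()].append(source)
    return {prop: dict(values) for prop, values in merged.items()}


# Hypothetical descriptions of the same entity from two sources.
descriptions = {
    "crm": [("Full Name", "Alice Smith"), ("e-mail", "alice@example.org")],
    "hr": [("full_name", "Alice Smith"), ("Email", "a.smith@example.org")],
}

merged = merge_sources(descriptions)
without_hr = merge_sources(descriptions, rejected={"hr"})
print(merged)
print(without_hr)
```

Because each value carries its list of sources, rejecting a source only requires recomputing the merge without it.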


Summary

  • Sigma EE: front-end for your RDF Bus

    • E.g., for your triple store

  • Off-the-shelf UI with minimum configuration

  • Available under GPL or other licenses on request

  • Running at http://sig.ma/

