1 / 54

By Helen Howell

Ontology Interoperation. By Helen Howell. Outline. What is interoperation? Why? Problems Approach to Interoperations ONION Cyc. What is Interoperation?. Interoperation may be defined as a loose coupling across information sources, semantic metadata descriptions and ontologies.

fergus
Download Presentation

By Helen Howell

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Ontology Interoperation By Helen Howell

  2. Outline • What is interoperation? • Why? • Problems • Approach to Interoperations • ONION • Cyc

  3. What is Interoperation? Interoperation may be defined as a loose coupling across information sources, semantic metadata descriptions and ontologies Semantic Bridge Ontology Ontology

  4. Why? • Share information

  5. Domain Differences • Naming attribute items differently • payroll - EMP; personnel - PEOPLE • Scope • payroll may include suport for student benefits for employee’s children; personnel - employees only

  6. Domain Differences (cont.) • Encoding • ssn with and without hypens • Attribute scopes-Semantic hetergeneity • hot has different meaning in weather domain than in truck-engine domain

  7. Integration Problems • Data that is available is subject to implicit assumptions which means that it cannot be reused out of context. • Data is often duplicated and labeled under different terms. e.g., Address verses Addr

  8. Approaches • Direct Translation • Single Shared Ontology • Multiple Shared Ontology

  9. Direct Translation For each ontology a set of translating functions is provided to allow mappings with the other ontologies.

  10. Direct Translation • Only feasible if a few ontologies • OBSERVER - mappings between a term in a source ontology and its synonym in a destination ontology. • Set up relationships e.g., Volcano effects Environment

  11. Single Shared Ontology Often standards are not convenient to use since they have to be suitable for all potential uses. So a single subset is formed as a standard

  12. Multiple Shared Ontologies Locates shared knowledge in multiple but smaller shared ontologies

  13. Multiple Shared Ontologies • Referred to as Resource Clustering or Ontology Clustering • Approach more flexible and scalable • No longer have to commit to a comprehensive ontology

  14. Multiple Ontologies • Normally organized into a hierarchy • Top level provides general definitions share by all resources • Lower levels provide definitions of concepts that extend and characterize upper levels

  15. Mediation-Based Information Systems • Mediators provide intelligent middleware services - matching resources to applications - resources are autonomous and heterogeneous

  16. Mediation Based Information Systems • Identify most relevant resources for user’s needs • Retrieve the relevant data • Translate it to a common representation e.g., Nevada, NV, NV USA

  17. Interoperation of Information Sources via Articulation of Ontologies Prasenjit Mitra, Gio Wiederhold Stanford University Supported by AFOSR- New World Vistas Program

  18. Interoperation of Information Sources • Compose information • Multiple independent, heterogeneous sources • Reliability, scalability • Semantic heterogeneity • - same term different semantics • - different term same semantics

  19. Onion System

  20. Articulation Generator Thesaurus Ont1 Ont2 Context-based Word Relator Phrase Relator Driver Semantic Network Structural Matcher OntA Human Expert

  21. Preliminaries: Ontology • Ontology - hierarchy of terms and specification of their properties. • Modeled as a directed labeled graph + set of rules. • Ont = ( V, E, R) • V - set of nodes(concepts) • E - set of edges(properties) • R - set of rules involving V,E

  22. Articulation Rules • Articulation rules - logic-based rules that relate concepts in two ontologies: • Binary Relationships • (O1.Car SubClassOf O2.Vehicle) • (O1.Buyer Equ O2.Owner) • (O2.LuxuryCar SubClassOf O1.Car)

  23. Example SubClass Attribute Vehicle Car SubClass SubClass Attribute Owner Instance LuxuryCar Buyer Equ Instance Instance List_Price Buyer VID Owner Retail_ Price VIN H123 H123 $35k Name Address Amount Unit Name Addr $ 40k PMitra CA CA PMitra

  24. Articulation Rules (contd.) • Horn Clauses • - (O1.Car O1.Instance X), • (X O1.Price Y), (Y > $30000) • => (O2.LuxuryCar Instance X) • - (V O2.Retail_Price P), (C O1.List_Price L), • (L Unit U), (L Amount A), (V Equ C) • => (P Equ concat(U,A))

  25. Contributions • Articulation Generation Toolkit • - produce translation rules semi-automatically • - a library of reusable heuristic methods • - a GUI to display ontologies and • interact with the expert • Ontology Algebra • - query rewriting and planning

  26. Articulation Generation Methods • Non-iterative Methods • - Lexical Matcher • - Thesaurus-based Matcher • - Corpus-based Matcher • - Instance-based Matcher • Iterative Methods • - Structural Matcher • - Inference-based Matcher

  27. Lexical Methods • Preprocessing rules. • -Expert-generated seed rules. • e.g., (O1.List_Price Equ O2.Retail_Price) • -Context-based preprocessing directives. • e.g., (O1.UK_Govt Equ O2.US_Govt) • -Stop-word Removal & Stemming • -Word match (full or partial) • -Phrase match • e.g., (O1.Ministry_Of_Defence Equ • O2.Defense_Ministry) 0.6

  28. Thesaurus-based methods • Consult a dictionary/thesaurus to find synonyms, related words • Generate a similarity measure or relatedness measure • - words that have similar words in their definitions are similar • Get more semantically meaningful relationships from WordNet (syn, hyper)

  29. Based on processing headwords ý definitions Candidate Match Repository Term linkages automatically extracted from 1912 Webster’s dictionary * * free, other sources . being processed. Notice presence of 2 domains: chemistry, transport

  30. Corpus-based • Collect a set of text documents preferably from same domain • - search using keywords in google • Build a context vector (1000-character neighbourhood) for each word • Compute word-pair similarity based on the cosine of the vectors • Use word-pair similarity to find similarity among labels of nodes/edges

  31. Structural Methods • Uses results of lexical match • If x% of parent nodes match & y% of children nodes match • Special relations (AttributeOf) match

  32. Tools to create articulations Graph matcher for Articulation- creating Expert Vehicle ontology Transport ontology Suggestions for articulations

  33. continue from initial point • Also suggest similar terms • for further articulation: • by spelling similarity, • by graph position • by term match repository • Expert response: • 1. Okay • 2. False • 3. Irrelevant • to this articulation • All results are recorded • Okay’s are converted into articulation rules

  34. An Ontology Algebra Operations can be composed Operations can be rearranged Alternate arrangements can be evaluated Optimization is enabled The record of past operations can be kept and reused when sources change

  35. Intersection create a subset ontology • keep sharable entries • Union create a joint ontology • merge entries • Difference create a distinct ontology • remove shared entries Binary Operators A knowledge-based algebra for ontologies The Articulation Function (ArtGen), given two ontologies, supplies articulation rules between them.

  36. Intersection: Definition • O1 = (V1, E1, R1) O2 = (V2, E2, R2) • OI = ( O1 IntArtGen O2) = (VI, EI, RI) • Arules = ArtGen( O1, O2 ) • VI = Nodes( Arules ) • EI = Edges( Arules ) + Edges(E1, VI.V1) + Edges( E2, VI.V2) • RI = Arules + Rules( R1, VI.V1) + Rules( R2, VI.V2)

  37. Intersection: Example SubClass LuxuryCar Car Equ LuxuryTax InexpCar MSRP Price OI O2 O1 • ARules = { (O1.Car SubClass O2.LuxuryCar), (O1.MSRP Equ O2.Price)} • NI = ( O1.Car, O1.MSRP, O2.LuxuryCar, O2.Price ) • EI = Edges(ARules) + {(O1.Car Attribute O1.MSRP), (O2.LuxuryCar Attribute Price)}

  38. Intersection: Properties • Commutative? • OI12=(O1 IntArtGen O2) = (O2 IntArtGen O1)=OI21 • VI12 = VI21, EI12 = EI21, RI12=RI21 • ARules12 = ARules21 • ArtGen(O1, O2) = ArtGen(O2, O1) • IntArtGen is commutative iff ArtGen is commutative

  39. Semantically Commutative • Example: • ArtGen(O1,O2) : ( O1.Car SubClassOf O2.Vehicle) • ArtGen(O2,O1) : ( O2.Vehicle SuperClassOf O1.Car) • Defn: ArtGen is Semantically Commutative iff ArtGen(O1,O2) <=> ArtGen(O2,O1) • Operands to intersection can be rearranged if ArtGen is semantically commutative

  40. Associativity Example • Example: • ArtRules(O1, O3) : (O1.Car SubClassOf O3.Vehicle) • ArtRules(O2, O3) : (O2.Truck SubClassOf O3.Vehicle) • ArtRules(O1, O2) : null • Intersection is not associative!

  41. Associativity • (O1.O2).O3 = O1.(O2.O3) • iff ArtGen is consistent and transitively connective • consistent - given two nodes it generates the same relationship irrespective of relationships between other nodes

  42. Associativity (contd.) • ArtGen relates A and B: • (A Rel B) = (A R1 B) or (B R2 A) • Transitively connective: • If ArtGen generates (A Rel B), (B Rel C) • then it also generates (A Rel C’) • where A e O1, B e O2, C,C’ e O3

  43. Using the Articulation Rules • Q(X) :- (LuxuryCar Instance X), • (X Owner Y), (Y Addr `CA’) • Translated using Articulation Rules • Q(X) :- (Car Instance X),(X List_Price Y) • (Y Unit ‘$’),(Y Amount A), • (A > 30000), (X Buyer Z), • (Z Address ‘CA’)

  44. Conclusion • Introduced an Ontology Interoperation System. • Interoperation based on articulations that bridge the semantic gap. • Graph-oriented data model, logical rules • Founded on Ontology Algebra • Thanks to Stefan Decker, Alex Carobus, Jan Jannink, Sergey Melnik, Shrish Agarwal

  45. DAMLOntologyInteroperationUsing Cyc Steve Reed Cycorp

  46. DAML Excitement at Cycorp Promoting DAML ontology interoperability through Cyc, as a reference ontology. Cyc now imports and exports DAML, paraphrases and reasons about it. Cyc lexical tools assist the mapping of imported DAML ontologies into Cyc. • Both UDDI categorization schemes imported as DAML into Cyc: • UNSPSC (via KSL) and NAICS Cycorp is featuring DAML in it’s upcoming release of OpenCyc (this fall).

  47. DAML Challenges at Cycorp How to automate mapping and translation of DAML ontologies. How to get the user community to help map numerous DAML terms -- interest growing at OpenCyc. Importation of large DAML schema is daunting but underway, WordNet, OpenDirectory, UNSPSC, NAICS Provide an inference engine for the Semantic Web - subsumption, classification, constraints, well-formed statement validation, diagnostics, justifications. What web services interface? SOAP likely.

  48. User UNSPSC Ontology NAICS Ontology Cyc Ontology UDDI Web Services DAML Ontology Interoperation Search for: “waterproof plywood” Return: marine plywood by Hardwood Int. Mapping 1: UNSPSC to Cyc Mapping 2: NAICS to Cyc Search for: UNSPSC/7311 Hardwood Int. … marine plywood …

  49. Ontology Mapping DAML Import From Doerr, Semantic Problems of Thesaurus Mapping

  50. Ontology Mapping Cyc reference DAML Import From Doerr, Semantic Problems of Thesaurus Mapping

More Related