1 / 67

The Semantic Web

The Semantic Web. A progress report and some observations Pat Hayes, IHMC. The vision of the Semantic Web. The WWW is a planet-wide system linking computers which enables people to communicate, establish links and publish content to one another.

osgood
Download Presentation

The Semantic Web

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. The Semantic Web A progress report and some observations Pat Hayes, IHMC

  2. The vision of the Semantic Web • The WWW is a planet-wide system linking computers which enables people to communicate, establish links and publish content to one another. • The SW plans to use it to do this with machine-usable content, so that software can read it, draw conclusions from it and act on it.

  3. The vision of the Semantic Web • The WWW is a planet-wide system linking computers which enables people to communicate, establish links and publish content to one another. • The SW plans to use it to do this with machine-usable content, so that software can read it, draw conclusions from it and act on it. • Possible applications include B2B, services, improved WWW access, integrated datahandling, Global Mind…

  4. The vision of the Semantic Web • The WWW is: • Moore’s law

  5. The vision of the Semantic Web • The WWW is: • Moore’s law + Optic fiber

  6. The vision of the Semantic Web • The WWW is: • Moore’s law + Optic fiber + HTTP

  7. The vision of the Semantic Web • The WWW is: • Moore’s law + Optic fiber + HTTP + HTML

  8. The vision of the Semantic Web • The WWW is: • Moore’s law + Optic fiber + HTTP + HTML (+ extra software goodies such as Javascript)

  9. The vision of the Semantic Web • The WWW is: • Moore’s law + Optic fiber + HTTP + HTML • The SW will use the first two, and rely on the third for now.

  10. The vision of the Semantic Web • The WWW is: • Moore’s law + Optic fiber + HTTP + HTML • The SW will use the first two, and rely on the third for now. • But it needs a new ‘semantic HTML’, i.e. a standard reference language for expressing content. This is where most of the effort has gone so far.

  11. Semantic markup languages • There are several candidate languages now being used or proposed: OIL DAML+OIL OWL RDF RDFS

  12. Semantic markup languages • There are several candidate languages now being used or proposed: OIL DAML+OIL OWL-DL RDF RDFS OWL-RDF

  13. Semantic markup languages • There are several candidate languages now being used or proposed: OIL Extensional, ‘layered’ DAML+OIL OWL-DL Intensional, non-wf. RDF RDFS OWL-Full

  14. W3C semantic markup languages Uniform and very simple syntactic model, processable by simple XML engines. Intensional, non-well-founded semantics. All RDF/RDFS/OWL assertions are encoded as sets of triples of form aaa RRR bbb . which means RRR(aaa, bbb); all variables are existential; all names are urirefs or literals. The rest of the family consists of semantic extensions to this basic RDF model. RDF RDFS OWL-Full

  15. W3C semantic markup languages Uniform and very simple syntactic model, processable by simple XML engines. Intensional, non-well-founded semantics. All RDF/RDFS/OWL assertions are encoded as sets of triples of form aaa RRR bbb . which means RRR(aaa, bbb); all variables are existential; all names are urirefs or literals. The rest of the family consists of semantic extensions to this basic RDF model. (There is also a very ugly XML serial syntax.) RDF RDFS OWL-Full

  16. <ex:Mary> <ownershipOntologies:had> _:ll . _:ll <rdf:type> <ex:Lamb> . _:ll <dimensionOntologies:size> <ex:Little> .

  17. W3C semantic markup languages Users are expected to define classes and use classes and properties defined by other users. The urirefs used as names constitute the ‘links’ between ontologies, eg _:xx dc:title “My Diary” . _:xx dc:author _:yy . _:yy rdf:type biocat:HumanBeing . _:yy w3:mailbox “phayes@ai.uwf.edu” . _:yy usgov:ssNumber “567881962”^^xsd:string . Many of these RDF ontologies already exist (c. 10|6 lines of RDF). RDF RDFS OWL-Full

  18. Universal resource identifiers • Links on the WWW are mostly URLs (global file address scheme), but also URNs and others. • Key SW idea is that a URI locates the ‘owner’ of any name, ie the authoritative source of information about the intended meaning. • NB, the URI is usually not the intended denotation. • The names are the links.

  19. W3C semantic markup languages RDFS has vocabulary for talking about properties (binary relations), membership in classes, subclass and subproperty relationships, eg rdf:Property rdf:type rdf:Class . rdf:Class rdf:type rdf:Class . ph:FatherOf rdfs:subPropertyOf ph:ancestorOf . Two different classes can have the same members…classes can contain themselves… RDF RDFS OWL-Full

  20. <ex:Mary> <prop:had> _:xxx . _:xxx <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> _:ll . _:ll <http://www.w3.org/2000/01/rdf-schema#subClassOf> <ex:Lamb> . _:ll <http://www.w3.org/2000/01/rdf-schema#subClassOf> <ex:Little> . <ex:Lamb> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> < http://www.w3.org/2000/01/rdf-schema#Class>

  21. <owl:Class rdf:about=“#OwnersOfOneLittleLamb”> <owl:Restriction owl:cardinality=“1”^xsd:integer> <owl:onProperty rdf:resource=“prop:had” /> <owl:someValueFrom rdf:resource=“#LittleLambs” /> </owl:Restriction> </owl:Class> <Person rdf:ID=“ex:Mary”> <prop:had rdf:value=“MarysLamb” /> </Person> < owl:IntersectionOf LittleLambs rdf:resource= <rdf:List> <owl:Restriction owl:onProperty ex:size > <owl:allValuesFrom <owl:Class owl:one of ex:Small /> /> </owl:Restriction> <Ex:Lambs> </rdf:List> />

  22. W3C semantic markup languages RDF: basic assertions (existential conjunctive binary positive logic); containers (bags, sequences, lists), XML literals, reification, … RDFS: classes, subclass, subproperty; property ranges and domains; Literals corresponding to all XML Schema datatypes (strings, numbers, dates, etc…) OWL: Notions of transitive, symmetric, functional properties; union, intersection and complement of classes; explicit class constructors; equality and inequality; classes defined by restrictions on properties. RDF RDFS OWL-Full

  23. W3C semantic markup languages Owl reasoning is much more complex than ‘bare’ RDF, yet OWL is all expressed as RDF triples. The extra complexity comes from extra OWL semantic conditions, mostly on the properties, eg. ppp rdf:type owl:SymmetricProperty . aaa ppp bbb . owl-entails bbb ppp aaa . RDF RDFS OWL-Full

  24. W3C semantic markup languages Owl reasoning is much more complex than ‘bare’ RDF, yet OWL is all expressed as RDF triples. The extra complexity comes from extra OWL semantic conditions, but can be all expressed by giving a translation from OWL/RDF into first-order logic. RDF RDFS OWL-Full

  25. Lbase as a foundation formalism Lbase translation of triples RDF triples written using RDF/RDFS/OWL vocabularies RDF axioms RDFS axioms OWL-Full axioms

  26. (A subset of CL adapted for SW use) Lbase as a foundation formalism Lbase translation of triples RDF triples written using RDF/RDFS/OWL vocabularies RDF axioms RDFS axioms OWL-Full axioms

  27. OIL Extensional, ‘layered’ DAML+OIL OWL-DL Intensional, non-wf. RDF RDFS OWL-Full

  28. Same syntactic freedom as RDF OWL-Full OWL-Lite Restricted vocabulary Allows frame-like notation OWL-DL Restricted syntactic constructions: Individual/literal/class/property vocabularies separated No classes of classes, properties of properties, etc., Extensional; need to distinguish OWL-DL from RDFS categories.

  29. State of play Final RDF/RDFS specs now being produced (published about now) OWL being finalized now, published in next few weeks. See W3C website for details DAML and OIL deployed, esp. by DARPA intelligence community and DAML-S.

  30. SCL initiative is a ‘fast-track’ effort to define a better Lbase = subset of CL which is adapted to SW uses and integrated with RDF/RDFS/OWL . A small ad-hoc international working group has been formed and we plan to have a draft standard proposal written by July 2003.

  31. SCL initiative is a ‘fast-track’ effort to define a better Lbase = subset of CL which is adapted to SW uses and integrated with RDF/RDFS/OWL . A small ad-hoc international working group has been formed and we plan to have a draft standard proposal written by July 2003. Watch This Space…..

  32. How is the SW going to work? OK, so you put some machine-readable stuff on your website. Now what?

  33. How is it going to work? OK, so you put some machine-readable stuff on your website. Now what? Hopefully, someone is going to do something useful with it.

  34. How is the SW going to work? OK, so you put some machine-readable stuff on your website. Now what? Hopefully, someone is going to do something useful with it. Such as put you in touch with customers more effectively, or find your website more efficiently, or draw some useful conclusions.

  35. How is the SW going to work? OK, so you put some machine-readable stuff on your website. Now what? Hopefully, someone is going to do something useful with it. Such as put you in touch with customers more effectively, or find your website more efficiently, or draw some useful conclusions. All of these assume some kind of collusion between the publisher and the user, but they also assume a detachment of purpose. In general, the writer of the content does not know what the information is going to be used for.

  36. Transmitting content The writer of the content does not know what the information is going to be used for. What can the writer assume about the way the information is used? No more than is in the spec, in general. But logical semantics only supplies truth-conditions; and those provide only a very minimal constraint upon use, even with the strongest possible assumptions.

  37. Transmitting content What content is in fact transmitted? Idea of “social meaning” is central, but new for AI/KR Eg. A: gobshite rdf:type rdfs:Class rdf:comment “A gobshite is a contemptible person who habitually tells lies.” B:Irish rdfs:subClassOf A#gobshite . C:http://www.coginst.uwf.edu/~phayes rdf:type B#Irish rdfs-entails: http://www.coginst.uwf.edu/~phayes rdf:type A#gobshite .

  38. Transmitting content Logical semantics only supplies truth-conditions; and those provide only a very minimal constraint upon use, even with the strongest possible assumptions. And we cannot even make the strongest assumptions, since we cannot even assume a shared meaning when software agents are involved, since they have access only to the surface forms.

  39. Is this the right thing to be working on?

  40. Is this the right thing to be working on? • Moore’s law + Optic fiber + HTTP + HTML

  41. Is this the right thing to be working on? • Moore’s law + Optic fiber + HTTP + HTML • So far we have been focusing on the ‘semantic HTML’ based on XML. But what we need also is a ‘semantic HTTP’ to support negotiation of meaning and content.

  42. We cannot even assume a shared meaning <ex:Mary> <prop:age> “10” . What does this literal mean? Seems obvious….

  43. We cannot even assume a shared meaning <ex:Mary> <prop:age> “10” . It means the number ten. It means the character string ‘10’. It means both the number and the string. It means either the number or the string. It doesn’t mean anything unless associated with a datatype, and then what it means depends on the datatype.

  44. It means the number ten. It means the character string ‘10’. It means both the number and the string. It means either the number or the string. e. It doesn’t mean anything unless associated with a datatype, and then what it means depends on the datatype. Then it would be impossible to represent property values which were strings or binary numbers. Then the range of the property would be a set of pairs, and there is no way to say that in RDF Then the range of the property wouldn’t be well-defined. Then two identical literals might mean different things, so one could not identify them.

  45. It means the number ten. It means the character string ‘10’. It means both the number and the string. It means either the number or the string. e. It doesn’t mean anything unless associated with a datatype, and then what it means depends on the datatype. Then it would be impossible to represent property values which were strings or binary numbers. Then the range of the property would be a set of pairs, and there is no way to say that in RDF Then the range of the property wouldn’t be well-defined. Then two identical literals might mean different things, so one could not identify them.

  46. b. It means the character string ‘10’. <ex:Mary> <prop:age> “10” .

  47. b. It means the character string ‘10’. <ex:Mary> <prop:age> “10” . <ex:Mary> <prop:age> “10”^^<xsd:number> .

  48. The scary part of this story is that it took a group of <10 reasonably intelligent, dedicated people more than seven months intensive effort to get to this point, and nobody is really happy with the result. (Is the chandelier in the room or part of the room?)

  49. Tougher case: different universes of discourse. What is the complement of a class? Eg what is in the class of US non-citizens? What is the range of a quantifier? When integrating information from various sources, we have to assume that the quantifiers range over (at least) the union of the universes assumed by the different sources. Many data archives and sources are built assuming a restricted universe. We need universe-protection mechanisms. owl:Class vs. rdfs:Class in OWL-DL

More Related