The Semantic Web Professor James Hendler http://www.cs.umd.edu/~hendler http://owl.mindswap.org
MIND SWAP
• Maryland Information and Network Dynamics Laboratory, Semantic Web and Agents Project
• J. Hendler
• B. Parsia
• Jennifer Golbeck
• Aditya Kalyanpur
• Grecia Lapizco-Encinas
• Katy Newton
• Evren Sirin
• Ronald Alford
• Ross Baker
• Amy Alford
• Matt Westhoff
• Michael Grove
• Ronald Reck
• Corporate Research Partners:
  • Fujitsu Laboratory of America, College Park
  • Lockheed Martin Advanced Technology Laboratories
  • NTT Corp
  • SAIC Corp.
• http://owl.mindswap.org: OWL-powered Semantic Web page (under development)
• http://www.mindswap.org: Traditional web page
The Semantic Wave (Berners-Lee, 03) UMCP has been a key contributor
XML is NOT semantics <photo> <subject> http://www.w3.org/~timbl </subject> <name> Tim Berners-Lee </name> … </photo>
XML is NOT semantics XML Schema is DOCUMENT checking: photo has multiple subject fields, photo has one physical location, etc. <photo> <subject> http://www.w3.org/~timbl </subject> <name> Tim Berners-Lee </name> … </photo>
XML is NOT semantics XML Schema is DOCUMENT checking: photo has multiple subject fields, photo has one physical location, etc. WHICH SAYS NOTHING ABOUT TALKS, SUBJECTS, PEOPLE, EVENTS, etc. <photo> <subject> http://www.w3.org/~timbl </subject> <name> Tim Berners-Lee </name> … </photo>
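A minimal sketch of the point above, using Python's standard library. The slide's fragment is reproduced with the elided part dropped, and the "exactly one name" rule is an assumed stand-in for the kind of constraint a schema expresses; `check_photo` and `NONSENSE_XML` are hypothetical names, not part of the talk.

```python
import xml.etree.ElementTree as ET

# The slide's fragment, with the elided content dropped and the tags balanced.
PHOTO_XML = """
<photo>
  <subject>http://www.w3.org/~timbl</subject>
  <name>Tim Berners-Lee</name>
</photo>
"""

# Same shape, meaningless content (hypothetical counter-example).
NONSENSE_XML = "<photo><subject>banana</subject><name>42</name></photo>"


def check_photo(doc: str) -> bool:
    """Document-style check: one or more <subject>, exactly one <name>."""
    root = ET.fromstring(doc)
    return (
        root.tag == "photo"
        and len(root.findall("subject")) >= 1
        and len(root.findall("name")) == 1
    )


structure_ok = check_photo(PHOTO_XML)
nonsense_ok = check_photo(NONSENSE_XML)
print(structure_ok, nonsense_ok)
```

Both documents pass, because the check only looks at element names and counts. Nothing in it says that the subject is a person or that the name belongs to that person.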
The SEMANTICS is in the links (e.g. to ontologies)! Event:title <daml:ObjectProperty rdf:ID="photograph"> <rdfs:domain rdf:resource="#Picture"/> <rdfs:range rdf:resource= …#person"/> </daml:ObjectProperty> Event:WebPage < > rdf:type photo:Photograph, Photo:File http://…/images#image1, Photo:topic :event1#event:speaker. Event1 a Event:event; date “May 7-11”, speaker http://…#timbl.html Title “WWW 2002…” TimBL rdf:type w3c-ont:person; name “Tim Berners-Lee” … <s:Class rdf:about="http://www.semanticweb.org/ontologies/swrc-onto-2000-09-10.daml#Conference"> <s:comment> describes a generic concept about events </s:comment> <s:subClassOf rdf:resource="http://www.semanticweb.org/ontologies/swrc-onto-2000-09-10.daml#Event"/> <a:disjointFrom rdf:resource="http://www.semanticweb.org/ontologies/swrc-onto-2000-09-10.daml#Workshop"/> <a:restrictedBy rdf:resource="http://www.semanticweb.org/ontologies/swrc-onto-2000-09-10.daml#genid18"/> <rdf:Description rdf:about="http://www.w3.org/2001/03/earl/0.95#Person"> <rdf:type rdf:resource="http://www.w3.org/2000/01/rdf-schema#Class"/> <rdfs:subClassOf rdf:resource="http://www.w3.org/2001/03/earl/0.95#Assertor"/> </rdf:Description>
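One way to picture "the semantics is in the links" is to treat the slide's statements as subject/predicate/object triples and follow them from the photo to the person. This is only a sketch: the short identifiers (`image1`, `event1`, `timbl`) and the helper names are placeholders loosely transcribed from the slide, not the real URIs, which the slide elides.

```python
# Triples loosely transcribed from the slide; identifiers are placeholders.
TRIPLES = {
    ("image1", "rdf:type", "photo:Photograph"),
    ("image1", "photo:topic", "event1"),
    ("event1", "rdf:type", "event:Event"),
    ("event1", "event:speaker", "timbl"),
    ("timbl", "rdf:type", "w3c-ont:person"),
    ("timbl", "name", "Tim Berners-Lee"),
}


def objects(subject, predicate):
    """All objects linked from `subject` by `predicate`, in sorted order."""
    return sorted(o for s, p, o in TRIPLES if s == subject and p == predicate)


def speaker_names(image):
    """Follow photo -> topic event -> speaker -> name."""
    names = []
    for event in objects(image, "photo:topic"):
        for person in objects(event, "event:speaker"):
            names.extend(objects(person, "name"))
    return names


print(speaker_names("image1"))
```

The answer comes from chaining links between the photo, the event and the person, each of which is typed against an ontology. The XML nesting of any single document plays no part.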
Database schema for International Digital Children’s Library Metadata repository ~40 classes ~25 properties (corresponding to database keys) From the small and structured <daml:Class rdf:about="#Book"> </daml:Class> <daml:Class rdf:about="#Author"> <rdfs:subClassOf rdf:resource="#Person"/> </daml:Class> <daml:Property rdf:about="#age"> <rdfs:domain rdf:resource="#Book"/> <rdfs:range rdf:resource="#age_value_id"/> </daml:Property> <rdfs:Property rdf:about="#award"> <rdfs:domain rdf:resource="#Book"/> <rdfs:range rdf:resource="#award_value_id"/> </rdfs:Property>
US National Cancer Institute Ontology ~17,000 classes >1 million synonyms and definitions RDF graph >500,000 triples To the large and complex <owl:Class rdf:ID="CD80_Antigen"> <rdfs:subClassOf rdf:resource="#Cell_Adhesion_Molecule"/> <rdfs:label>CD80 Antigen</rdfs:label> <rdfs:subClassOf> <owl:Restriction> <owl:onProperty rdf:resource="#Protein_Expressed_In_Tissue"/> <owl:someValuesFrom rdf:resource="#T_Cell"/> </owl:Restriction> </rdfs:subClassOf> <rdfs:subClassOf> <owl:Restriction> <owl:onProperty rdf:resource="#Protein_Expressed_In_Tissue"/> <owl:someValuesFrom rdf:resource="#B_Cell"/> </owl:Restriction> </rdfs:subClassOf> <rdfs:subClassOf> <owl:Restriction> <owl:onProperty rdf:resource="#Protein_Expressed_In_Tissue"/> <owl:someValuesFrom rdf:resource="#Macrophage"/> </owl:Restriction> </rdfs:subClassOf> </owl:Class> <CD80_Antigen rdf:ID="CD80_AntigenData"> <Swiss_Prot>P33681</Swiss_Prot> <DEFINITION> <![CDATA[<def-source>NCI</def-source> <def-definition>A member of the immunoglobulin supergene family. Co-regulator of T cell activation with CD86. Plays a critical role in autoimmune, humoral, and transplant responses. Expressed on activated T and B cells and on macrophages. (from PROW)</def-definition>]]> </DEFINITION>
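Each `owl:Restriction` in the class definition above says that every CD80 antigen has at least one `Protein_Expressed_In_Tissue` value of the named class. As a rough sketch, the restrictions can be pulled out of a well-formed version of the fragment with Python's standard XML parser. The `rdf:RDF` wrapper and namespace declarations are not on the slide and are assumed here (they are the standard W3C namespaces); `existential_restrictions` is a hypothetical helper name.

```python
import xml.etree.ElementTree as ET

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
RDFS = "http://www.w3.org/2000/01/rdf-schema#"
OWL = "http://www.w3.org/2002/07/owl#"


def restriction(tissue: str) -> str:
    """Build one subClassOf/someValuesFrom block like those on the slide."""
    return (
        "<rdfs:subClassOf><owl:Restriction>"
        '<owl:onProperty rdf:resource="#Protein_Expressed_In_Tissue"/>'
        f'<owl:someValuesFrom rdf:resource="#{tissue}"/>'
        "</owl:Restriction></rdfs:subClassOf>"
    )


# Assumed wrapper: the slide shows only the owl:Class element itself.
OWL_XML = (
    f'<rdf:RDF xmlns:rdf="{RDF}" xmlns:rdfs="{RDFS}" xmlns:owl="{OWL}">'
    '<owl:Class rdf:ID="CD80_Antigen">'
    '<rdfs:subClassOf rdf:resource="#Cell_Adhesion_Molecule"/>'
    + "".join(restriction(t) for t in ("T_Cell", "B_Cell", "Macrophage"))
    + "</owl:Class></rdf:RDF>"
)


def existential_restrictions(doc: str):
    """Return (property, filler class) pairs for each someValuesFrom restriction."""
    root = ET.fromstring(doc)
    found = []
    for node in root.iter(f"{{{OWL}}}Restriction"):
        prop = node.find(f"{{{OWL}}}onProperty")
        filler = node.find(f"{{{OWL}}}someValuesFrom")
        if prop is not None and filler is not None:
            found.append(
                (prop.get(f"{{{RDF}}}resource"), filler.get(f"{{{RDF}}}resource"))
            )
    return found


pairs = existential_restrictions(OWL_XML)
print(pairs)
```

This only reads the syntax. An OWL reasoner would go further and use the restrictions to classify and check instances, which a plain XML parser does not attempt.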
And many more
• DAML ontologies for:
  • Army Thesaurus: 18,000 concepts
  • CIA World Fact Book in DAML: 1400 classes, 2500 instances
  • UJTLs: 326 classes, all Joint and Naval Task lists as instances
  • Military Geofile: ~50 installation types; 56,700 Geoloc instances
  • NIMA Geonames: 36,750 Afghanistan targets
• And many more (220 in library)
  • WordNet, NCI Cancer Thesaurus, Heads of State, Weather Conditions, CYC, UNSPSC, Genealogy, Baseball, Chemistry, …
The Growing Semantic Web
• Jan 1, 03: Crawler finds 5.8M+ DAML statements on 20,000+ web pages
  • Doesn’t include many instance KBs tied to ontologies
  • Doesn’t include many very large RDFS-based KBs that include some OWL
• Ontology library at http://www.daml.org has 209 ontologies (March 2003)
  • Open for anyone to create
  • Open for anyone to use
• OWL is being supported by large corporate labs
  • Web tool developers: IBM, HP, Sun, Intel, Fujitsu
  • Content providers: Daimler-Chrysler, Nokia, Motorola, EDS, Agfa
• OWL is starting to be used by thesaurus developers
  • Cf. National Cancer Institute metathesaurus released in OWL Lite
  • CALL Thesaurus in DAML
  • NASA thesaurus available in DAML
• Use of semantic markup for Web Services beginning to move beyond basic research
  • DAML-S cited as required reading for Web Services Choreography WG
  • DAML-S being demoed to WSDL WG tomorrow!!
Tools are critical
“Tools for marking up images, figures and other non-textual data… must be built in a way that they tie into the "business processes" of the working scientist -- that is, rather than learning a whole new set of tools, the basic web tools of the scientist must include mechanisms that make it EASIER for the scientist to produce web content (appropriately marked up) while authoring papers, performing experiments, creating and logging data, and the other day-to-day activities of the working researcher.” (Science and the Semantic Web, Hendler, 03)
The same is true of the military user
Tools for visualization and use A number of tools for querying, browsing and visualizing SW data are being developed
Handle larger size/complexity Tools are now scaling as well
…is starting to work Are there any mountains in Europe bigger than Mt. Blanc? document service database
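The question on this slide can be read as a query over statements merged from a document, a service and a database. The sketch below illustrates that idea only. Every name in it is hypothetical, and the peaks other than Mont Blanc (with its approximate height in metres) are invented placeholders, not data from the talk.

```python
# Hypothetical statements from three kinds of source. Only Mont Blanc's
# approximate height is a real-world figure; PeakA/B/C are invented.
FROM_DOCUMENT = [
    ("MontBlanc", "locatedIn", "Europe"),
    ("PeakA", "locatedIn", "Europe"),
    ("PeakC", "locatedIn", "Europe"),
]
FROM_SERVICE = [
    ("PeakB", "locatedIn", "Asia"),
    ("PeakB", "heightMetres", 6000),
]
FROM_DATABASE = [
    ("MontBlanc", "heightMetres", 4808),
    ("PeakA", "heightMetres", 5000),
    ("PeakC", "heightMetres", 4000),
]


def merge(*sources):
    """Union the statements from every source into one graph."""
    return {triple for source in sources for triple in source}


def value(graph, subject, predicate):
    """Return one object for (subject, predicate), or None if absent."""
    for s, p, o in graph:
        if s == subject and p == predicate:
            return o
    return None


def taller_in(graph, region, reference):
    """Peaks located in `region` that are taller than `reference`."""
    limit = value(graph, reference, "heightMetres")
    if limit is None:
        raise ValueError(f"no height known for {reference}")
    peaks = {s for s, p, o in graph if p == "locatedIn" and o == region}
    return sorted(
        peak for peak in peaks
        if (value(graph, peak, "heightMetres") or 0) > limit
    )


graph = merge(FROM_DOCUMENT, FROM_SERVICE, FROM_DATABASE)
answer = taller_in(graph, "Europe", "MontBlanc")
print(answer)
```

No single source holds both the location and the height of every peak, so the query only works once the statements share identifiers and property names, which is what a common ontology provides.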
Wrapping/Mapping tool XML Schema (WSDL) Define transformation functions DAML Ontology (DAML-S) Manual editing if necess. Mappings for attributes
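A rough sketch of what such a wrapping/mapping step might look like: a table maps XML elements (as a WSDL message schema might define them) to ontology properties, each with a transformation function. The element names, the `weather:` property names and the `lift` helper are all hypothetical; the slide does not describe the actual tool's interface.

```python
import xml.etree.ElementTree as ET


def fahrenheit_to_celsius(text: str) -> float:
    """Transformation function: convert a Fahrenheit string to Celsius."""
    return round((float(text) - 32) * 5 / 9, 1)


# Hypothetical mapping table: XML element -> (ontology property, transform).
MAPPINGS = {
    "city": ("weather:location", str.strip),
    "tempF": ("weather:temperatureCelsius", fahrenheit_to_celsius),
}


def lift(xml_text: str, subject: str):
    """Turn mapped child elements into (subject, property, value) triples."""
    root = ET.fromstring(xml_text)
    triples = []
    for child in root:
        if child.tag in MAPPINGS:  # unmapped elements are left for manual editing
            prop, transform = MAPPINGS[child.tag]
            triples.append((subject, prop, transform(child.text)))
    return triples


message = (
    "<report><city> College Park </city>"
    "<tempF>212</tempF><internalId>7</internalId></report>"
)
triples = lift(message, "report1")
print(triples)
```

Elements without a mapping (here `internalId`) are skipped, which corresponds to the slide's "manual editing if necess." step.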
“Ad hoc Programming” Advanced information management capabilities Discovery, Filtering, Composition