
Ontology-Based Free-Form Query Processing for the Semantic Web

Ontology-Based Free-Form Query Processing for the Semantic Web. Mark Vickers, Brigham Young University, MS Thesis Defense. Presentation overview: Web Queries, Explanation of AskOntos, Demo, Evaluation, Future Work and Conclusion.


Presentation Transcript


  1. Ontology-Based Free-Form Query Processing for the Semantic Web Mark Vickers Brigham Young University MS Thesis Defense Supported by:

  2. Presentation Overview • Web Queries • Explanation of AskOntos • Demo • Evaluation • Future Work and Conclusion

  3. Web Queries: Challenges Example: Searching for a car • Cannot specify constraints • Documents returned (usually too many) • Takes time to read through documents • Determine relevance • Find information (price, year, etc.)

  4. Web Queries: Opportunities • Semantic web • Proposed ontology-based framework for making information machine-readable • Uses markup languages to identify information • “[A] search program can look for only those pages that refer to a precise concept…” -Tim Berners-Lee • How should semantic web be searched?

  5. Solution: AskOntos – a Query System for the Semantic Web • Allows free-form queries over semantically annotated pages • Processes queries using information extraction • Returns tables of extracted values

  6. AskOntos Overview

  7. Extraction Ontologies • Object sets • Relationship sets • Participation constraints • Lexical • Non-lexical • Primary object set • Aggregation • Generalization/Specialization

  8. Extraction Ontologies • Data Frame • Internal Representation: float • Value Phrase • Value Expression: \s*[$]\s*(\d{1,3})*(\.\d{2})? • Left Context: $ • Key Word Phrase • Key Word Expression: ([Pp]rice)|([Cc]ost)| … • Operation Phrase • Operator: > • Expression: (more\s*than)|(more\s*costly)|…
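
The data frame on this slide pairs an object set with recognizers for values, key words, and operator phrases. A minimal Python sketch of that idea follows. The `DataFrame` class, the `recognize` function, and the value pattern are assumptions made for illustration (the value pattern is a simplified stand-in for the slide's expression), not the AskOntos implementation.

```python
import re
from dataclasses import dataclass


@dataclass
class DataFrame:
    """Recognizers for one object set (hypothetical structure)."""
    name: str
    value_expr: str     # pattern for a value; group 1 holds the number
    keyword_expr: str   # pattern for words that signal the object set
    operators: dict     # comparison operator -> phrase pattern


# Patterns loosely modeled on the slide, simplified for this sketch.
price = DataFrame(
    name="Price",
    value_expr=r"[$]\s*((?:\d{1,3}(?:,\d{3})+|\d+)(?:\.\d{2})?)",
    keyword_expr=r"[Pp]rice|[Cc]ost",
    operators={">": r"more\s*than|more\s*costly"},
)


def recognize(frame, text):
    """Apply the three kinds of recognizers of a data frame to a text."""
    value = re.search(frame.value_expr, text)
    keyword = re.search(frame.keyword_expr, text)
    ops = [op for op, pat in frame.operators.items() if re.search(pat, text)]
    return {
        # Internal representation is float, as on the slide.
        "value": float(value.group(1).replace(",", "")) if value else None,
        "keyword": keyword.group(0) if keyword else None,
        "operators": ops,
    }


result = recognize(price, "Cost more than $4,500")
```

Keeping the operator phrases inside the data frame means a phrase such as "more than" is only interpreted in the context of the object set it belongs to.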

  9. Annotating Web Pages

  10. Annotating Web Pages

  11. Step 1. Parse Query “Find me the price and mileage of all red Nissans – I want a 1996 or newer” • Matched words: price, mileage, red, Nissan, 1996 • “or newer” matches the >= operator
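
The parsing step can be sketched as marking the words of the query that match an ontology's recognizers. The pattern tables below (`KEYWORDS`, `VALUES`, `OPERATORS`) and the function `parse_query` are hypothetical and cover only this example query; the recognizers AskOntos actually uses are not shown in the slides.

```python
import re

# Hypothetical recognizers for a car-ads ontology (assumed for this sketch).
KEYWORDS = {"Price": r"\bprice\b", "Mileage": r"\bmileage\b"}
VALUES = {
    "Color": r"\b(red|blue|green|black|white)\b",
    "Make": r"\b(Nissan|Toyota|Honda|Ford)",
    "Year": r"\b(19\d{2}|20\d{2})\b",
}
# Operator phrases are attached to the object set they modify.
OPERATORS = {"Year": {">=": r"\bor\s+newer\b", "<=": r"\bor\s+older\b"}}


def parse_query(query):
    """Return (requested object sets, conditions) found in a free-form query."""
    requested = [name for name, pat in KEYWORDS.items()
                 if re.search(pat, query, re.I)]
    conditions = []
    for name, pat in VALUES.items():
        match = re.search(pat, query, re.I)
        if not match:
            continue
        op = "="  # default when no operator phrase is present
        for candidate, phrase in OPERATORS.get(name, {}).items():
            if re.search(phrase, query, re.I):
                op = candidate
        conditions.append((name, op, match.group(1)))
    return requested, conditions


query = ("Find me the price and mileage of all red Nissans "
         "– I want a 1996 or newer")
requested, conditions = parse_query(query)
```

For the example query this yields the requested object sets Price and Mileage and one condition each for Color, Make, and Year, with "or newer" turning the Year condition into a `>=` comparison.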

  12. Step 2. Find Related Ontology “Find me the price and mileage of all red Nissans – I want a 1996 or newer” • Similarity value: 2 • Similarity value: 5
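
The slide does not say how the similarity value is computed. One plausible reading, assumed here, is that it counts how many parts of the query an ontology's recognizers match, and that the ontology with the highest count is selected. The ontology names and patterns below are invented for illustration.

```python
import re

# Hypothetical recognizer patterns per ontology (assumed, not from AskOntos).
ONTOLOGIES = {
    "CarAds": [r"\bprice\b", r"\bmileage\b", r"\bred\b", r"\bnissan",
               r"\b(?:19|20)\d{2}\b"],
    "HouseAds": [r"\bprice\b", r"\bbedrooms?\b", r"\b(?:19|20)\d{2}\b"],
}


def similarity(patterns, query):
    """Count every match of every recognizer pattern in the query."""
    return sum(len(re.findall(p, query, re.I)) for p in patterns)


def find_related_ontology(query):
    """Pick the ontology whose recognizers match the query most often."""
    return max(ONTOLOGIES, key=lambda name: similarity(ONTOLOGIES[name], query))


query = ("Find me the price and mileage of all red Nissans "
         "– I want a 1996 or newer")
scores = {name: similarity(pats, query) for name, pats in ONTOLOGIES.items()}
best = find_related_ontology(query)
```

With these toy patterns the car ontology scores higher than the house ontology because it also recognizes the mileage, color, and make words.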

  13. Step 3. Formulate XQuery Expression • Conjunctive and aggregate queries run over selected ontology’s extracted values • Value-phrase-matching words determine conditions • Conditions: • Color = “red” • Make = “Nissan” • Year >= 1996 (>= operator)

  14. Step 3. Formulate XQuery Expression • For • Let • Where • Return
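
A sketch of how the Where and Return clauses might be assembled from parsed conditions is given below. It follows the clause shape that appears in the example expression on the Query Translation Metrics slide, including the `or empty(...)` guard. The function names are hypothetical, and the For and Let clauses are left out because the slides elide them.

```python
def build_where(conditions):
    """Join (name, operator, value) conditions into a conjunctive where clause.

    Each condition also accepts records in which the value is missing,
    mirroring the `or empty($X)` guard in the slide's example expression.
    """
    clauses = [f'(${name} {op} "{value}" or empty(${name}))'
               for name, op, value in conditions]
    return "where " + " and ".join(clauses)


def build_return(names):
    """Build a return clause that emits one element per requested name."""
    fields = " ".join(f"<{n}>{{${n}}}</{n}>" for n in names)
    return f'return <Record ID="{{$id}}"> {fields} </Record>'


where_clause = build_where([("Color", "=", "red"), ("Year", ">=", "1996")])
return_clause = build_return(["Price", "Mileage"])
```

Values are emitted as quoted strings, as in the slide's example; comparing four-digit years as strings gives the same ordering as comparing them as numbers, but a real generator would likely need type-aware comparisons.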

  15. Step 4. Run XQuery Expression Over Ontology’s Extracted Data • Uses Qexo 1.7, GNU’s XQuery engine for Java • Orders results according to number of values
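
The ordering step can be illustrated with a small sketch. It assumes that "number of values" means the number of non-empty fields in a result record and that fuller records are listed first; the record layout is invented for the example.

```python
def value_count(record):
    """Number of fields in a record that actually hold a value."""
    return sum(1 for value in record.values() if value not in (None, ""))


def order_results(records):
    """Sort records so that those with the most values come first."""
    return sorted(records, key=value_count, reverse=True)


# Hypothetical extracted records; None marks a value that was not extracted.
records = [
    {"id": "ad1", "Price": "4500", "Mileage": None, "Year": "1998"},
    {"id": "ad2", "Price": "6200", "Mileage": "81000", "Year": "1999"},
    {"id": "ad3", "Price": None, "Mileage": None, "Year": "1996"},
]
ranked_ids = [record["id"] for record in order_results(records)]
```

Because the generated conditions accept records with missing values, ordering by completeness pushes the records that answer more of the query toward the top of the table.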

  16. Demo

  17. Evaluation of AskOntos • Success Measure: ability to translate free-form queries into formal queries • Extraction ontologies: car ads, house ads, countries, movies, and diamond ads • 3 rounds of testing • 50 queries each (gathered from other CS students) • 1st round discarded due to queries • Minor improvements to the system between rounds

  18. Query Translation Metrics “Find me the price and mileage of all red Nissans – I want a 1996 or newer.” • XQuery expression: for $doc in document("file:///.../Car.OWL")/rdf:RDF for $Record in $doc/owl:Thing … where ($Color="red" or empty($Color)) and ($Make="Nissan" or empty($Make)) and ($Year="1996" or empty($Year)) return <Record ID="{$id}"> <Price>{$Price}</Price> <Color>{$Color}</Color> <Make>{$Make}</Make> <Year>{$Year}</Year> </Record> • Human conversion • Return-Clause Names: {Price, Mileage, Color, Make, Year} • Conditions: {(Color,=,“red”), (Make,=,“Nissan”), (Year,>=,“1996”)} • Automated conversion • Return-Clause Names: {Price, Color, Make, Year} • Conditions: {(Color,=,“red”), (Make,=,“Nissan”), (Year,=,“1996”)}
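
The slide compares a human conversion with the automated one but does not give the formula behind the metrics. The sketch below assumes recall and precision computed over set overlap, applied separately to return-clause names and to conditions; the function name is hypothetical.

```python
def recall_precision(human, automated):
    """Recall and precision of the automated set against the human set."""
    correct = len(human & automated)
    recall = correct / len(human) if human else 1.0
    precision = correct / len(automated) if automated else 1.0
    return recall, precision


# Sets taken from the slide's example.
human_names = {"Price", "Mileage", "Color", "Make", "Year"}
auto_names = {"Price", "Color", "Make", "Year"}
human_conditions = {("Color", "=", "red"), ("Make", "=", "Nissan"),
                    ("Year", ">=", "1996")}
auto_conditions = {("Color", "=", "red"), ("Make", "=", "Nissan"),
                   ("Year", "=", "1996")}

names_score = recall_precision(human_names, auto_names)
conditions_score = recall_precision(human_conditions, auto_conditions)
```

Under this assumption the automated conversion finds 4 of the 5 return-clause names with no wrong ones, and 2 of its 3 conditions agree with the human conversion, since the Year condition uses `=` where the human chose `>=`.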

  19. Results

  20. Result Analysis Common reasons for errors: 1. Word not in lexicon: “5 Bedrooms, 3 Bath, study, game room, 2 car garage, and < $250,000”

  21. Result Analysis 2. Mistakes in regular expressions “Which countries use the euro?”

  22. Result Analysis 3. Not enough context: “What are the models from 2005”

  23. Conclusion/Contributions • AskOntos • Is a free-form query system for the semantic web • Applies information extraction for query processing • Answers questions with extracted data values • Contributions • Web queries that use semantic annotations • Web queries returning answers from extracted data • Processing free-form queries using ontologies

  24. Future Work • Disjunction and negation • Fuzzy queries • Spellchecker

  25. TREC 2004 QA Question Topics

  26. Related Research

  27. Related Research

  28. Simple Multiple-Record Documents • Genealogy Domain – from Troy Walker’s thesis • Highest-Fanout Separator • VSM Separator

  29. Complex Multiple-Record Documents

  30. Scaling to the Web • Ontologies crawl and harvest web pages • Ontologies extract values from pages • Ontologies indexed • Queries extracted by relevant ontologies • Rely on Google-like technology
