1 / 21

Concept Hierarchy Induction

Concept Hierarchy Induction. b y Philipp Cimiano p resented by Joseph Park. Concept Hierarchies. Structure information into categories Provide a level of generalization Form the backbone of any ontology. Common Approaches. Machine readable dictionaries Lexico -syntactic patterns

creda
Download Presentation

Concept Hierarchy Induction

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Concept Hierarchy Induction by Philipp Cimiano presented by Joseph Park

  2. Concept Hierarchies • Structure information into categories • Provide a level of generalization • Form the backbone of any ontology

  3. Common Approaches • Machine readable dictionaries • Lexico-syntactic patterns • Distributional similarity • Co-occurrence analysis

  4. Machine readable dictionaries • Exploit regularity of dictionaries • Find a hypernym for the defined word • Head of the first NP (genus or kernel term) • spring "the season between winter and summer and in which leaves and flowers appear“ • hornbeam "a type of tree with a hard wood, sometimes used in hedges“ • launch "a large usu. motor-driven boat used for carrying people on rivers, lakes, harbors, etc."

  5. Lexico-syntactic patterns • Hearst patterns • Hearstl: NP such as {NP,}* {(and | or)} NP • Hearst2: such NP as {NP,}* {(and | or)} NP • HearstS: NP {,NP}* {,} or other NP • Hearst4: NP {,NP}* {,} and other NP • Hearst5: NP including {NP,}* NP {(and | or)} NP • Hearst6: NP especially {NP,}* {(and|or)} NP • They should occur frequently and in many text genres • They should accurately indicate the relation of interest • They should be recognizable with little or no pre-encoded knowledge

  6. Example of using hearst pattern • 'Such injuries as bruises, wounds and broken bones...' • hyponym(bruise, injury) • hyponym(wound, injury) • hyponym(broken bone, injury)

  7. Distributional similarity • Distributional hypothesis • Words are similar to the extent they share the same context • ‘you shall know a word by the company it keeps’ –Firth

  8. Example

  9. Co-occurrence analysis • Collocation • Document-based subsumption • a certain term is more special than a term if also appears in all the documents in which appears

  10. Three More Approaches • Formal Concept Analysis (FCA) • Guided Clustering • Learning from heterogeneous sources of evidence

  11. Formal Concept Analysis • Set-theoretical approach • Parse corpus (extract dependencies) • Verb-pp-complement • Verb-object • Verb-subject • Extract surface dependencies (section 4.1.4)

  12. Pseudocode

  13. Example

  14. Results

  15. Guided Clustering • Uses hypernyms from WordNet and Hearst patterns

  16. Example

  17. Results

  18. More REsults

  19. Heterogeneous sources of evidence • Naïve threshold classifier • Uses Hearst patterns for corpus patterns • Uses Google API for web patterns • Uses Hearst patterns over downloaded pages • Uses WordNet senses • Uses ‘head’-heuristic (r-match) • Uses corpus based subsumption • Uses document based subsumption

  20. Results

  21. More results

More Related