1 / 13

Chinese analogy search considering multi-relations

CSC2012. Chinese analogy search considering multi-relations. Zhao Lu Department of Computer Science and Technology, East China Normal University, Shanghai, China. CSC2012. Our problem.

siusan
Download Presentation

Chinese analogy search considering multi-relations

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. CSC2012 Chinese analogy search considering multi-relations Zhao Lu Department of Computer Science and Technology, East China Normal University, Shanghai, China

  2. CSC2012 Our problem • Latent Relation Search is a recently proposed query-by-example technique that aims at solving queries in which the user specifies a triplet of terms (A,B,C) and seeks from a search engine a fourth term D whose relationship with C is analogous to that of A and B. • For example, HuoQigangand GuoJingjing is a couple. If the name Yao Ming is given, we can find out Yao Ming’s wife Ye Li. • The relation between Yao Ming and Ye Li is highly similar to HuoQigangand GuoJingjing.

  3. CSC2012 Contribution • We propose a hybrid method to represent relations between word-pairs using bag of words and lexical patterns. • We count frequencyand weight of word. • A k-means clustering method is used to extract all the relation words representing different relationships between word pair (A, B).

  4. CSC2012 Three Kinds of Relation Mapping OTO MR OTM

  5. CSC2012 Extracting relation-words Preprocessing Modular 1. Extract the complete sentences containing A and B. 2.Word segmentation and POS tagging.

  6. CSC2012 Extract relation-words by lexical pattern • We count the frequency and weight of each word. • The definition of weight is the times of the word which occurs in a sentence that match a lexical pattern. Table 1:Lexical patterns

  7. CSC2012 Clustering using a k-means method • In order to distinguish the different words on behalf of different relations, we use the k-means clustering to clarify the words into different clusters. • After clustering, we select the word with the highest frequency and weight value as the relation-representing word. • Extracting Target Words in the same way

  8. CSC2012 Experiment evaluations

  9. CSC2012 Experiment Results Fig.3 The relation-word ranks for test cases Fig.2 Percentage of questions which target words at variousrank

  10. CSC2012 MRR and Percentage of Target Words at Different Rank

  11. CSC2012 Conclusion • A Chinese Analogy search method is proposed. • Different relationships between the entities are distinguished by k-means clustering. • Our approach achieves a MRR of 0.773 which is higher than existing methods.

  12. CSC2012 Future work • In the future, we will focus on the way to distinguish the three kinds of relation mapping automatically . • Some method like SVM will be applied to raise the accuracy of extracting relation-words.

More Related