1 / 1

1. Introduction

kass
Download Presentation

1. Introduction

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Using Corpus-based Linguistic Approaches in Sense Prediction StudyJia-Fei Honga, Sue-Jin Kerb, Chu-Ren Huangc and Kathleen Ahrensda Institute of Linguistics, Academia Sinica, Taiwan,b Department of Computer Science and Information Management, Soochow University, Taiwan, c Faculty of Humanities, The Hong Kong Polytechnic University, Hong Kong,d Language Centre, Hong Kong Baptist University, Hong Kong 1. Introduction Our goal in this study of sense prediction is to generate solutions for lexical ambiguity in general. Our study showed the feasibility of sense-prediction without lexically assigned senses. 2. Research Question (1) How do we predict the word senses of a lexically ambiguous word to present different interpretations in different contexts or domains? (2) How do we use corpora as the databases to support a word sense prediction study? 3. Methodology In this sense prediction study, we explore all possible senses of the four target words---chi1 “eat”, wan2 “play”, huan4 “change” and shao1 “burn”. (1) To collect related collocations from Taiwan’s Central News Agency Gigaword Corpus (2) Character similarity clustering analysis and concept similarity clustering analysis by HowNet (3) To use Chinese Wordnet (CWN) and Xiandai Hanyu Cidian (Xian Han) to evaluate 4. Analysis 4.1 Character similarity clustering analysis Following Fujii and Croft (1993), there are two sub-steps: (1) Character similarity comparison between words (2) Group similarity comparison between words Then, to average the similarity of two different clusters 4.2 Concept similarity clustering analysis To assign all words to lexical concepts via HowNet. Two main strategies are as: (1) similarity between sememes and (2) similarity between concepts through HowNet 4.3 Evaluation After discussing two similarity clustering approaches for the four target words, we will evaluate them via CWN and Xian Han. 5. Conclusion We are able to demonstrate the viability of these two approaches as a superior model in this sense prediction study.

More Related