A Concept-based Model for Enhancing Text Categorization

A Concept-based Model for Enhancing Text Categorization. Presenter : Jiang-Shan Wang Authors : Shady Shehata, Fakhri Karray, Mohamed Kamel. 國立雲林科技大學 National Yunlin University of Science and Technology. SIGKDD 2008. Outline. Motivation Objective Methodology Experiments Conclusion

Presentation Transcript


  1. A Concept-based Model for Enhancing Text Categorization Presenter : Jiang-Shan Wang Authors : Shady Shehata, Fakhri Karray, Mohamed Kamel 國立雲林科技大學 National Yunlin University of Science and Technology SIGKDD 2008

  2. Outline • Motivation • Objective • Methodology • Experiments • Conclusion • Comments

  3. Motivation • Most text categorization techniques are based on word and/or phrase analysis of the text. • However, two terms can have the same frequency in their documents, yet one term may contribute more to the meaning of its sentences than the other. • Example: • electronic techniques • defense effort

  4. Objective To propose a new concept-based model that analyzes terms on the sentence and document levels.

  5. Methods - Overview

  6. Methods – Natural Language Processing • We have noted how some electronic techniques, developed for the defense effort, have eventually been used in commerce and industry. • [ARG0 We] [TARGET noted] [ARG1 how some electronic techniques developed for the defense effort have eventually been used in commerce and industry]. • We have noted how [ARG1 some electronic techniques] [TARGET developed] [ARGM-PNC for the defense effort] have eventually been used in commerce and industry. • We have noted how [ARG1 some electronic techniques developed for the defense effort] have [ARGM-TMP eventually] been [TARGET used] [ARGM-LOC in commerce and industry].
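The labeled sentence on slide 6 can be turned into data with a few lines of Python. The sketch below is only an illustration: the function names `parse_structure` and `structures_containing` are hypothetical, and counting the verb-argument structures in which a term appears is an assumed stand-in for a sentence-level importance signal, not necessarily the measure the authors use.

```python
import re

# Matches one bracketed span such as "[ARGM-LOC in commerce and industry]".
LABELED = re.compile(r"\[(\S+)\s+([^\]]+)\]")

def parse_structure(annotated):
    """Return (label, text) pairs for every bracketed span in one annotated sentence."""
    return [(label, text.strip()) for label, text in LABELED.findall(annotated)]

def structures_containing(term, structures):
    """Count the verb-argument structures where `term` occurs inside a labeled span."""
    term = term.lower()
    return sum(
        any(term in text.lower() for _, text in spans)
        for spans in structures
    )

# The three verb-argument structures from slide 6, one per target verb.
annotated = [
    "[ARG0 We] [TARGET noted] [ARG1 how some electronic techniques developed "
    "for the defense effort have eventually been used in commerce and industry]",
    "We have noted how [ARG1 some electronic techniques] [TARGET developed] "
    "[ARGM-PNC for the defense effort] have eventually been used in commerce and industry",
    "We have noted how [ARG1 some electronic techniques developed for the defense effort] "
    "have [ARGM-TMP eventually] been [TARGET used] [ARGM-LOC in commerce and industry]",
]
structures = [parse_structure(s) for s in annotated]

for term in ("electronic techniques", "commerce", "noted"):
    print(term, structures_containing(term, structures))
```

Text outside the brackets is ignored on purpose, so a word counts toward a structure only when it fills the verb or one of its arguments. Matching is by plain substring, which is enough for this example but would need tokenization for short terms.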

  7. Methods – Statistical Analyzer

  8. Methods – Conceptual Ontological Graph

  9. Methods – Concept Extractor

  10. Experiments

  11. Conclusion This work bridges the gap between natural language processing and text categorization disciplines. The quality of the categorization results by the proposed model surpasses that of traditional approaches significantly.

  12. Comments • Advantage • Considers sentence semantics for text categorization. • Drawback • . • Application • Text categorization. • Document categorization. • Web document categorization.
