1 / 17

An IPC-based vector space model for patent retrieval

An IPC-based vector space model for patent retrieval. Presenter: Jun-Yi Wu Authors: Yen-Liang Chen, Yu-Ting Chiu. 國立雲林科技大學 National Yunlin University of Science and Technology. 2011 IPM. Outline. Motivation Objective Methodology Experiments Conclusion Comments. Motivation.

dyan
Download Presentation

An IPC-based vector space model for patent retrieval

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. An IPC-based vector space model for patent retrieval Presenter: Jun-Yi Wu Authors: Yen-Liang Chen, Yu-Ting Chiu 國立雲林科技大學 National Yunlin University of Science and Technology 2011 IPM

  2. Outline • Motivation • Objective • Methodology • Experiments • Conclusion • Comments

  3. Motivation • The weakness in traditional VSM is that the indexing vocabulary changes whenever changes occur in the document set, or the indexing vocabulary selection algorithms, or parameters of the algorithms, or if wording evolution occurs.

  4. Objective • The major objective of this research is to design a method to solve the afore-mentioned problems for patent retrieval. • The proposed method utilizes the special characteristics of the patent documents, the International Patent Classification (IPC) codes, to generate the indexing vocabulary for presenting all the patent documents.

  5. Methodology

  6. Methodology Patent DB • Phase 1: Collect patent documents

  7. Methodology • Phase 2:Text preprocessing

  8. Methodology • Phase 3: Generate category * term vectors 8

  9. Methodology 9

  10. Methodology • Phase 4: Generate term * category vector • Phase 5: Generate document * category vectors 10

  11. Experiments 11

  12. Experiments 12

  13. Experiments 13

  14. Experiments 14

  15. Experiments 15

  16. Conclusion • A novel method, IPC-based VSM, was proposed for generating vectors to represent patent documents. • The indexing vocabulary generated in IPC-based VSM was better at finding similar documents than either of the traditional methods. 16

  17. Comments • Advantage • IPC_based SVM better than previous methods. • Application • Information Retrieval 17

More Related