1 / 62

Organizing and Implementing on the Thesauri Mapping Project

Organizing and Implementing on the Thesauri Mapping Project. Dr. Chang Chun Associate Professor Agriculture Information Institute, Chinese Academy of Agricultural Sciences (AII/CAAS), Beijing China The Seventh Agricultural Ontology Service (AOS) Workshop

guy
Download Presentation

Organizing and Implementing on the Thesauri Mapping Project

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Organizing and Implementing on the Thesauri Mapping Project Dr. Chang Chun Associate Professor Agriculture Information Institute, Chinese Academy of Agricultural Sciences (AII/CAAS), Beijing China The Seventh Agricultural Ontology Service (AOS) Workshop AFITA 2006 November 9-11, Bangalore, India

  2. 7th AOS Outline Outline • Introduction • Organizing • AGROVOC and CAT • Conclusions • Objectives • Methods • Mapping rules • Discussions

  3. 7th AOS Introduction Brief Introduction on the Mapping Project AGROVOC FAO CAT CAAS ExactMatch InexactMatch BroadMatch NarrowMatch AND,OR,NOT No mapping mapping mapping Resource Mapping Rules Target

  4. 7th AOS Objective Objective 1: Enrich AOS Terminology Domain Knowledge • Key words have problems in search information; • Thesauri are still working in information management; • Research on conversion from thesaurus to ontology; • Mapping can add more new domain knowledge.

  5. 7th AOS Objective CAT Search end Chinese users Chinese data Search Mapping Information ( e, b,n… ) AGRIS data English Users Search AGROVOC Search end Objective 2: Develop Cross-Language Search System

  6. 7th AOS Organizing The Time and Tools of Mapping Project • The time of mapping project: From September 2005 to September 2006; • Mapping rules: a revision method of SKOS Mapping Vocabulary Specification; • Mapping direction: from CAT (resource) to AGROVOC (target) • Mapping tools: Protégé , Excel sheet, CAT and AGROVOC CD-ROM.

  7. 7th AOS Organizing Working Flow • From 2005-09-01 to 2005-11-05: make plans of mapping methods, prepare and test the mapping data; • From 2005-11-06 to 2006-05-30: the training and mapping with Excel sheet; • From 2006-06-01 to 2006-09-30: convert the Excel sheet information to OWL mapping data, Protégé can read this information.

  8. 7th AOS Organizing The specialists • we organized about 16 agricultural domain specialists in CAAS, many of them are PhD students, they were chosen based on the domain. • The main domain are biological science, agricultural environmental science, agricultural meteorology, fertilizer science, horticulture, forestry practice, plant protection, agronomy, agricultural products processing and storage and comprehensive utilization, veterinary medicine, biological control, Industrial technology and equipment, fishery science, and so on. • Some of them have knowledge of thesaurus.

  9. 7th AOS Organizing AGROVOC and CAT • AGROVOC: • 27736 English terms: 16769 descriptors, 10967 non descriptors • 25060 Chinese terms: 16628 descriptors, 8432 non descriptors • 1240 top terms • organized in 130 categories (AGRIS/CARIS) • includes biological taxonomy and geographical names • CAT: • 64638 Chinese terms: 51614 descriptors, 13024 non-descriptors • 51400descriptors has at least one translation • 2332 top terms • organized in 40 categories (e.g. crops, etc.) • includes biological taxonomy and geographical names

  10. 7th AOS Organizing To Finish the Mapping Work in Two Steps • First, Excel sheet: We split CAT into 36 documents based on the domain,we use Excel sheet, try to find all mapping information and input it in the Excel sheet, all these sheets will be kept as original data; • Second,convert information to OWL document: After we finish the all Excel sheets, we convert and input these mapping information into OWL documents, they can be read in Protégé after import CAT and AGROVOC.

  11. 7th AOS Organizing A B C D E F G H I J C-term code C- term Relation A-term code A- term combine relation C-revise suggestion C- comment A-revise suggestion A- comment Excel sheets

  12. 7th AOS Methods Mapping Standards and Methods • Exact Match,Inexact Match; • Broad Match,Narrow Match; • AND;OR;NOT;

  13. 7th AOS Methods Mapping relationships • Exact match • SKOS: exactMatch • OWL: equivalentTo • Broader/Narrower match • SKOS: broadMatch, narrowMatch • OWL: subClassOf • OR, AND, NOT operators • SKOS: OR, AND, NOT • OWL unionOf, intersectionOf, complementOf • Partial equivalences • SKOS: minorMatch, majorMatch

  14. 7th AOS Methods Exact Match Mapping AGROVOC Exact Match CAT Such as:‘17147-禾谷类作物’ Exact Match ‘25512-Cereal crops’

  15. 7th AOS Methods equivalentClass: One of main mapping relation (13105) <rdf:Description rdf:about="http://www.caas.net.cn/2005/cat#c_17147_禾谷类作物_Cerealcrop"> <owl:equivalentClass> <rdf:Description rdf:about="http://www.fao.org/aos/agrovoc/2005#c_25512_Cerealcrops_禾谷类作物"> <owl:equivalentClass rdf:resource="http://www.caas.net.cn/2005/cat#c_17147_禾谷类作物_Cerealcrop"/> </rdf:Description> </owl:equivalentClass> </rdf:Description>

  16. 7th AOS Methods Inexact Match Mapping CAT AGROVOC Inexact Such as:‘经济大国’Inexact match‘Developed countries’

  17. 7th AOS Methods Inexact Match : We seldom use this mapping relation 55581_玉米芯_Maizecobie 16171 <rdf:Description rdf:about="http://www.caas.net.cn/2005/cat#c_55581_玉米芯_Maizecob"> <rdfs:comment rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >inexact mapping with 16171</rdfs:comment> </rdf:Description>

  18. 7th AOS Methods Broad Match Mapping CAT AGROVOC Broad Match Such as :“35234-普及教育”Broad Match ‘2488-Education’

  19. 7th AOS Methods subClassOf:BroadMatch (another main mapping relation 11408) <rdf:Description rdf:about="http://www.caas.net.cn/2005/cat#c_35234_普及教育_Universaleducation"> <rdfs:subClassOf rdf:resource="http://www.fao.org/aos/agrovoc/2005#c_2488_Education_教育"/> </rdf:Description>

  20. 7th AOS Methods Narrow Match Mapping AGROVOC CAT Narrow Match Such as:“8341_岛屿_Islands” Narrow Match “695_Atolls_环礁”

  21. 7th AOS Methods subClassOf: Narrow Match (173) <rdf:Description rdf:about="http://www.fao.org/aos/agrovoc/2005#c_695_Atolls_环礁"> <rdfs:subClassOf rdf:resource="http://www.caas.net.cn/2005/cat#c_8341_岛屿_Islands"/> </rdf:Description>

  22. 7th AOS Methods AND;OR;NOT AND OR NOT “59683-自动标引”Exact Match‘11729-Indexing of information’AND‘15855 -Automation’ “7536_大麦_Barley”Exact Match‘823_Barley_大麦OR3662_Hordeum vulgare_大麦植物’ ‘12114-非传染性病害’ Exact match ‘5962-Plant diseases’ NOT ‘34024-Infectious diseases’

  23. 7th AOS Methods AND “59683_自动标引_Automaticindexing”Exact Match11729_Indexingofinformation_信息编目 and 15855_Automation_自动化

  24. 7th AOS Methods AND: intersectionOf <owl:Class> <owl:intersectionOf rdf:parseType="Collection"> <rdf:Description rdf:about="http://www.fao.org/aos/agrovoc/2005#c_11729_Indexingofinformation_信息编目"/> <rdf:Description rdf:about="http://www.fao.org/aos/agrovoc/2005#c_15855_Automation_自动化"/> </owl:intersectionOf> </owl:Class> <rdf:Description rdf:about="http://www.caas.net.cn/2005/cat#c_59683_自动标引_Automaticindexing"> <owl:equivalentClass> <owl:Class> <owl:intersectionOf rdf:parseType="Collection"> <rdf:Description rdf:about="http://www.fao.org/aos/agrovoc/2005#c_11729_Indexingofinformation_信息编目"/> <rdf:Description rdf:about="http://www.fao.org/aos/agrovoc/2005#c_15855_Automation_自动化"/> </owl:intersectionOf> </owl:Class> </owl:equivalentClass> </rdf:Description>

  25. 7th AOS Methods OR 7536_大麦_Barley”Exact Match ‘823_Barley_大麦OR3662_Hordeum vulgare_大麦植物

  26. 7th AOS Methods OR: unionOf <owl:Class> <owl:unionOf rdf:parseType="Collection"> <rdf:Description rdf:about="http://www.fao.org/aos/agrovoc/2005#c_823_Barley_大麦"/> <rdf:Description rdf:about="http://www.fao.org/aos/agrovoc/2005#c_3662_Hordeumvulgare_大麦植物"/> </owl:unionOf> </owl:Class> <rdf:Description rdf:about="http://www.caas.net.cn/2005/cat#c_7536_大麦_Barley"> <owl:equivalentClass> <owl:Class> <owl:unionOf rdf:parseType="Collection"> <rdf:Description rdf:about="http://www.fao.org/aos/agrovoc/2005#c_823_Barley_大麦"/> <rdf:Description rdf:about="http://www.fao.org/aos/agrovoc/2005#c_3662_Hordeumvulgare_大麦植物"/> </owl:unionOf> </owl:Class> </owl:equivalentClass> </rdf:Description>

  27. 7th AOS Methods NOT ‘12114_非传染性病害_Non-infectiousdiseases’ Exact match ‘5962_Plantdiseases_植物病害 ’ ANDNOT ‘34024_Infectiousdiseases_侵染性病害’

  28. 7th AOS Methods NOT: complementOf <owl:Class> <owl:intersectionOf rdf:parseType="Collection"> <rdf:Description rdf:about="http://www.fao.org/aos/agrovoc/2005#c_5962_Plantdiseases_植物病害"/> <owl:Class> <owl:complementOf rdf:resource="http://www.fao.org/aos/agrovoc/2005#c_34024_Infectiousdiseases_侵染性病害"/> </owl:Class> </owl:intersectionOf> </owl:Class> <rdf:Description rdf:about="http://www.caas.net.cn/2005/cat#c_12114_非传染性病害_Non-infectiousdiseases"> <owl:equivalentClass> <owl:Class> <owl:intersectionOf rdf:parseType="Collection"> <rdf:Description rdf:about="http://www.fao.org/aos/agrovoc/2005#c_5962_Plantdiseases_植物病害"/> <owl:Class> <owl:complementOf rdf:resource="http://www.fao.org/aos/agrovoc/2005#c_34024_Infectiousdiseases_侵染性病害"/> </owl:Class> </owl:intersectionOf> </owl:Class> </owl:equivalentClass> </rdf:Description>

  29. 7th AOS Methods No mapping: 13867_干扰_Interference

  30. 7th AOS Methods NoMapping: comment <rdf:Description rdf:about="http://www.caas.net.cn/2005/cat#c_13867_干扰_Interference"> <rdfs:comment rdf:datatype="http://www.w3.org/2001/XMLSchema#string" >AGROVOC hasn't this concept</rdfs:comment> </rdf:Description>

More Related