
Unihan Database ( unicode/charts/unihan.html )

Chinese Characters Mapping Table of Japanese, Traditional Chinese and Simplified Chinese. Chenhui Chu, Toshiaki Nakazawa, Sadao Kurohashi (Graduate School of Informatics, Kyoto University). Kanji & Hanzi. Freely Available Resources.


Presentation Transcript


  1. Chinese Characters Mapping Table of Japanese, Traditional Chinese and Simplified Chinese
     Chenhui Chu, Toshiaki Nakazawa, Sadao Kurohashi (Graduate School of Informatics, Kyoto University)

     Kanji & Hanzi
     • A mapping table of Chinese characters in Japanese (Kanji) and Chinese (Hanzi) is useful for many Japanese-Chinese bilingual tasks
     • Complicated relations between Kanji and Hanzi
     • Character sets of Kanji and Hanzi

     Freely Available Resources
     • Unihan Database (http://unicode.org/charts/unihan.html)
     • Hanzi Converter Standard Conversion Table (http://www.mandarintools.com/zhcode.html)
       • 6,740 TC and SC pairs
     • Kanconvit Mapping Table (http://kanconvit.ta2o.net/)
       • 3,506 one-to-one mappings of Kanji, TC and SC

     Method & Resource
     • The method
       [Figure labels: JIS Kanji, BIG5, GB2312; Variants; Classification]
       JIS Kanji: 雪 愛 国 発 詑 鮃 込 ・・・
       BIG5:      雪 愛 國 發 詑 ・・・
       GB2312:    雪 爱 国 发 鲆 ・・・

       Category  Kanji  TC   SC
       C1        雪     雪   雪
       C2        愛     愛   爱
       C3        国     國   国
       C4        発     發   发
       C5        詑     詑   N/A
       C6        鮃     N/A  鲆
       C7        込     N/A  N/A
       ・・・
     • Resource statistics

     Completeness Evaluation
     • Wiktionary (http://www.wiktionary.org/)
     • Comparison results: Unihan, Hanzi Converter, Kanconvit
     • Not found in Wiktionary
     • Multiple Hanzi forms
     • Not found in proposed method
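
The seven example rows (C1 to C7) suggest that each Kanji is assigned a category according to which of its TC and SC forms exist and which of them share its shape. The following is a minimal sketch of that idea in Python, not the authors' implementation. The function name `classify`, the use of `None` for N/A, and the exact decision rules are assumptions inferred only from the seven examples on the slide; the slide does not define the categories in words.

```python
from typing import Optional


def classify(kanji: str, tc: Optional[str], sc: Optional[str]) -> str:
    """Assign a category label C1..C7 to a (Kanji, TC, SC) triple.

    Hypothetical rules, inferred from the slide's examples only.
    None stands for "N/A" (no corresponding character).
    """
    if tc is None and sc is None:
        return "C7"  # no TC and no SC form, e.g. 込
    if sc is None:
        return "C5"  # TC form only, e.g. 詑 / 詑 / N/A
    if tc is None:
        return "C6"  # SC form only, e.g. 鮃 / N/A / 鲆
    if kanji == tc == sc:
        return "C1"  # same shape in all three, e.g. 雪
    if kanji == tc:
        return "C2"  # Kanji matches TC, SC differs, e.g. 愛 / 愛 / 爱
    if kanji == sc:
        return "C3"  # Kanji matches SC, TC differs, e.g. 国 / 國 / 国
    return "C4"      # Kanji matches neither, e.g. 発 / 發 / 发


# The seven example triples from the slide.
EXAMPLES = [
    ("雪", "雪", "雪"),
    ("愛", "愛", "爱"),
    ("国", "國", "国"),
    ("発", "發", "发"),
    ("詑", "詑", None),
    ("鮃", None, "鲆"),
    ("込", None, None),
]

labels = [classify(k, t, s) for k, t, s in EXAMPLES]
```

Missing forms are checked before shape equality so that a `None` entry is never compared as if it were a character. In this sketch, C5 and C6 are keyed only on which form is absent, because the slide gives a single example for each.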
