Hong Kong Chinese Authority (Name) Project – the HKCAN XML version

XML and Authority Control. ALA Annual Conference, June 26, 2005, Chicago, IL. Joanna Yi-hang PONG, City University of Hong Kong.

Presentation Transcript


  1. Hong Kong Chinese Authority (Name) Project – the HKCAN XML version • XML and Authority Control • ALA Annual Conference, June 26, 2005, Chicago, IL • Joanna Yi-hang PONG, City University of Hong Kong

  2. HKCAN • Hong Kong Chinese Authority (Name) • A collaborative project since 1999 • 7 Hong Kong university libraries • Chinese University of Hong Kong • City University of Hong Kong • Hong Kong Baptist University • Hong Kong Institute of Education • Hong Kong Polytechnic University • Lingnan University • University of Hong Kong

  3. Aims • To build up a Chinese name authority file with CJK (Chinese, Japanese, Korean) scripts that meets the needs of the bilingual community • To improve and streamline authority-control operations by standardizing name headings and setting principles for authority record selection, so that the work is “Better”, “Faster” and “Cheaper” • To participate in regional and global cooperative activities on authority work

  4. HKCAN authority record model • 7xx model • Based on MARC discussion paper 2001-DP05 Multilingual Authority Records in MARC21 Authority Format • http://www.loc.gov/marc/marbi/2001/2001-dp05.html • This paper explores how to handle multilingual records in the MARC21 authority format • HKCAN based on the recommended Model C

  5. Record model – Model C (cont’d) • Established headings (1xx) are defined by a catalog context (i.e. cataloging rules and language of catalog) • Context designations are shown in field 008/10-11 (rules) or field 040 $b (language of cataloging) • 4xx and 5xx reference tracings are appropriate to the 1xx heading in that context • Alternative established headings are recorded in 7xx fields, along with an indication of the context in which they are appropriate

  6. Record model – Model C (cont’d) [Diagram: entity-relationship model of an authority record, linking Authority record, Context designation (cataloging rules, descriptive convention, language of cataloging), Established heading (1xx), Reference tracings (4xx, 5xx), Alternative established heading (7xx) and Context indicator, with one-to-one and one-to-many cardinalities.]

  7. Record model - Model C (cont’d) Example:
     Context designation:
       008/10 (Cataloging rules): c (AACR2)
       008/11 (Subject system/thesaurus rules): a (LCSH)
       040 $b (Language of cataloging): eng (English)
     Established heading (1xx):
       100 0# $a John Paul $b II, $c Pope, $d 1920-
     Reference tracings (4xx/5xx):
       400 0# $a Joannes Paulus $b II, $c Pope, $d 1920-
       400 0# $a Juan Pablo $b II, $c Pope, $d 1920-
       400 0# $a Jean Paul $b II, $c Pope, $d 1920-
       400 0# $a Johannes Paul $b II, $c Pope, $d 1920-
       400 0# $a Joann Pavel $b II, $c Pope, $d 1920-
       <etc.>
       510 2# $a Catholic Church. $b Pope (1978-: John Paul II)
     Alternative established headings (7xx), with the context indicator in $7:
       700 04 $a Juan Pablo $b II, $c Papa, $d 1920- $7 aacr//spa
       700 06 $a Jean Paul $b II $c pape, $d 1920- $7 aacr//fre
       700 04 $a Jean Paul $b II $c (Pape), $d 1920- $7 ncafnor
       700 04 $a Johannes Paul $c Papst, $b II $7 rak
       700 04 $a <Chinese heading for John Paul> $b II $c <Chinese designation for Pope> $d 1920- $7 aacr//chi
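The Model C structure above can be sketched as a small data model. This is only an illustration, assuming hypothetical class and method names (`Heading`, `AuthorityRecord`, `heading_for`) that are not part of HKCAN or MARC21; it shows one established heading per record, with 7xx alternatives selected by their $7 context indicator.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Heading:
    tag: str                        # MARC tag, e.g. "100", "400", "700"
    text: str                       # heading text, subfields joined for brevity
    context: Optional[str] = None   # $7 context indicator (7xx fields only)


@dataclass
class AuthorityRecord:
    rules: str                      # 008/10, e.g. "c" for AACR2
    language: str                   # 040 $b, e.g. "eng"
    established: Heading            # the single 1xx heading
    tracings: List[Heading] = field(default_factory=list)      # 4xx/5xx
    alternatives: List[Heading] = field(default_factory=list)  # 7xx

    def heading_for(self, context: str) -> str:
        """Return the 7xx heading for a context, else fall back to the 1xx."""
        for alt in self.alternatives:
            if alt.context == context:
                return alt.text
        return self.established.text


# Hypothetical record built from the slide's example values.
record = AuthorityRecord(
    rules="c",
    language="eng",
    established=Heading("100", "John Paul II, Pope, 1920-"),
    tracings=[Heading("400", "Joannes Paulus II, Pope, 1920-")],
    alternatives=[
        Heading("700", "Juan Pablo II, Papa, 1920-", "aacr//spa"),
        Heading("700", "Jean Paul II pape, 1920-", "aacr//fre"),
    ],
)
```

Calling `record.heading_for("aacr//spa")` picks the Spanish alternative, while an unknown context falls back to the established 1xx heading.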

  8. Record model - HKCAN record
     008    941020nc acannaabn |a aaa |||
     010    $anr 94034993
     035    $a(DLC#)nr 94034993a
     040    $aDLC-R$beng$cDLC-R$dOCoLC$dHkCU$dHkCAN
     066    $c$1
     100 1  $aZhou, Ying,$d17th cent.
     400 1  $wnne$aChou, Ying,$d17th cent.
     400 1  $aZhou, Fangshu,$d17th cent.
     400 1  $a周方叔,$d17th cent.
     400 1  $aChou, Fang-shu,$d17th cent.
     670    $aChih lin (卮林), 1992:$bt.p. (Chou Ying)
     670    $aChung wen ta tzʻu tien (中文大詞典):$bv. 6, p. 290 (Chou Ying; of Ming; native of Pʻu-tʻien; t. Fang-shu; author of Chih lin; lived around the mid of Emperor Chʻung-chen reign)
     670    $aHis 卮林 : 10卷, 附補遺1卷, [1963]:$bt.p. (周嬰)
     670    $a中國人名大辭典, 1934:$bp. 545 (周嬰, 明莆田人, 字方叔, 崇禎中以貢生知 上猶縣, 所著卮林, 体近類書)
     700 1  $a周嬰,$d17th cent

  9. Record model – HKCAN record (cont’d)
     008    9802097n| acannaab| |a ana
     040    $aHkCU $cHkCU
     066    $c$1
     110 1  $aHong Kong (China). $bCensus and Statistics Dept.
     410 2  $aCensus and Statistics Dept. (Hong Kong, China)
     410 2  $aZheng fu tong ji chu (Hong Kong, China)
     410 2  $a政府統計處(香港, 中國)
     510 1  $aHong Kong. $bCensus and Statistics Dept. $wa
     510 1  $a香港. $b統計處 $wa
     510 1  $a香港. $b政府統計處 $wa
     667    $aNon-conventional pinyin pairing of 1xx/7xx|5HkCAN
     670    $aIts Hong Kong social & economic trends = 香港社會及經濟趨勢, 1995- : $b1997 ed., cover (Census and Statistics Department Hong Kong Special Administrative Region People's Republic of China 中華人民共和國 香港特別行政區 政府統計處)
     710 1  $a香港(中國). $b政府統計處

  10. Record model – in library system • INNOPAC library system • 7xx enhanced redirection function • Re-indexed as “Equivalent heading” • Displayed in WebPAC as “Equivalent heading”

  11. Record model – in library system (cont’d)

  12. Record model – in library system (cont’d)

  13. TTS (Big5) version vs. XML version • First developed in 1999 by a Taiwan software vendor • Further developed and maintained by TTS, another Taiwan software company • Based on Big5, which restricts the characters that can be encoded • No Z39.50 support in the design • In October 2003, new HKCAN software was developed by the Chinese University of Hong Kong: the XML version

  14. Document Type Definition • HKCAN DTD (Document Type Definition) – specifies the structure of each XML authority record • With this DTD, records can be output to the XML schema or other related schemas if needed • The DTD has served all the necessary functionality of the present XML platform well

  15. DTD (cont’d)

  16. HKCAN XML platform [Flow diagram]
     • Records in Communication MARC format with EACC encoding
     • Program to convert records from Communication MARC format to XML format
     • Records in XML format with EACC encoding
     • Program to convert the records from CCCII encoding to UTF-8 encoding
     • Records in XML format with UTF-8 encoding
     • Import the records to a relational database for index search
     • HKCAN XML full text search server (Tamino) for full text search, records display & download
     • HKCAN index search server (SQL Anywhere 8.0)
     • Retrieve the full record from HKCAN XML server
     • MARC
     • Web interface: Index search, Full text search
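The MARC-to-XML step in the platform above can be sketched roughly as follows. This is a minimal illustration, not the HKCAN converter: the element and attribute names (`record`, `datafield`, `subfield`, `ind`) are assumptions rather than the HKCAN DTD, the input is assumed to be already parsed into (tag, indicators, subfields) tuples, and the EACC-to-UTF-8 character mapping, which needs the mapping tables, is left out.

```python
import xml.etree.ElementTree as ET


def fields_to_xml(fields):
    """Serialize parsed MARC fields to UTF-8 encoded XML bytes.

    `fields` is a list of (tag, indicators, subfields) tuples, where
    `subfields` is a list of (code, value) pairs. Element names here are
    illustrative and do not follow the HKCAN DTD.
    """
    record = ET.Element("record")
    for tag, indicators, subfields in fields:
        datafield = ET.SubElement(record, "datafield", tag=tag, ind=indicators)
        for code, value in subfields:
            sub = ET.SubElement(datafield, "subfield", code=code)
            sub.text = value
    return ET.tostring(record, encoding="utf-8")


# Two fields taken from the HKCAN record shown on slide 8.
fields = [
    ("100", "1 ", [("a", "Zhou, Ying,"), ("d", "17th cent.")]),
    ("700", "1 ", [("a", "周嬰,"), ("d", "17th cent")]),
]
xml_bytes = fields_to_xml(fields)
```

Parsing `xml_bytes` back with `ET.fromstring` recovers the same tags and subfield values, which is a simple way to check that the Chinese characters survive the round trip.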

  17. Record conversion From MARC record with EACC encoding

  18. Record conversion (cont’d) To XML record with EACC encoding

  19. Record conversion (cont’d) To XML record with UTF-8 encoding

  20. Record search • Index search (browse search) • Full text search (phrase/keyword search) • Tamino server: Full text search, record display, record download • SQL Anywhere server: Index search

  21. Support of Unicode encoding & display • Unicode character set – able to accommodate both simplified & traditional Chinese characters • Enables faithful transcription of Chinese names
     100 1  $aLu, Dunji
     400 1  $aLu, Tun-chi $wnne
     670    $aHuang, Z.X. Huang Zongxi shi wen xuan yi (黃宗羲詩文選譯), 1991
     670    $aHis 自由报人 : 曹聚仁传, 2003: $bt.p. (卢敦基) biog. (男, 1962年生于浙江永康, 1978年3月入杭州大学中文系, 1987年获文学硕士学位, 现为浙江省社会科学院越文化研究所所长, 研究员, 浙江大学在读论文博士, 中囯作家协会会员, 著有《风起云扬--汉书随笔》,《金庸小说论》等) [Simplified Chinese characters]
     700 1  $a盧敦基 [Traditional Chinese characters]

  22. Support of traditional vs. simplified Chinese character mapping • Traditional/simplified mapping table • Tamino server: built-in function to provide linked searching • Chinese index and phrase searching is supported whether the input characters are simplified or traditional
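The idea of linked searching through a mapping table can be sketched as below. This is not Tamino's built-in function: the tiny table and the function names (`normalize`, `search`) are assumptions for illustration, and a real table would also have to deal with simplified characters that map to more than one traditional character.

```python
# Illustrative excerpt of a simplified-to-traditional mapping table.
SIMPLIFIED_TO_TRADITIONAL = {
    "卢": "盧",
    "陈": "陳",
    "统": "統",
    "计": "計",
    "处": "處",
}


def normalize(text):
    """Map every simplified character to its traditional form."""
    return "".join(SIMPLIFIED_TO_TRADITIONAL.get(ch, ch) for ch in text)


def search(index, query):
    """Return headings that contain the query, whichever script was typed."""
    key = normalize(query)
    return [heading for heading in index if key in normalize(heading)]


# Headings taken from the 7xx fields shown in the slides.
index = ["盧敦基", "陳培桂", "周嬰"]
```

With this normalization, `search(index, "卢敦基")` and `search(index, "盧敦基")` find the same heading.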

  23. Support of traditional vs. simplified Chinese character mapping (cont’d)

  24. Export options • Options to export records in: • text • MARC21 or • MARC21 XML • Swapping of 1xx and 7xx fields
     From:
       100 1 $aChen, Peigui, $dju ren 1846
       700 1 $a陳培桂, $d舉人1846
     To:
       100 1 $a陳培桂, $d舉人1846
       700 1 $aChen, Peigui, $dju ren 1846
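The 1xx/7xx swap on export can be sketched in a few lines. This is a simplified illustration with a hypothetical function name (`swap_1xx_7xx`); it assumes fields are (tag, content) pairs and that the 1xx and 7xx tags pair up by their last two digits (100 with 700, 110 with 710).

```python
def swap_1xx_7xx(fields):
    """Exchange 1xx and 7xx tags, then return the fields in tag order."""
    swapped = []
    for tag, content in fields:
        if tag.startswith("1"):
            tag = "7" + tag[1:]      # established heading becomes alternative
        elif tag.startswith("7"):
            tag = "1" + tag[1:]      # alternative becomes established heading
        swapped.append((tag, content))
    # sorted() is stable, so fields sharing a tag keep their relative order.
    return sorted(swapped, key=lambda f: f[0])


# The example from the slide.
original = [
    ("100", "1 $aChen, Peigui, $dju ren 1846"),
    ("700", "1 $a陳培桂, $d舉人1846"),
]
exported = swap_1xx_7xx(original)
```

Applying the function twice returns the original field order, since the swap is its own inverse.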

  25. Support of Z39.50 protocol & full integration with INNOPAC library system

  26. One stop search • Inspired by VIAF & the LEAF Project • Search across multiple authority files concurrently • HKCAN, Chinese Authority Name Database (Taiwan), LC Authority File, National Library of China
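Searching several authority files concurrently can be sketched with a thread pool. Everything here is a stand-in: the in-memory sources, `make_source` and `one_stop_search` are hypothetical names, and a real implementation would query each remote database (for example over Z39.50) instead of filtering a local list.

```python
from concurrent.futures import ThreadPoolExecutor


def make_source(name, headings):
    """Build a stub authority file that matches by substring."""
    def search(query):
        return [h for h in headings if query in h]
    return name, search


# Stub sources with sample headings; not real data from these files.
SOURCES = dict([
    make_source("HKCAN", ["Zhou, Ying, 17th cent.", "Lu, Dunji"]),
    make_source("LC Authority File", ["Zhou, Ying, 17th cent."]),
])


def one_stop_search(query, sources=SOURCES):
    """Send the same query to every source at once and collect the hits."""
    with ThreadPoolExecutor(max_workers=len(sources)) as pool:
        futures = {name: pool.submit(fn, query) for name, fn in sources.items()}
        return {name: future.result() for name, future in futures.items()}
```

The result is a dictionary keyed by source name, so a caller can show the hits from each authority file side by side.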

  27. One stop search (cont’d)

  28. One stop search (cont’d) • Coordinating Committee on Chinese Name Authority was set up in 2003 • Members • JULAC-HKCAN (Hong Kong) • CALIS (China Academic Library & Information System) • NLC (National Library of China) • Agreements • Chinese name entries should follow the internationally recommended format • Share each other’s authority databases • Develop authority databases with Unicode encoding

  29. Latest statistics • Over 123,000 records in HKCAN database • Original records created by HKCAN Workgroup • Over 73,800 records (60%) • Records from Library of Congress, enhanced by HKCAN Workgroup • Over 49,200 records (40%)

  30. Latest statistics (cont’d) • Breakdown by type of authority record • Personal Names: 85,000 records • Corporate Names: 15,000 records • Conference Names: 1,000 records • Uniform Titles: 22,000 records • Total: over 123,000 records

  31. Difficulties encountered • Character-mapping • from EACC to Unicode • Library of Congress EACC to UTF-8 mapping tables http://www.loc.gov/marc/specifications/specchareacc.html; Unihan database http://www.unicode.org/charts/unihan.html; or INNOPAC UTF-8 mapping table? • Task Force of UTF-8 mapping in Hong Kong • Diacritics • Searching and display • Diacritics mapping table in INNOPAC library system
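One common way to make searching insensitive to diacritics is to fold them away before indexing and querying. The sketch below uses Unicode decomposition from Python's standard library; it is an assumption about how such folding could work, not the INNOPAC diacritics mapping table, and `fold_diacritics` is a hypothetical name.

```python
import unicodedata


def fold_diacritics(text):
    """Strip combining marks so that 'Lü' and 'Lu' index identically."""
    decomposed = unicodedata.normalize("NFD", text)   # split base + marks
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))
```

Folding is applied to both the stored heading and the search term; the original form with diacritics is still kept for display. Chinese characters pass through unchanged.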

  32. Future development & further cooperation • Database in practice: develop an editing module and a duplicate-checking function • Promote sharing of existing resources among libraries in Hong Kong, Taiwan & Mainland China • Enhance the effectiveness of Chinese authority work among libraries on an international scale

  33. Thank you! lbjoanpg@cityu.edu.hk