Cicling event t he chinese english stat xfer system a look under the hood
This presentation is the property of its rightful owner.
Sponsored Links
1 / 7

CICLing “Event”: T he Chinese-English Stat-XFER System – A Look Under the Hood PowerPoint PPT Presentation


  • 80 Views
  • Uploaded on
  • Presentation posted in: General

CICLing “Event”: T he Chinese-English Stat-XFER System – A Look Under the Hood. Alon Lavie Language Technologies Institute Carnegie Mellon University. Chinese-English System. Developed over past year under DARPA/GALE funding (within IBM-led “Rosetta” team)

Download Presentation

CICLing “Event”: T he Chinese-English Stat-XFER System – A Look Under the Hood

An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -

Presentation Transcript


Cicling event t he chinese english stat xfer system a look under the hood

CICLing “Event”:The Chinese-English Stat-XFER System – A Look Under the Hood

AlonLavie

Language Technologies Institute

Carnegie Mellon University


Chinese english system

Chinese-English System

  • Developed over past year under DARPA/GALE funding (within IBM-led “Rosetta” team)

  • Participated in recent NIST MT-08 Evaluation

  • Large-scale broad-coverage system

  • Integrates large manual resources with automatically extracted resources

  • Current performance-level is still inferior to state-of-the-art phrase-based systems

CICLing-2008


Chinese english system1

Chinese-English System

  • Lexical Resources:

    • Manual Lexicons (base forms):

      • LDC, ADSO, Wiki

      • Total number of entries: 1.07 million

    • Automatically acquired from parallel data:

      • Approx 5 million sentences LDC/GALE data

      • Filtered down to phrases < 10 words in length

      • Full formed

      • Total number of entries: 2.67 million

CICLing-2008


Chinese english system2

Chinese-English System

  • Transfer Rules:

    • 61 manually developed transfer rules

    • High-accuracy rules extracted from manually word-aligned parallel data

CICLing-2008


Translation example

Translation Example

  • SrcSent 3澳洲是与北韩有邦交的少数国家之一。

  • Gloss: Australia is with north korea have diplomatic relations DE few country world

  • Reference: Australia is one of the few countries that have diplomatic relations with North Korea.

  • Translation:Australia is one of the few countries that has diplomatic relations with north korea .

  • Overall: -5.77439, Prob: -2.58631, Rules: -0.66874, TransSGT: -2.58646, TransTGS: -1.52858, Frag: -0.0413927, Length: -0.127525, Words: 11,15

  • ( 0 10 "Australia is one of the few countries that has diplomatic relations with north korea" -5.66505 "澳洲 是 与 北韩 有 邦交 的 少数国家之一 " "(S1,1124731 (S,1157857 (NP,2 (NB,1 (LDC_N,1267 'Australia') ) ) (VP,1046077 (MISC_V,1 'is') (NP,1077875 (LITERAL 'one') (LITERAL 'of') (NP,1045537 (NP,1017929 (NP,1 (LITERAL 'the') (NUMNB,2 (LDC_NUM,420 'few') (NB,1 (WIKI_N,62230 'countries') ) ) ) (LITERAL 'that') (VP,1021811 (LITERAL 'has') (FBIS_NP,11916 'diplomatic relations') ) ) (FBIS_PP,84791 'with north korea') ) ) ) ) ) ")

  • ( 10 11 "." -11.9549 "。" "(MISC_PUNC,20 '.')")

CICLing-2008


Example syntactic lexical phrases

Example: Syntactic Lexical Phrases

  • (LDC_N,1267 'Australia')

  • (WIKI_N,62230 'countries')

  • (FBIS_NP,11916 'diplomatic relations')

  • (FBIS_PP,84791 'with north korea')

CICLing-2008


Example xfer rules

Example: XFER Rules

;;SL::(2,4) 对 台 贸易

;;TL::(3,5) trade to taiwan

;;Score::22

{NP,1045537}

NP::NP [PP NP ] -> [NP PP ]

((*score* 0.916666666666667)

(X2::Y1)

(X1::Y2))

;;SL::(2,7) 直接提到 伟 哥 的 广告

;;TL::(1,7) commercials that directly mention the name viagra

;;Score::5

{NP,1017929}

NP::NP [VP "的" NP ] -> [NP "that" VP ]

((*score* 0.111111111111111)

(X3::Y1)

(X1::Y3))

;;SL::(4,14) 有 一 至 多 个 高 新 技术项目 或 产品

;;TL::(3,14) has one or more new , high level technology projects or products

;;Score::4

{VP,1021811}

VP::VP ["有" NP ] -> ["has" NP ]

((*score* 0.1)

(X2::Y2))

CICLing-2008


  • Login