1 / 9

A Deterministic Co-reference System with Rich Syntactic Features and Semantic Knowledge

A Deterministic Co-reference System with Rich Syntactic Features and Semantic Knowledge. Heeyoung Lee & Sudarshan Rangarajan Collaborators : Karthik Raghunathan under the guidance of Mihai Surdeanu, Nate Chambers & Dan Jurafsky. The Problem. boolean isCoreferent(Mention A, Mention B).

erobitaille
Download Presentation

A Deterministic Co-reference System with Rich Syntactic Features and Semantic Knowledge

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. A Deterministic Co-reference System with Rich Syntactic Features and Semantic Knowledge Heeyoung Lee & Sudarshan Rangarajan Collaborators : Karthik Raghunathan under the guidance of Mihai Surdeanu, Nate Chambers & Dan Jurafsky

  2. The Problem • boolean isCoreferent(Mention A, Mention B) ‘More important to the future of 8mm is Sony's success in the $2.3 billion camcorder market. The Japanese company already has 12% of the total camcorder market, ranking it third behind the RCA and Panasonic brands.’  • isCoreferent(‘Sony’, ‘The Japanese Company’) : TRUE • isCoreferent(‘The Japanese Company’, ‘it’) : TRUE • isCoreferent(‘Sony’, ‘it’) : TRUE • isCoreferent(‘it’, ‘camcorder market’) : FALSE • isCoreferent(‘it’, ‘RCA’) : FALSE

  3. Baseline System • Simple Co-reference Resolution with Rich Syntactic and Semantic Features, by Aria Haghighi & Dan Klein (EMNLP 2009) • Deterministic, single-pass, constraint-based system • Included Syntactic salience & Agreement constraint checking. • Lack of Semantic Knowledge in decision making. ‘President Bush and his colleague had different opinions. However, the person who has the right to make the final decision is the president.’

  4. Preliminary Error Analysis Corpora for Error Analysis : MUC-6 (Train Set); and for Experiments : MUC-6 & ACE

  5. Simple Knowledge Extraction System (SKES) Seed & Mention Pairs Yield Semantic Patterns Yield Metrics used to refine pattern yield

  6. Construct passes • Sort decision features – Highest precision first.

  7. Multi-pass Coreference System • Deterministic, multi-pass, constraint based system. • Decisions based on more confident mention pairs first. • Further decisions based on previously accumulated knowledge about mentions. ‘President Bush and his colleague had different opinions. However, the person who has the right to make the final decision is the president.’

  8. Result • Multi-pass system is more sensitive to error propagation -> need high precision passes. • Higher precision, but lower recall and F1. • Needs more passes to increase recall -> Future work ( & Co-reference decision Re-ranker)

  9. Questions? Thank You!

More Related