1 / 19

Semantic Annotation in SALSA

SALSA The Sa arbrücken L exical S emantics A nnotation & Acquisition Project Aljoscha Burchardt, Katrin Erk, Anette Frank, Andrea Kowalski, Sebastian Pado, Manfred Pinkal. Semantic Annotation in SALSA. Manual semantic annotation of

albin
Download Presentation

Semantic Annotation in SALSA

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. SALSAThe Saarbrücken Lexical Semantics Annotation & Acquisition Project Aljoscha Burchardt, Katrin Erk, Anette Frank, Andrea Kowalski, Sebastian Pado,Manfred Pinkal

  2. Semantic Annotation in SALSA • Manual semantic annotation of • 0.8 million words of syntactically annotated German newspaper text (TIGER Corpus, Releases 1, 2) • with frames and frame elements (Berkeley FrameNet Database), staying as close as possible to the Berkeley FrameNet database

  3. SALSA: What's special? • SALSA is about German • Cross-lingual divergencies?

  4. Cross-lingual Divergencies • Convincing cross-lingual portability results (ED) in general • Adaptation necessary because of • Inappropriate granularity of distinctions between FEs • Missing FEs • (Rare cases of) inappropriate granularity of frames

  5. SALSA: What's special? • SALSA is about German • Cross-lingual divergencies? • Corpus-driven lexicon development through exhaustive full-text annotation • Difficult cases • Incompleteness of Berkeley FrameNet

  6. Difficult cases • Metaphors • Support Verb Constructions • Idioms

  7. Difficult phenomena: Some Figures

  8. SALSA corpus: Release I • Total size of 20.000 annotated instances • Consistent annotation through different verification steps • All occurrences/readings of > 400 German verbal predicates (different frequency bands) • Scheduled for Summer 2006

  9. The SALTO Annotation Tool

  10. SALSA II: Automatic Annotation and Acquisition • Fred, Rosy, and Shalmaneser: A tool-chain for shallow semantic analysis  Talk by Katrin and Sebastian

  11. SALSA II: Automation • Fred, Rosy, and Shalmaneser: A tool-chain for shallow semantic analysis  Talk by Katrin and Sebastian • The Detour System (through WordNet to FrameNet)  Talk by Anette and Al

  12. Fred & Rosy Fred, Detour & Rosy

  13. SALSAII: Automation • Fred, Rosy, and Shalmaneser: A tool-chain for shallow semantic analysis  Talk by Katrin and Sebastian • The Detour System (through WordNet to FrameNet)  Talk by Anette and Al • Cross-lingual projection of frame-semantic information  Katrin and Sebastian

  14. Cross-lingual Projection

  15. SALSAII: Automation & Application • Fred, Rosy, and Shalmaneser: A tool-chain for shallow semantic analysis  Talk by Katrin and Sebastian • The Detour System (through WordNet to FrameNet)  Talk by Anette and Al • Cross-lingual projection of frame-semantic information  Katrin and Sebastian • Textual Entailment (RTE)  Anette and Al

  16. h: Aki Kaurismäki directed a film. t: In 1983, Aki Kaurismäki directed his first full-time feature.

  17. Grammatically related h: Aki Kaurismäki directed a film. WordNet related t: In 1983, Aki Kaurismäki directed his first full-time feature.

  18. SALSA: Future Work • Bottstrapping frame information by data expansion techniques • Linking lexical semantic resourcs with upper-model ontologies • Analysis of non-compositional phenomena • A worked-out semantic lexicon • Application to textual entailment

More Related