W ord s ense d isambiguation
This presentation is the property of its rightful owner.
Sponsored Links
1 / 21

W ord S ense D isambiguation PowerPoint PPT Presentation


  • 62 Views
  • Uploaded on
  • Presentation posted in: General

W ord S ense D isambiguation. By Mahmood Soltani Tehran University 2009/12/24. Agenda. Introduction Knowledge source Lexical Learned World WSD Approaches Knowledge-base Curpos-base Unsupervised Approaches Supervised Approaches Some approach in detail.

Download Presentation

W ord S ense D isambiguation

An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -

Presentation Transcript


W ord s ense d isambiguation

Word Sense Disambiguation

By Mahmood Soltani

Tehran University

2009/12/24


Agenda

Agenda

  • Introduction

  • Knowledge source

    • Lexical

    • Learned World

  • WSD Approaches

    • Knowledge-base

    • Curpos-base

      • Unsupervised Approaches

      • Supervised Approaches

  • Some approach in detail

  • WSD 12/24/09 Tehran University


Word sense disambiguation

Introduction

  • Defining word sense :

    • The phenomenon of lexical ambiguity is traditionally subdivided into polysemy and homonymy

      • Polysemy: one work having several related meaning.

      • Homonymy: two words have the same lexical form but different etymoloies and unrelated meaning.

    • In dictionaries, each word is listed with a number of discrete senses and subsenses. possibly dierent from dictionary to dictionary

    • The first step involved in the task of WSD is the determination of dierent senses for all words in the text to be disambiguated, the sense inventory

  • WSD 12/24/09 Tehran University


Introduction cont

Introduction (cont)

  • What is word sense disambiguation?

    • WSD refers to a task that automatically assigns a sense, selected from a set of pre-defined word senses to an instance of a polysemous word in a particular context.

    • identify the correct sense of an ambiguous word in a sentence.

    • determine which of the senses of an ambiguous word is invoked in a particular use of the word.

  • WSD 12/24/09 Tehran University


Introduction cont1

Introduction (cont)

  • word sense disambiguation step:

    • Step 1: pre-defines sense

      • A list of sense such as those found in everyday dictionaries

      • A group of feature, catagories, or associated words

      • An entry in transfer dictionary which includes translations in another language

    • Step 2 : assignment of words to senses.

      • The context of the word to be disambiguation

      • External knowledge

  • WSD 12/24/09 Tehran University


Introduction cont2

Introduction (cont)

  • Where is the use of WSD?

    Sense disambiguation is an“intermediate task” ,which is not an end in itself, but rather is necessary at one level or another to accomplish most natural language processing tasks.

    • Machine translation

    • Information retrieval

    • Speech processing

    • Text processing (spelling detection)

  • WSD 12/24/09 Tehran University


Introduction cont3

Introduction (cont)

  • Conceptual Model of WSD

    • WSD is the matching of sense knowledge and word context.

    • Sense knowledge can either be lexical knowledge defined in dictionaries, or world knowledge learned from training corpora.

  • WSD 12/24/09 Tehran University


Word sense disambiguation

Knowledge sources

  • Lexical Knowledge

    • Lexical knowledge is usually released with a dictionary. It can be either symbolic, or empirical. It is the foundation of unsupervised WSD approaches.

  • Learned World Knowledge

    • World knowledge is too complex or trivial to be verbalized completely. So it is a smart strategy to automatically acquire world knowledge from the context of training corpora on demand by machine learning techniques

  • Trend

    • Use the interaction of multiple knowledge sources to approach WSD.

  • WSD 12/24/09 Tehran University


Lexical knowledge

Lexical Knowledge

  • Sense frequency

    • Usage frequency of each sense of a word.

  • Sense gloss

    • Sense definition and examples By counting common words between the gloss and the context of the target word, we can nai

  • Concept Tree

    • Represent the related concepts of the target in the form of semantic network as is done by WordNet.

  • WSD 12/24/09 Tehran University


Lexical knowledge cont

Lexical Knowledge (cont)

  • Selectional Restrictions

    • Syntactic and semantic restric

  • Subject Code

    • Refers to the category to which one sense of the target word belongs.

  • Part of Speech (POS)

    • POS is associated with a subset of the word senses in both WordNet and LDOCE. That is, given the POS of the target, we may fully or partially disambiguate its sense(Stevenson & Wilks, 2001).

  • WSD 12/24/09 Tehran University


Learnd world knowledge

Learnd world Knowledge

  • Domain-specific Knowledge

    • Like selectional restrictions, it is the semantic restriction placed on the use of each sense of the target word.

    • The restriction is more specific.

  • Parallel Corpora

    • Also called bilingual corpora, one serving as primary language, and the other as a secondary language.

    • Using some third-party software packages, we can align the major words (verb and noun) between two languages.

    • Because the translation process implies that aligned pair words share the same sense or concept, we can use this information to sense the major words in the primary language (Bhattacharya et al. 2004).

  • WSD 12/24/09 Tehran University


Early wsd work in mt

Early WSD work in MT

  • The first attempts at automated sense disambiguation were made in the context of machine translation

    • Weaver (1949) : outlines the basis of an approach to WSD.

    • Reifler’s (1955) “semantic coincidences” between a word and its context,the role of syntactic relations, was also recognized

    • Weaver’s (1949) Memorandum discusses the role of the domain in sense disambiguation

    • Oswald,1952, 1957; Oettinger, 1955; etc :micro-glossaries contain only the meaning of a given word relevant for texts in a particular domain

    • Several researchers attempted to devise an “interlingua” based on logical and mathematical principles

    • ….

  • without large-scale resources most of these ideas remained untested and to large extent, forgotten until several decades later.

  • WSD 12/24/09 Tehran University


Wsd approaches

WSD Approaches

  • there are three ways to approach the problem of WSD:

    • a knowledge-based approach, which uses an explicit lexicon (machine readable dictionary (MRD), thesaurus) or ontology (e.g. WordNet).

    • Corpus-based disambiguation, where the relevant information about word senses is gathered from training on a large corpus.

    • Hybrid approach combining aspects of both of the forementioned methodologies

  • WSD 12/24/09 Tehran University


Knowledge based approach

Knowledge-based approach

  • WSD systems building on the information contained in MRDs use the available material in various ways:

    • Lesk (1986) was the first to use dictionary denitions to WSD: counts overlapping content words in the sense denitions & in the denitions of context words occurring nearby

    • Yarowsky (1992), the sense of a word is dened as its category in Roget's International Thesaurus Yarowsky (1992), thesense of a word is dened as its category in Roget's International Thesaurus

    • WordNet includes various potential sources of information

    • Leacock et al. (1998) employ WordNet to counter data sparseness.

  • WSD 12/24/09 Tehran University


Corpus based approach

Corpus-based approach

  • A corpus-based approach extracts information on word senses from a large annotated data collection.

    • Distributional information about an ambiguous word refers to the frequency distribution of its senses

    • collocational or co-occurrence information

    • part-of-speech

  • WSD 12/24/09 Tehran University


Corpus based approach cont

Corpus-based approach(cont)

  • There are two possible approaches to corpus-based WSD systems:

    • Supervisedapproaches use annotated training data and basically amount to a classication task

    • Unsupervised algorithmsare applied to raw text material and annotated data is only needed for evaluation They correspond to a clustering task rather than a classication.

    • Bootstrapping , looks like supervised approaches, but it needs only a few seeds instead of a large number of training examples

  • WSD 12/24/09 Tehran University


Corpus based approach cont1

Corpus-based approach(cont)

  • Supervised

    • exemplar-based: the k-nearest neighbor technique has been employed most

    • Rule-base :use algorithms, e.g. decision lists, which search for discriminatory features in the training data

    • Probabilistic-base :use of dierent probabilistic classiers. Despite its relative simplicity, naive Bayes has been frequently applied

  • WSD 12/24/09 Tehran University


Corpus based approach cont2

Corpus-based approach(cont)

  • Unsupervised

    • cluster the contexts of an ambiguous word into a number of groups and discriminate between them without labeling them. A clear disadvantage is that, so far, the performance of unsupervised systems lies a lot lower than that of supervised systems

  • WSD 12/24/09 Tehran University


Naive bayes classifier

Naive bayes classifier

Training 

 Test

  • WSD 12/24/09 Tehran University


Word sense disambiguation

K-NN

Training 

 Test

  • WSD 12/24/09 Tehran University


Word sense disambiguation

Any Question?

  • WSD 12/24/09 Tehran University


  • Login