Word Sense Disambiguation. Foundations of Statistical NLP Chapter 7. 2002. 1. 18. Kyung-Hee Sung. Contents. Methodological Preliminaries Supervised Disambiguation Bayesian classification / An information-theoretic approach
Foundations of Statistical NLP
2002. 1. 18.
Ex) Butter may be used as noun, or as a verb.
← using Bayes’s rule
← P(c) is constant for all senses
← computed by MLE
← The number of occurrences of sense sk
with collocation fm
1.Initialize the parameters of the model μ randomly.
Compute the log likelihood of the corpus C
2. While l(C|μ) is improving repeat:
(a) E-step. Estimatehik
← Naive Bayes assumption
(b) M-step. Re-estimate the parameters P(vj|sk) and P(sk) by way of MLE
← for ten experiments
with different initial