Minimally Supervised Morphological Analysis by Multimodal Alignment. David Yarowsky and Richard Wicentowski. Introduction. The Algorithm capable of inducing inflectional morphological analyses of regular and highly irregular forms.
Consider this task as three steps:
VBDLemma Alignment by Frequency Similarity
Clustering inflectional variants of verbs (e.g. sipped, sipping, and sip).
isLemma Alignment by Context Similarity cont.
initially set to (0.5,0.6,1.0,0.98)
The goal is to generalize a mapping function via a generative probabilistic model.
P(inflection | root,suffix,POS)=P(stemchange | root,suffix,POS)
P(solidified | solidify, +ed, VBD)
= P(yi | solidify, +ed, VBD)
≈ 1P(yi | ify, +ed)
+ (1-1)( 2P(yi | fy, +ed)
+ (1-2)( 3P(yi | y, +ed)
+ (1-3)( 4P(yi | +ed)
+ (1-4) P(yi)
POS can be deleted