Sciweavers

CICLING
2009
Springer

Semi-supervised Clustering for Word Instances and Its Effect on Word Sense Disambiguation

15 years 10 hour ago
Semi-supervised Clustering for Word Instances and Its Effect on Word Sense Disambiguation
We propose a supervised word sense disambiguation (WSD) system that uses features obtained from clustering results of word instances. Our approach is novel in that we employ semi-supervised clustering that controls the fluctuation of the centroid of a cluster, and we select seed instances by considering the frequency distribution of word senses and exclude outliers when we introduce "must-link" constraints between seed instances. In addition, we improve the supervised WSD accuracy by using features computed from word instances in clusters generated by the semi-supervised clustering. Experimental results show that these features are effective in improving WSD accuracy.
Kazunari Sugiyama, Manabu Okumura
Added 24 Nov 2009
Updated 24 Nov 2009
Type Conference
Year 2009
Where CICLING
Authors Kazunari Sugiyama, Manabu Okumura
Comments (0)