Sciweavers

197 search results - page 13 / 40
» Word Clustering and Disambiguation Based on Co-occurrence Da...
Sort
View
ACL
2001
13 years 9 months ago
Multi-Class Composite N-gram Language Model for Spoken Language Processing Using Multiple Word Clusters
In this paper, a new language model, the Multi-Class Composite N-gram, is proposed to avoid a data sparseness problem for spoken language in that it is difficult to collect traini...
Hirofumi Yamamoto, Shuntaro Isogai, Yoshinori Sagi...
SIGIR
2005
ACM
14 years 1 months ago
Noun sense induction using web search results
This paper presents an algorithm for unsupervised noun sense induction, based on clustering of Web search results. The algorithm does not utilize labeled training instances or any...
Goldee Udani, Shachi Dave, Anthony Davis, Tim Sibl...
BMCBI
2005
251views more  BMCBI 2005»
13 years 7 months ago
Contextual weighting for Support Vector Machines in literature mining: an application to gene versus protein name disambiguation
Background: The ability to distinguish between genes and proteins is essential for understanding biological text. Support Vector Machines (SVMs) have been proven to be very effici...
Tapio Pahikkala, Filip Ginter, Jorma Boberg, Jouni...
EMNLP
2010
13 years 5 months ago
A New Approach to Lexical Disambiguation of Arabic Text
We describe a model for the lexical analysis of Arabic text, using the lists of alternatives supplied by a broad-coverage morphological analyzer, SAMA, which include stable lemma ...
Rushin Shah, Paramveer S. Dhillon, Mark Liberman, ...
ACL
1993
13 years 8 months ago
Contextual Word Similarity and Estimation from Sparse Data
In recent years there is much interest in word cooccurrence relations, such as n-grams, verb-object combinations, or cooccurrence within a limited context. This paper discusses ho...
Ido Dagan, Shaul Marcus, Shaul Markovitch