Sciweavers

210 search results - page 7 / 42
» Distributional Clustering of English Words
BMCBI
2006
13 years 7 months ago
Asymptotic behaviour and optimal word size for exact and approximate word matches between random sequences
Background: The number of k-words shared between two sequences is a simple and efficient alignment-free sequence comparison method. This statistic, D2, has been used for the cluste...
Sylvain Forêt, Miriam R. Kantorovitz, Conrad...
COLING
1992
13 years 8 months ago
A Freely Available Wide Coverage Morphological Analyzer for English
This paper presents a morphological lexicon for English that handles more than 317000 inflected forms derived from over 90000 stems. The lexicon is available in two formats. The fi...
Daniel Karp, Yves Schabes, Martin Zaidel, Dania Eg...
ACL
1998
13 years 8 months ago
Automatic Retrieval and Clustering of Similar Words
Bootstrapping semantics from text is one of the greatest challenges in natural language learning. We first define a word similarity measure based on the distributional pattern of ...
Dekang Lin
KDD
2002
ACM
14 years 7 months ago
Enhanced word clustering for hierarchical text classification
In this paper we propose a new information-theoretic divisive algorithm for word clustering applied to text classification. In previous work, such "distributional clustering"...
Inderjit S. Dhillon, Subramanyam Mallela, Rahul Ku...
LREC
2010
13 years 8 months ago
Construction of a Benchmark Data Set for Cross-lingual Word Sense Disambiguation
Given the recent trend to evaluate the performance of word sense disambiguation systems in a more application-oriented set-up, we report on the construction of a multilingual benc...
Els Lefever, Véronique Hoste