Sciweavers

304 search results - page 33 / 61
» A Semi-Supervised Document Clustering Technique for Informat...
Sort
View
KDD
2004
ACM
150views Data Mining» more  KDD 2004»
14 years 9 months ago
A framework for ontology-driven subspace clustering
Traditional clustering is a descriptive task that seeks to identify homogeneous groups of objects based on the values of their attributes. While domain knowledge is always the bes...
Jinze Liu, Wei Wang 0010, Jiong Yang
ICTAI
2007
IEEE
14 years 3 months ago
Document Length Normalization by Statistical Regression
The document-length normalization problem has been widely studied in the field of Information Retrieval. The Cosine Normalization [2], the Maximum tf Normalization [1] and the By...
Sylvain Lamprier, Tassadit Amghar, Bernard Levrat,...
ICONIP
2009
13 years 6 months ago
Text Mining with an Augmented Version of the Bisecting K-Means Algorithm
There is an ever increasing number of electronic documents available today and the task of organizing and categorizing this ever growing corpus of electronic documents has become t...
Yutaro Hatagami, Toshihiko Matsuka
CIKM
2007
Springer
13 years 10 months ago
Proximity-based document representation for named entity retrieval
One aspect in which retrieving named entities is different from retrieving documents is that the items to be retrieved – persons, locations, organizations – are only indirect...
Desislava Petkova, W. Bruce Croft
KDD
1999
ACM
99views Data Mining» more  KDD 1999»
14 years 1 months ago
On the Merits of Building Categorization Systems by Supervised Clustering
This paper investigates the use of supervised clustering in order to create sets of categories for classi cation of documents. We use information from a pre-existing taxonomy in o...
Charu C. Aggarwal, Stephen C. Gates, Philip S. Yu