Sciweavers

285 search results - page 7 / 57
» Ontology-based Text Document Clustering
Sort
View
WEBI
2005
Springer
15 years 9 months ago
A Semi-Supervised Document Clustering Algorithm Based on EM
Document clustering is a very hard task in Automatic Text Processing since it requires to extract regular patterns from a document collection without a priori knowledge on the cat...
Leonardo Rigutini, Marco Maggini
ICDE
2007
IEEE
211views Database» more  ICDE 2007»
15 years 10 months ago
Document Representation and Dimension Reduction for Text Clustering
Increasingly large text datasets and the high dimensionality associated with natural language create a great challenge in text mining. In this research, a systematic study is cond...
M. Mahdi Shafiei, Singer Wang, Roger Zhang, Evange...
ESANN
2007
15 years 5 months ago
Kernel PCA based clustering for inducing features in text categorization
We study dimensionality reduction or feature selection in text document categorization problem. We focus on the first step in building text categorization systems, that is the cho...
Zsolt Minier, Lehel Csató
CIKM
2008
Springer
15 years 6 months ago
Winnowing-based text clustering
We present an approach to document clustering based on winnowing fingerprints that achieved good values of effectiveness with considerable save in memory space and computation tim...
Javier Parapar, Alvaro Barreiro
CIKM
2004
Springer
15 years 9 months ago
Stemming and lemmatization in the clustering of finnish text documents
Under construction… Categories and Subject Descriptors H.3.3 [Information Storage and Retrieval]: Information Search and Retrieval – clustering. General Terms Algorithms, Expe...
Tuomo Korenius, Jorma Laurikkala, Kalervo Jär...