Sciweavers

2034 search results - page 12 / 407
» On the Comparability of Software Clustering Algorithms
ICCS
2009
Springer
14 years 2 months ago
Frequent Itemset Mining for Clustering Near Duplicate Web Documents
A vast number of documents on the Web have duplicates, which poses a challenge for developing efficient methods that compute clusters of similar documents. In this paper we use ...
Dmitry I. Ignatov, Sergei O. Kuznetsov
BIBE
2003
IEEE
14 years 1 months ago
A Repulsive Clustering Algorithm for Gene Expression Data
Facing the development of microarray technology, clustering is currently a leading technique for gene expression data analysis. In this paper, we propose a novel algorithm calle...
Chyun-Shin Cheng, Shiuan-Sz Wang
BMCBI
2010
13 years 7 months ago
DeltaProt: a software toolbox for comparative genomics
Background: Statistical bioinformatics is the study of biological data sets obtained by new micro-technologies by means of proper statistical methods. For a better understanding o...
Steinar Thorvaldsen, Tor Flå, Nils Willassen
EMNLP
2007
13 years 9 months ago
V-Measure: A Conditional Entropy-Based External Cluster Evaluation Measure
We present V-measure, an external entropy-based cluster evaluation measure. V-measure provides an elegant solution to many problems that affect previously defined cluster evaluatio...
Andrew Rosenberg, Julia Hirschberg
CORIA
2008
13 years 9 months ago
Involving Validity Indices in Document Clustering
The goal of any clustering algorithm is to find the optimal clustering solution with the optimal number of clusters. In order to evaluate a clustering solution, a number of validit...
Ahmad El Sayed, Hakim Hacid, Djamel A. Zighed