Sciweavers

652 search results - page 12 / 131
» Accelerated EM-based clustering of large data sets
Sort
View
VLDB
1998
ACM
312views Database» more  VLDB 1998»
14 years 3 days ago
WaveCluster: A Multi-Resolution Clustering Approach for Very Large Spatial Databases
Many applications require the management of spatial data. Clustering large spatial databases is an important problem which tries to find the densely populated regions in the featu...
Gholamhosein Sheikholeslami, Surojit Chatterjee, A...
BMCBI
2005
80views more  BMCBI 2005»
13 years 7 months ago
Sample phenotype clusters in high-density oligonucleotide microarray data sets are revealed using Isomap, a nonlinear algorithm
Background: Life processes are determined by the organism's genetic profile and multiple environmental variables. However the interaction between these factors is inherently ...
Kevin Dawson, Raymond L. Rodriguez, Wasyl Malyj
NIPS
2008
13 years 9 months ago
Measures of Clustering Quality: A Working Set of Axioms for Clustering
Aiming towards the development of a general clustering theory, we discuss abstract axiomatization for clustering. In this respect, we follow up on the work of Kleinberg, ([1]) tha...
Shai Ben-David, Margareta Ackerman
PRL
2002
67views more  PRL 2002»
13 years 7 months ago
A pseudo-nearest-neighbor approach for missing data recovery on Gaussian random data sets
Missing data handling is an important preparation step for most data discrimination or mining tasks. Inappropriate treatment of missing data may cause large errors or false result...
Xiaolu Huang, Qiuming Zhu
SIGMOD
1998
ACM
99views Database» more  SIGMOD 1998»
14 years 4 days ago
CURE: An Efficient Clustering Algorithm for Large Databases
Clustering, in data mining, is useful for discovering groups and identifying interesting distributions in the underlying data. Traditional clustering algorithms either favor clust...
Sudipto Guha, Rajeev Rastogi, Kyuseok Shim