Sciweavers

17390 search results - page 33 / 3478
» Distributed Data Clustering
Sort
View
CLUSTER
2004
IEEE
13 years 11 months ago
An evaluation of the close-to-files processor and data co-allocation policy in multiclusters
In multicluster systems, and more generally, in grids, jobs may require co-allocation, i.e., the simultaneous allocation of resources such as processors and input files in multipl...
Hashim H. Mohamed, Dick H. J. Epema
KDD
2003
ACM
210views Data Mining» more  KDD 2003»
14 years 7 months ago
Privacy-preserving k-means clustering over vertically partitioned data
Privacy and security concerns can prevent sharing of data, derailing data mining projects. Distributed knowledge discovery, if done correctly, can alleviate this problem. The key ...
Jaideep Vaidya, Chris Clifton
BMCBI
2007
128views more  BMCBI 2007»
13 years 7 months ago
Model order selection for bio-molecular data clustering
Background: Cluster analysis has been widely applied for investigating structure in bio-molecular data. A drawback of most clustering algorithms is that they cannot automatically ...
Alberto Bertoni, Giorgio Valentini
IDEAS
2006
IEEE
218views Database» more  IDEAS 2006»
14 years 1 months ago
PBIRCH: A Scalable Parallel Clustering algorithm for Incremental Data
We present a parallel version of BIRCH with the objective of enhancing the scalability without compromising on the quality of clustering. The incoming data is distributed in a cyc...
Ashwani Garg, Ashish Mangla, Neelima Gupta, Vasudh...
CVPR
2008
IEEE
14 years 9 months ago
Robust estimation of gaussian mixtures from noisy input data
We propose a variational bayes approach to the problem of robust estimation of gaussian mixtures from noisy input data. The proposed algorithm explicitly takes into account the un...
Shaobo Hou, Aphrodite Galata