Sciweavers

» Co-Scheduling of Computation and Data on Computer Clusters
SAC
2005
ACM
15 years 9 months ago
Rearranging data objects for efficient and stable clustering
When a partitional structure is derived from a data set using a data mining algorithm, it is not unusual to have a different set of outcomes when it runs with a different order of...
Gyesung Lee, Xindong Wu, Jinho Chon
SDM
2003
SIAM
15 years 5 months ago
Scalable, Balanced Model-based Clustering
This paper presents a general framework for adapting any generative (model-based) clustering algorithm to provide balanced solutions, i.e., clusters of comparable sizes. Partition...
Shi Zhong, Joydeep Ghosh
EMMCVPR
2001
Springer
15 years 8 months ago
Path Based Pairwise Data Clustering with Application to Texture Segmentation
Most cost function based clustering or partitioning methods measure the compactness of groups of data. In contrast to this picture of a point source in feature space, some data sou...
Bernd Fischer, Thomas Zöller, Joachim M. Buhm...
OSDI
2004
ACM
16 years 4 months ago
MapReduce: Simplified Data Processing on Large Clusters
MapReduce is a programming model and an associated implementation for processing and generating large data sets. Users specify a map function that processes a key/value pair to ge...
Jeffrey Dean, Sanjay Ghemawat
ICPR
2010
IEEE
15 years 7 months ago
CDP Mixture Models for Data Clustering
In Dirichlet process (DP) mixture models, the number of components is implicitly determined by the sampling parameters of the Dirichlet process. However, this kind of model usually...
Yangfeng Ji, Tong Lin, Hongbin Zha