Sciweavers

471 search results - page 42 / 95
» MapReduce: Simplified Data Processing on Large Clusters
ICDCS
2010
IEEE
14 years 27 days ago
A Spinning Join That Does Not Get Dizzy
As network infrastructures with 10 Gb/s bandwidth and beyond have become pervasive and as cost advantages of large commodity-machine clusters continue to increase, research and...
Philip Werner Frey, Romulo Goncalves, Martin L. Ke...
KDD
2008
ACM
14 years 9 months ago
Data mining using high performance data clouds: experimental studies using sector and sphere
We describe the design and implementation of a high performance cloud that we have used to archive, analyze and mine large distributed data sets. By a cloud, we mean an infrastruc...
Robert L. Grossman, Yunhong Gu
GIS
2008
ACM
13 years 10 months ago
Sparse terrain pyramids
Bintrees based on longest edge bisection and hierarchies of diamonds are popular multiresolution techniques on regularly sampled terrain datasets. In this work, we consider sparse...
Kenneth Weiss, Leila De Floriani
DATAMINE
2006
13 years 9 months ago
Scalable Clustering Algorithms with Balancing Constraints
Clustering methods for data-mining problems must be extremely scalable. In addition, several data mining applications demand that the clusters obtained be balanced, i.e., be of ap...
Arindam Banerjee, Joydeep Ghosh
AUSDM
2008
Springer
13 years 11 months ago
Graphics Hardware based Efficient and Scalable Fuzzy C-Means Clustering
The exceptional growth of graphics hardware in programmability and data processing speed in the past few years has fuelled extensive research in using it for general purpose compu...
S. A. Arul Shalom, Manoranjan Dash, Minh Tue