Sciweavers

» Accelerated EM-based clustering of large data sets
OSDI
2004
ACM
14 years 8 months ago
MapReduce: Simplified Data Processing on Large Clusters
MapReduce is a programming model and an associated implementation for processing and generating large data sets. Users specify a map function that processes a key/value pair to ge...
Jeffrey Dean, Sanjay Ghemawat
KDD
1998
ACM
14 years 3 days ago
Scaling Clustering Algorithms to Large Databases
Practical clustering algorithms require multiple data scans to achieve convergence. For large databases, these scans become prohibitively expensive. We present a scalable clusteri...
Paul S. Bradley, Usama M. Fayyad, Cory Reina
IPPS
2003
IEEE
14 years 1 months ago
Parallel ROLAP Data Cube Construction On Shared-Nothing Multiprocessors
The pre-computation of data cubes is critical to improving the response time of On-Line Analytical Processing (OLAP) systems and can be instrumental in accelerating data mining tas...
Ying Chen, Frank K. H. A. Dehne, Todd Eavis, Andre...
ICCV
2009
IEEE
15 years 26 days ago
Mode-Detection via Median-Shift
Median-shift is a mode seeking algorithm that relies on computing the median of local neighborhoods, instead of the mean. We further combine median-shift with Locality Sensitive...
Lior Shapira, Shai Avidan, Ariel Shamir
CLUSTER
2006
IEEE
14 years 1 months ago
An Iteration Aware Multidimensional Data Distribution Prototype for Computing Clusters
Disk and network latency must be taken into account when applying parallel computing to large multidimensional datasets because they can hinder performance by reducing the rate at...
Baoqiang Yan, Philip J. Rhodes