Search Sciweavers | Sciweavers

652 search results

» Accelerated EM-based clustering of large data sets


OSDI
2004
ACM

Operating System

MapReduce: Simplified Data Processing on Large Clusters

16 years 7 months ago

Download labs.google.com

MapReduce is a programming model and an associated implementation for processing and generating large data sets. Users specify a map function that processes a key/value pair to ge...

Jeffrey Dean, Sanjay Ghemawat


KDD
1998
ACM

Data Mining

Scaling Clustering Algorithms to Large Databases

15 years 11 months ago

Download www.aaai.org

Practical clustering algorithms require multiple data scans to achieve convergence. For large databases, these scans become prohibitively expensive. We present a scalable clusteri...

Paul S. Bradley, Usama M. Fayyad, Cory Reina


IPPS
2003
IEEE

Distributed And Parallel Com...

Parallel ROLAP Data Cube Construction On Shared-Nothing Multiprocessors

16 years 22 days ago

Download people.scs.carleton.ca

The pre-computation of data cubes is critical to improving the response time of On-Line Analytical Processing (OLAP) systems and can be instrumental in accelerating data mining tas...

Ying Chen, Frank K. H. A. Dehne, Todd Eavis, Andre...


ICCV
2009
IEEE

Computer Vision

Mode-Detection via Median-Shift

17 years 13 days ago

Download www.cs.tau.ac.il

Median-shift is a mode seeking algorithm that relies on computing the median of local neighborhoods, instead of the mean. We further combine median-shift with Locality Sensitive...

Lior Shapira, Shai Avidan, Ariel Shamir


CLUSTER
2006
IEEE

Distributed And Parallel Com...

An Iteration Aware Multidimensional Data Distribution Prototype for Computing Clusters

16 years 1 months ago

Download www.cs.unh.edu

Disk and network latency must be taken into account when applying parallel computing to large multidimensional datasets because they can hinder performance by reducing the rate at...

Baoqiang Yan, Philip J. Rhodes
