Sciweavers

61 search results - page 7 / 13
» Massively Parallel Data Analysis with PACTs on Nephele
Sort
View
ISAAC
2005
Springer
113views Algorithms» more  ISAAC 2005»
14 years 6 days ago
A Simple Optimal Randomized Algorithm for Sorting on the PDM
Abstract. The Parallel Disks Model (PDM) has been proposed to alleviate the I/O bottleneck that arises in the processing of massive data sets. Sorting has been extensively studied ...
Sanguthevar Rajasekaran, Sandeep Sen
ICPP
2009
IEEE
14 years 1 months ago
Generalizing k-Betweenness Centrality Using Short Paths and a Parallel Multithreaded Implementation
We present a new parallel algorithm that extends and generalizes the traditional graph analysis metric of betweenness centrality to include additional non-shortest paths according...
Karl Jiang, David Ediger, David A. Bader
SIGKDD
2000
237views more  SIGKDD 2000»
13 years 6 months ago
The UCI KDD Archive of Large Data Sets for Data Mining Research and Experimentation
Advances in data collection and storage have allowed organizations to create massive, complex and heterogeneous databases, which have stymied traditional methods of data analysis....
Stephen D. Bay, Dennis F. Kibler, Michael J. Pazza...
IPPS
2006
IEEE
14 years 22 days ago
A study of MPI performance analysis tools on Blue Gene/L
Applications on todays massively parallel supercomputers rely on performance analysis tools to guide them toward scalable performance on thousands of processors. However, conventi...
I-Hsin Chung, Robert Walkup, Hui-Fang Wen, Hao Yu
SIGMOD
2012
ACM
288views Database» more  SIGMOD 2012»
11 years 9 months ago
Exploiting MapReduce-based similarity joins
Cloud enabled systems have become a crucial component to efficiently process and analyze massive amounts of data. One of the key data processing and analysis operations is the Sim...
Yasin N. Silva, Jason M. Reed