Sciweavers

179 search results - page 12 / 36
» Parallel k h-Means Clustering for Large Data Sets
Sort
View
CCGRID
2003
IEEE
14 years 4 months ago
Evaluation of the inter-cluster data transfer on Grid environment
High-performance peer-to-peer transfer between clusters will be fundamental technology base for various Grid middleware, such as large-scale data transfer in DataGrid settings, or...
Shoji Ogura, Satoshi Matsuoka, Hidemoto Nakada
LCPC
2007
Springer
14 years 5 months ago
Supporting Huge Address Spaces in a Virtual Machine for Java on a Cluster
Abstract. To solve problems that require far more memory than a single machine can supply, data can be swapped to disk in some manner, it can be compressed, and/or the memory of mu...
Ronald Veldema, Michael Philippsen
ICDT
2001
ACM
147views Database» more  ICDT 2001»
14 years 3 months ago
Parallelizing the Data Cube
This paper presents a general methodology for the efficient parallelization of existing data cube construction algorithms. We describe two different partitioning strategies, one f...
Frank K. H. A. Dehne, Todd Eavis, Susanne E. Hambr...
VLDB
2002
ACM
154views Database» more  VLDB 2002»
13 years 10 months ago
I/O-Conscious Data Preparation for Large-Scale Web Search Engines
Given that commercial search engines cover billions of web pages, efficiently managing the corresponding volumes of disk-resident data needed to answer user queries quickly is a f...
Maxim Lifantsev, Tzi-cker Chiueh
AUSAI
2003
Springer
14 years 4 months ago
Efficiently Mining Frequent Patterns from Dense Datasets Using a Cluster of Computers
Efficient mining of frequent patterns from large databases has been an active area of research since it is the most expensive step in association rules mining. In this paper, we pr...
Yudho Giri Sucahyo, Raj P. Gopalan, Amit Rudra