Sciweavers

471 search results - page 10 / 95
» MapReduce: Simplified Data Processing on Large Clusters
Sort
View
IJIT
2004
13 years 9 months ago
IMDC: An Image-Mapped Data Clustering Technique for Large Datasets
In this paper, we present a new algorithm for clustering data in large datasets using image processing approaches. First the dataset is mapped into a binary image plane. The synthe...
Faruq A. Al-Omari, Nabeel I. Al-Fayoumi
USENIX
2008
13 years 10 months ago
Free Factories: Unified Infrastructure for Data Intensive Web Services
We introduce the Free Factory, a platform for deploying data-intensive web services using small clusters of commodity hardware and free software. Independently administered virtua...
Alexander Wait Zaranek, Tom Clegg, Ward Vandewege,...
DOLAP
2010
ACM
13 years 5 months ago
Relational versus non-relational database systems for data warehousing
Relational database systems have been the dominating technology to manage and analyze large data warehouses. Moreover, the ER model, the standard in database design, has a close r...
Carlos Ordonez, Il-Yeol Song, Carlos Garcia-Alvara...
PVLDB
2008
124views more  PVLDB 2008»
13 years 7 months ago
Scheduling shared scans of large data files
We study how best to schedule scans of large data files, in the presence of many simultaneous requests to a common set of files. The objective is to maximize the overall rate of p...
Parag Agrawal, Daniel Kifer, Christopher Olston
CORR
2010
Springer
63views Education» more  CORR 2010»
13 years 7 months ago
Simplified Distributed Programming with Micro Objects
Developing large-scale distributed applications can be a daunting task. One of the problems that application developers are confronted with is the distribution of data, processes, ...
Jan-Mark S. Wams, Maarten van Steen