Sciweavers

839 search results - page 71 / 168
» Ensemble-index: a new approach to indexing large databases
Sort
View
IPPS
2006
IEEE
14 years 1 months ago
Design and analysis of a multi-dimensional data sampling service for large scale data analysis applications
Sampling is a widely used technique to increase efficiency in database and data mining applications operating on large dataset. In this paper we present a scalable sampling imple...
Xi Zhang, Tahsin M. Kurç, Joel H. Saltz, Sr...
EDBT
2004
ACM
234views Database» more  EDBT 2004»
14 years 8 months ago
A Condensation Approach to Privacy Preserving Data Mining
In recent years, privacy preserving data mining has become an important problem because of the large amount of personal data which is tracked by many business applications. In many...
Charu C. Aggarwal, Philip S. Yu
AISS
2010
146views more  AISS 2010»
13 years 2 months ago
Zatara, the Plug-in-able Eventually Consistent Distributed Database
With the proliferation of the computer Cloud, new software delivery methods were created. In order to build software to fit into one of these models, a scalable, easy to deploy st...
Bogdan Carstoiu, Dorin Carstoiu
SIGMOD
2004
ACM
189views Database» more  SIGMOD 2004»
14 years 8 months ago
A Holistic Paradigm for Large Scale Schema Matching
Schema matching is a critical problem for integrating heterogeneous information sources. Traditionally, the problem of matching multiple schemas has essentially relied on finding ...
Bin He, Kevin Chen-Chuan Chang
MSS
2003
IEEE
108views Hardware» more  MSS 2003»
14 years 1 months ago
Effective Management of Hierarchical Storage Using Two Levels of Data Clustering
When data resides on tertiary storage, clustering is the key to achieving high retrieval performance. However, a straightforward approach to clustering massive amounts of data on ...
Ratko Orlandic