Sciweavers

SIGMOD
2001
ACM
14 years 11 months ago
Reconciling Schemas of Disparate Data Sources: A Machine-Learning Approach
AnHai Doan, Pedro Domingos, Alon Y. Halevy
SIGMOD
2001
ACM
14 years 11 months ago
Independence is Good: Dependency-Based Histogram Synopses for High-Dimensional Data
Approximating the joint data distribution of a multi-dimensional data set through a compact and accurate histogram synopsis is a fundamental problem arising in numerous practical ...
Amol Deshpande, Minos N. Garofalakis, Rajeev Rasto...
SIGMOD
2001
ACM
14 years 11 months ago
Improving Index Performance through Prefetching
This paper proposes and evaluates Prefetching B+-Trees (pB+-Trees), which use prefetching to accelerate two important operations on B+-Tree indices: searches and range scans. To ...
Shimin Chen, Phillip B. Gibbons, Todd C. Mowry
SIGMOD
2001
ACM
14 years 11 months ago
Query Optimization In Compressed Database Systems
Over the last decades, improvements in CPU speed have outpaced improvements in main memory and disk access rates by orders of magnitude, enabling the use of data compression techn...
Zhiyuan Chen, Johannes Gehrke, Flip Korn
SIGMOD
2001
ACM
14 years 11 months ago
A Robust, Optimization-Based Approach for Approximate Answering of Aggregate Queries
The ability to approximately answer aggregation queries accurately and efficiently is of great benefit for decision support and data mining tools. In contrast to previous sampling...
Surajit Chaudhuri, Gautam Das, Vivek R. Narasayya
SIGMOD
2001
ACM
14 years 11 months ago
Models and Languages for Describing and Discovering E-Services
Fabio Casati, Ming-Chien Shan
SIGMOD
2001
ACM
14 years 11 months ago
Enabling Dynamic Content Caching for Database-Driven Web Sites
K. Selçuk Candan, Wen-Syan Li, Qiong Luo, W...
SIGMOD
2001
ACM
14 years 11 months ago
STHoles: A Multidimensional Workload-Aware Histogram
Nicolas Bruno, Surajit Chaudhuri, Luis Gravano
SIGMOD
2001
ACM
14 years 11 months ago
Data Bubbles: Quality Preserving Performance Boosting for Hierarchical Clustering
In this paper, we investigate how to scale hierarchical clustering methods (such as OPTICS) to extremely large databases by utilizing data compression methods (such as BIRCH or ra...
Markus M. Breunig, Hans-Peter Kriegel, Peer Krö...
SIGMOD
2001
ACM
14 years 11 months ago
Automatic Segmentation of Text into Structured Records
In this paper we present a method for automatically segmenting unformatted text records into structured elements. Several useful data sources today are human-generated as continuo...
Vinayak R. Borkar, Kaustubh Deshmukh, Sunita Saraw...