Sciweavers

SIGMOD
2010
ACM
158views Database» more  SIGMOD 2010»
14 years 19 days ago
A case for online mixed workload processing
Jens Krüger, Christian Tinnefeld, Martin Grun...
SIGMOD
2010
ACM
154views Database» more  SIGMOD 2010»
14 years 19 days ago
Unbiased estimation of size and other aggregates over hidden web databases
Many websites provide restrictive form-like interfaces which allow users to execute search queries on the underlying hidden databases. In this paper, we consider the problem of es...
Arjun Dasgupta, Xin Jin, Bradley Jewell, Nan Zhang...
SIGMOD
2010
ACM
167views Database» more  SIGMOD 2010»
14 years 19 days ago
Efficient querying and maintenance of network provenance at internet-scale
Network accountability, forensic analysis, and failure diagnosis are becoming increasingly important for network management and security. Such capabilities often utilize network p...
Wenchao Zhou, Micah Sherr, Tao Tao, Xiaozhou Li, B...
SIGMOD
2010
ACM
269views Database» more  SIGMOD 2010»
14 years 19 days ago
MapDupReducer: detecting near duplicates over massive datasets
Categories and Subject Descriptors General Terms Keywords
Chaokun Wang, Jianmin Wang, Xuemin Lin, Wei Wang, ...
SIGMOD
2010
ACM
224views Database» more  SIGMOD 2010»
14 years 19 days ago
GDR: a system for guided data repair
Improving data quality is a time-consuming, labor-intensive and often domain specific operation. Existing data repair approaches are either fully automated or not efficient in int...
Mohamed Yakout, Ahmed K. Elmagarmid, Jennifer Nevi...
SIGMOD
2010
ACM
200views Database» more  SIGMOD 2010»
14 years 19 days ago
QRelX: generating meaningful queries that provide cardinality assurance
In many business and consumer applications, queries have cardinality constraints. However, current database systems provide minimal support for cardinality assurance. Consequently...
Manasi Vartak, Venkatesh Raghavan, Elke A. Rundens...
SIGMOD
2010
ACM
250views Database» more  SIGMOD 2010»
14 years 19 days ago
Expressive and flexible access to web-extracted data: a keyword-based structured query language
Automated extraction of structured data from Web sources often leads to large heterogeneous knowledge bases (KB), with data and schema items numbering in the hundreds of thousands...
Jeffrey Pound, Ihab F. Ilyas, Grant E. Weddell
SIGMOD
2010
ACM
208views Database» more  SIGMOD 2010»
14 years 19 days ago
Hierarchically organized skew-tolerant histograms for geographic data objects
Histograms have been widely used for fast estimation of query result sizes in query optimization. In this paper, we propose a new histogram method, called the Skew-Tolerant Histog...
Yohan J. Roh, Jae Ho Kim, Yon Dohn Chung, Jin Hyun...
SIGMOD
2010
ACM
436views Database» more  SIGMOD 2010»
14 years 19 days ago
Pluggable personal data servers
An increasing amount of personal data is automatically gathered on servers by administrations, hospitals and private companies while several security surveys highlight the failure...
Nicolas Anciaux, Luc Bouganim, Yanli Guo, Philippe...