Sciweavers

ICDE
2010
IEEE
Visualizing cost-based XQuery optimization
Developing a full-fledged cost-based XQuery optimizer is a fairly complex task. Nowadays, there is little knowledge concerning suitable cost formulae and optimization ...
Andreas M. Weiner, Theo Härder, Renato Olivei...
ICDE
2010
IEEE
GenerIE: Information extraction using database queries
Information extraction systems are traditionally implemented as a pipeline of special-purpose processing modules. A major drawback of such an approach is that whenever a new ex...
Luis Tari, Phan Huy Tu, Jörg Hakenberg, Yi Ch...
ICDE
2010
IEEE
ProbClean: A probabilistic duplicate detection system
One of the most prominent data quality problems is the existence of duplicate records. Current data cleaning systems usually produce one clean instance (repair) of the input da...
George Beskales, Mohamed A. Soliman, Ihab F. Ilyas...
ICDE
2010
IEEE
Discovery-driven graph summarization
Large graph datasets are ubiquitous in many domains, including social networking and biology. Graph summarization techniques are crucial in such domains as they can ass...
Ning Zhang, Yuanyuan Tian, Jignesh M. Patel
ICDE
2010
IEEE
On supporting effective web extraction
Commercial tuple extraction systems have enjoyed some success in extracting tuples by regarding HTML pages as tree structures and exploiting XPath queries to find attributes of t...
Wook-Shin Han, Wooseong Kwak, Hwanjo Yu
ICDE
2010
IEEE
Mining mutation chains in biological sequences
The increase in infectious disease outbreaks has led to a need for new research to better understand the disease’s origins, epidemiological features and pathogenicity caused b...
Chang Sheng, Wynne Hsu, Mong-Li Lee, Joo Chuan Ton...
ICDE
2010
IEEE
K nearest neighbor queries and kNN-Joins in large relational databases (almost) for free
Finding the k nearest neighbors (kNN) of a query point, or of a set of query points (kNN-Join), is a fundamental problem in many application domains. Many previous efforts to solve...
Bin Yao, Feifei Li, Piyush Kumar
ICDE
2010
IEEE
Optimal load shedding with aggregates and mining queries
To cope with bursty arrivals of high-volume data, a DSMS has to shed load while minimizing the degradation of Quality of Service (QoS). In this paper, we show that this problem...
Barzan Mozafari, Carlo Zaniolo
ICDE
2010
IEEE
X-data: Generating test data for killing SQL mutants
Checking if an SQL query has been written correctly is not an easy task. Formal verification is not applicable, since it is based on comparing a specification with an...
Bhanu Pratap Gupta, Devang Vira, S. Sudarshan
ICDE
2010
IEEE
Rule profiling for query optimizers and their implications
Many modern optimizers use a transformation rule based framework. While there has been a lot of work on identifying new transformation rules, there has been little work focused ...
Surajit Chaudhuri, Leo Giakoumakis, Vivek R. Naras...