Sciweavers

SIGMOD
2008
ACM
LearnPADS: automatic tool generation from ad hoc data
In this demonstration, we will present LEARNPADS, a fully automatic system for generating ad hoc data processing tools. When presented with a collection of ad hoc data, the system...
Kathleen Fisher, David Walker, Kenny Qili Zhu
SIGMOD
2008
ACM
Discovering bucket orders from full rankings
Discovering a bucket order B from a collection of possibly noisy full rankings is a fundamental problem that relates to various applications involving rankings. Informally, a buck...
Jianlin Feng, Qiong Fang, Wilfred Ng
SIGMOD
2008
ACM
Paths to stardom: calibrating the potential of a peer-based data management system
As peer-to-peer (P2P) networks become more familiar to the database community, intense interest has built up in using their scalability and resilience properties to scale database...
Mihai Lupu, Beng Chin Ooi, Y. C. Tay
SIGMOD
2008
ACM
Relational-style XML query
We study the problem of querying relational data embedded in XML. Relational data can be represented by various tree structures in XML. However, current XML query methods, such as...
Taro L. Saito, Shinichi Morishita
SIGMOD
2008
ACM
Pay-as-you-go user feedback for dataspace systems
A primary challenge to large-scale data integration is creating semantic equivalences between elements from different data sources that correspond to the same real-world entity or...
Shawn R. Jeffery, Michael J. Franklin, Alon Y. Hal...
SIGMOD
2008
ACM
Efficient provenance storage
Scientific workflow systems are increasingly used to automate complex data analyses, largely due to their benefits over traditional approaches for workflow design, optimization, a...
Adriane Chapman, H. V. Jagadish, Prakash Ramanan
SIGMOD
2008
ACM
DiMaC: a system for cleaning disguised missing data
In some applications such as filling in a customer information form on the web, some missing values may not be explicitly represented as such, but instead appear as potentially va...
Ming Hua, Jian Pei
SIGMOD
2008
ACM
Query answering techniques on uncertain and probabilistic data: tutorial summary
Uncertain data are inherent in some important applications, such as environmental surveillance, market analysis, and quantitative economics research. Due to the importance of thos...
Jian Pei, Ming Hua, Yufei Tao, Xuemin Lin
SIGMOD
2008
ACM
Orion 2.0: native support for uncertain data
Orion is a state-of-the-art uncertain database management system with built-in support for probabilistic data as first class data types. In contrast to other uncertain databases, ...
Sarvjeet Singh, Chris Mayfield, Sagar Mittal, Suni...
SIGMOD
2008
ACM
Query-based partitioning of documents and indexes for information lifecycle management
Regulations require businesses to archive many electronic documents for extended periods of time. Given the sheer volume of documents and the response time requirements, documents...
Soumyadeb Mitra, Marianne Winslett, Windsor W. Hsu