Sciweavers

341 search results - page 2 / 69
Data Cleaning and Semantic Improvement in Biological Databas...
ADC
2007
Springer
125 views, Database
14 years 2 months ago
Building a Disordered Protein Database: A Case Study in Managing Biological Data
A huge diversity of biological databases is available via the Internet, but many of these databases have been developed in an ad hoc manner rather than in accordance with any data...
Arran D. Stewart, Xiuzhen Zhang
SIGMOD
2010
ACM
211 views, Database
14 years 1 month ago
ERACER: a database approach for statistical inference and data cleaning
Real-world databases often contain syntactic and semantic errors, in spite of integrity constraints and other safety measures incorporated into modern DBMSs. We present ERACER, an...
Chris Mayfield, Jennifer Neville, Sunil Prabhakar
CORR
2010
Springer
139 views, Education
13 years 3 months ago
Data Cleaning and Query Answering with Matching Dependencies and Matching Functions
Matching dependencies were recently introduced as declarative rules for data cleaning and entity resolution. Enforcing a matching dependency on a database instance identifies the ...
Leopoldo E. Bertossi, Solmaz Kolahi, Laks V. S. La...
PVLDB
2008
136 views
13 years 8 months ago
Keyword query cleaning
Unlike traditional database queries, keyword queries do not adhere to predefined syntax and are often dirty with irrelevant words from natural languages. This makes accurate and e...
Ken Q. Pu, Xiaohui Yu
ICDE
2006
IEEE
141 views, Database
14 years 10 months ago
Clean Answers over Dirty Databases: A Probabilistic Approach
The detection of duplicate tuples, corresponding to the same real-world entity, is an important task in data integration and cleaning. While many techniques exist to identify such...
Ariel Fuxman, Periklis Andritsos, Renée J. ...