Sciweavers

248 search results - page 30 / 50
» Entity Identification in Database Integration
Sort
View
KDD
2006
ACM
166views Data Mining» more  KDD 2006»
14 years 7 months ago
Anonymizing sequential releases
An organization makes a new release as new information become available, releases a tailored view for each data request, releases sensitive information and identifying information...
Ke Wang, Benjamin C. M. Fung
SIGMOD
2007
ACM
183views Database» more  SIGMOD 2007»
14 years 7 months ago
Leveraging aggregate constraints for deduplication
We show that aggregate constraints (as opposed to pairwise constraints) that often arise when integrating multiple sources of data, can be leveraged to enhance the quality of dedu...
Surajit Chaudhuri, Anish Das Sarma, Venkatesh Gant...
ADBIS
2008
Springer
142views Database» more  ADBIS 2008»
14 years 1 months ago
Evaluating Performance and Quality of XML-Based Similarity Joins
A similarity join correlating fragments in XML documents, which are similar in structure and content, can be used as the core algorithm to support data cleaning and data integratio...
Leonardo Ribeiro, Theo Härder
BTW
2009
Springer
150views Database» more  BTW 2009»
13 years 10 months ago
The Frontiers of Data Programmability
: Simplifying data programming is a core mission of data management research. The issue at stake is to help engineers build efficient and robust data-centric applications. The fron...
Sergey Melnik
ICDE
2006
IEEE
141views Database» more  ICDE 2006»
14 years 8 months ago
Clean Answers over Dirty Databases: A Probabilistic Approach
The detection of duplicate tuples, corresponding to the same real-world entity, is an important task in data integration and cleaning. While many techniques exist to identify such...
Ariel Fuxman, Periklis Andritsos, Renée J. ...