Sciweavers

367 search results - page 4 / 74
» Duplicate detection in probabilistic data
Sort
View
ICDE
2000
IEEE
102views Database» more  ICDE 2000»
15 years 3 days ago
Data Redundancy and Duplicate Detection in Spatial Join Processing
Jens-Peter Dittrich, Bernhard Seeger
VLDB
2002
ACM
110views Database» more  VLDB 2002»
13 years 10 months ago
Eliminating Fuzzy Duplicates in Data Warehouses
The duplicate elimination problem of detecting multiple tuples, which describe the same real world entity, is an important data cleaning problem. Previous domain independent solut...
Rohit Ananthakrishna, Surajit Chaudhuri, Venkatesh...
DEXA
2004
Springer
147views Database» more  DEXA 2004»
14 years 4 months ago
A Flexible Fuzzy Expert System for Fuzzy Duplicate Elimination in Data Cleaning
Data cleaning deals with the detection and removal of errors and inconsistencies in data, gathered from distributed sources. This process is essential for drawing correct conclusio...
Hamid Haidarian Shahri, Ahmad Abdollahzadeh Barfor...
ICDE
2005
IEEE
108views Database» more  ICDE 2005»
14 years 4 months ago
Robust Identification of Fuzzy Duplicates
Detecting and eliminating fuzzy duplicates is a critical data cleaning task that is required by many applications. Fuzzy duplicates are multiple seemingly distinct tuples which re...
Surajit Chaudhuri, Venkatesh Ganti, Rajeev Motwani
KDD
2003
ACM
214views Data Mining» more  KDD 2003»
14 years 11 months ago
Adaptive duplicate detection using learnable string similarity measures
The problem of identifying approximately duplicate records in databases is an essential step for data cleaning and data integration processes. Most existing approaches have relied...
Mikhail Bilenko, Raymond J. Mooney