Sciweavers

ICDE
2012
IEEE

Incremental Detection of Inconsistencies in Distributed Data

12 years 3 months ago
Incremental Detection of Inconsistencies in Distributed Data
—This paper investigates the problem of incremental detection of errors in distributed data. Given a distributed database D, a set Σ of conditional functional dependencies (CFDs), the set V of violations of the CFDs in D, and updates ∆D to D, it is to find, with minimum data shipment, changes ∆V to V in response to ∆D. The need for the study is evident since real-life data is often dirty, distributed and is frequently updated. It is often prohibitively expensive to recompute the entire set of violations when D is updated. We show that the incremental detection problem is NP-complete for D partitioned either vertically or horizontally, even when Σ and D are fixed. Nevertheless, we show that it is bounded and better still, actually optimal: there exist algorithms to detect errors such that their computational cost and data shipment are both linear in the size of ∆D and ∆V, independent of the size of the database D. We provide such incremental algorithms for vertically par...
Wenfei Fan, Jianzhong Li, Nan Tang, Wenyuan Yu
Added 28 Sep 2012
Updated 28 Sep 2012
Type Journal
Year 2012
Where ICDE
Authors Wenfei Fan, Jianzhong Li, Nan Tang, Wenyuan Yu
Comments (0)