Sciweavers

DOCENG
2006
ACM

Fast and simple XML tree differencing by sequence alignment

With the advent of XML we have seen a renewed interest in methods for computing the difference between trees. Methods that include heuristic elements play an important role in practical applications due to the inherent complexity of the problem. We present a method for differencing XML as ordered trees based on mapping the problem to the domain of sequence alignment, applying simple and efficient heuristics in this domain, and transforming back to the tree domain. Our approach provides a method to quickly compute changes that are meaningful transformations on the XML tree level, and includes subtree move as a primitive operation. We evaluate the feasibility of our approach and benchmark it against a selection of existing differencing tools. The results show our approach to be feasible and to have the potential to perform on par with tools of a more complex design in terms of both output size and execution time.
Categories and Subject Descriptors I.7.1 [Document and Text Processing]...
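The general idea in the abstract (flatten the ordered tree into a sequence, then align the sequences) can be illustrated with a minimal sketch. This is not the authors' algorithm: it substitutes Python's standard `difflib.SequenceMatcher` for their alignment heuristics, does not detect subtree moves, and omits the step that transforms the result back into tree-level operations. The function names `flatten` and `sequence_diff` are hypothetical, chosen only for this illustration.

```python
import difflib
import xml.etree.ElementTree as ET


def flatten(elem):
    """Serialize an element and its descendants into a flat token list.

    Tokens appear in document order, so an ordered tree becomes a sequence.
    Every token is a hashable tuple, which difflib requires.
    """
    tokens = [("start", elem.tag, tuple(sorted(elem.attrib.items())))]
    if elem.text and elem.text.strip():
        tokens.append(("text", elem.text.strip()))
    for child in elem:
        tokens.extend(flatten(child))
        # Text following a child element belongs to the parent's content.
        if child.tail and child.tail.strip():
            tokens.append(("text", child.tail.strip()))
    tokens.append(("end", elem.tag))
    return tokens


def sequence_diff(old_xml, new_xml):
    """Align the two token sequences and return only the differing runs.

    Each result is (operation, old_tokens, new_tokens), where operation is
    one of difflib's opcode tags: "replace", "delete", or "insert".
    """
    old = flatten(ET.fromstring(old_xml))
    new = flatten(ET.fromstring(new_xml))
    # autojunk=False disables difflib's popularity heuristic, which could
    # otherwise ignore frequently repeated tokens in long documents.
    matcher = difflib.SequenceMatcher(a=old, b=new, autojunk=False)
    return [
        (tag, old[i1:i2], new[j1:j2])
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"
    ]


if __name__ == "__main__":
    for change in sequence_diff("<a><b>x</b><c/></a>", "<a><b>y</b><c/></a>"):
        print(change)
```

A sequence-level result like this still has to be mapped back to well-formed tree edits before it is usable as an XML diff, which is the part of the pipeline the sketch leaves out.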
Tancred Lindholm, Jaakko Kangasharju, Sasu Tarkoma
Added 13 Jun 2010
Updated 13 Jun 2010
Type Conference
Year 2006
Where DOCENG
Authors Tancred Lindholm, Jaakko Kangasharju, Sasu Tarkoma