ATreeGrep: Approximate Searching in Unordered Trees

15 years 11 months ago

Download web.njit.edu

An unordered labeled tree is a tree in which each node has a string label and the parent-child relationship is signiﬁcant, but the order among siblings is unimportant. This paper presents an approach to the nearest neighbor search problem for these trees. Given a database ¤ of unordered labeled trees and a query tree ¥ , the goal is to ﬁnd those trees in ¤ that “approximately” contain ¥ . Our approach is based on storing the paths of the trees in a sufﬁx array and then counting the number of mismatching paths between the query tree and a data tree. To speed up a search, we use a hash-based technique to ﬁlter out unqualiﬁed data trees at an early stage of the search. Experimental results obtained by running our techniques on phylogenetic trees and synthetic data demonstrate the good performance of the proposed approach. We also discuss the use of our work in XML and scientiﬁc database management.

Dennis Shasha, Jason Tsong-Li Wang, Huiyuan Shan,

Real-time Traffic

Data Trees | Database | Query Tree | SSDBM 2002 | Unordered Labeled Trees |

claim paper

Post Info
More Details (n/a)

Added	16 Jul 2010
Updated	16 Jul 2010
Type	Conference
Year	2002
Where	SSDBM
Authors	Dennis Shasha, Jason Tsong-Li Wang, Huiyuan Shan, Kaizhong Zhang

Comments (0)

Sciweavers

ATreeGrep: Approximate Searching in Unordered Trees

Data Trees | Database | Query Tree | SSDBM 2002 | Unordered Labeled Trees |

Explore & Download

Productivity Tools

Sciweavers