Sciweavers

EDBT
2006
ACM

DPTree: A Distributed Pattern Tree Index for Partial-Match Queries in Peer-to-Peer Networks

14 years 11 months ago
DPTree: A Distributed Pattern Tree Index for Partial-Match Queries in Peer-to-Peer Networks
Abstract. Partial-match queries return data items that contain a subset of the query keywords and order the results based on the statistical properties of the matched keywords. They are essential for information retrieval on large document repositories. However, most current peer-topeer networks for information retrieval are based on distributed hashing and as such cannot support partial-match queries efficiently. In this paper, we describe an efficient and scalable technique to support partialmatch queries on peer-to-peer networks. We observe that the combinations of keywords in the queries are only a small subset of all possible combinations of the keywords in the documents. Therefore, we propose a distributed index structure, called a distributed pattern tree (DPTree), to record frequent query patterns, i.e., combinations of keywords, learnt from the query history at each node in the network. Using this index, a query can identify its best matching patterns quickly and data lookup c...
Dyce Jing Zhao, Dik Lun Lee, Qiong Luo
Added 08 Dec 2009
Updated 08 Dec 2009
Type Conference
Year 2006
Where EDBT
Authors Dyce Jing Zhao, Dik Lun Lee, Qiong Luo
Comments (0)