Aggregation of Document Frequencies in Unstructured P2P Networks

15 years 11 months ago

Download www.idi.ntnu.no

Peer-to-peer (P2P) systems have been recently proposed for providing search and information retrieval facilities over distributed data sources, including web data. Terms and their document frequencies are the main building blocks of retrieval and as such need to be computed, aggregated, and distributed throughout the system. This is a tedious task, as the local view of each peer may not reﬂect the global document collection, due to skewed document distributions. Moreover, central assembly of the total information is not feasible, due to the prohibitive cost of storage and maintenance, and also because of issues related to digital rights management. In this paper, we propose an eﬃcient approach for aggregating the document frequencies of carefully selected terms based on a hierarchical overlay network. To this end, we examine unsupervised feature selection techniques at the individual peer level, in order to identify only a limited set of the most important terms for aggregation. We...

Robert Neumayer, Christos Doulkeridis, Kjetil N&os

Real-time Traffic

Computer Science | Document Frequencies | Global Document Collection | Skewed Document Distributions | WISE 2009 |

claim paper

» Storage load balancing via local interactions among peers in unstructured P2P networks

» A Document Recommendation System Based on Clustering P2P Networks

» An Architecture for Hybrid P2P FreeText Search

» The SOWES approach to P2P web search using semantic overlays

» Adaptive Double Routing Indices Combining Effectiveness and Efficiency in P2P Systems

» Gossipbased Reputation Aggregation for Unstructured PeertoPeer Networks

» Multidimensional routing indices for efficient distributed query processing

» Active Peer to Peer

Post Info
More Details (n/a)

Added	08 Mar 2010
Updated	08 Mar 2010
Type	Conference
Year	2009
Where	WISE
Authors	Robert Neumayer, Christos Doulkeridis, Kjetil Nørvåg

Comments (0)

Sciweavers

Aggregation of Document Frequencies in Unstructured P2P Networks

Computer Science | Document Frequencies | Global Document Collection | Skewed Document Distributions | WISE 2009 |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers