IQN Routing: Integrating Quality and Novelty in P2P Querying and Ranking

16 years 6 months ago

Download lsirpeople.epfl.ch

Abstract. We consider a collaboration of peers autonomously crawling the Web. A pivotal issue when designing a peer-to-peer (P2P) Web search engine in this environment is query routing: selecting a small subset of (a potentially very large number of relevant) peers to contact to satisfy a keyword query. Existing approaches for query routing work well on disjoint data sets. However, naturally, the peers' data collections often highly overlap, as popular documents are highly crawled. Techniques for estimating the cardinality of the overlap between sets, designed for and incorporated into information retrieval engines are very much lacking. In this paper we present a comprehensive evaluation of appropriate overlap estimators, showing how they can be incorporated into an efficient, iterative approach to query routing, coined Integrated Quality Novelty (IQN). We propose to further enhance our approach using histograms, combining overlap estimation with the available score/ranking infor...

Sebastian Michel, Matthias Bender, Peter Triantafi

Real-time Traffic

Appropriate Overlap Estimators | Database | EDBT 2006 | Overlap Estimation | Query Routing Work |

claim paper

Added	08 Dec 2009
Updated	08 Dec 2009
Type	Conference
Year	2006
Where	EDBT
Authors	Sebastian Michel, Matthias Bender, Peter Triantafillou, Gerhard Weikum

Sciweavers

IQN Routing: Integrating Quality and Novelty in P2P Querying and Ranking

Appropriate Overlap Estimators | Database | EDBT 2006 | Overlap Estimation | Query Routing Work |

Explore & Download

Productivity Tools

Sciweavers