A Reproducible Benchmark for P2P Retrieval

16 years 19 days ago

Download people.mmci.uni-saarland.de

With the growing popularity of information retrieval (IR) in distributed systems and in particular P2P Web search, a huge number of protocols and prototypes have been introduced in the literature. However, nearly every paper considers a diﬀerent benchmark for its experimental evaluation, rendering their mutual comparison and the quantiﬁcation of performance improvements an impossible task. We present a standardized, general purpose benchmark for P2P IR systems that ﬁnally makes this possible. We start by presenting a detailed requirement analysis for such a standardized benchmark framework that allows for reproducible and comparable experimental setups without sacriﬁcing ﬂexibility to suit diﬀerent system models. We further suggest Wikipedia as a publicly-available and all-purpose document corpus and ﬁnally introduce a simple but yet ﬂexible clustering strategy that assigns the Wikipedia articles as documents to an arbitrary number of peers. After proposing a standardi...

Thomas Neumann, Matthias Bender, Sebastian Michel,

Real-time Traffic

EXPDB 2006 | General Purpose Benchmark | Information Management | P2P Web Search | Standardized Benchmark Framework |

claim paper

Added	13 Jun 2010
Updated	13 Jun 2010
Type	Conference
Year	2006
Where	EXPDB
Authors	Thomas Neumann, Matthias Bender, Sebastian Michel, Gerhard Weikum

Sciweavers

A Reproducible Benchmark for P2P Retrieval

EXPDB 2006 | General Purpose Benchmark | Information Management | P2P Web Search | Standardized Benchmark Framework |

Explore & Download

Productivity Tools

Sciweavers