Real Datasets for File-Sharing Peer-to-Peer Systems

16 years 28 days ago

Download web.jjay.cuny.edu

The fundamental drawback of unstructured peer-to-peer (P2P) networks is the ﬂooding-based query processing protocol that seriously limits their scalability. As a result, a signiﬁcant amount of research work has focused on designing eﬃcient search protocols that reduce the overall communication cost. What is lacking, however, is the availability of real data, regarding the exact content of users’ libraries and the queries that these users ask. Using trace-driven simulations will clearly generate more meaningful results and further illustrate the eﬃciency of a generic query processing protocol under a real-life scenario. Motivated by this fact, we developed a Gnutella-style probe and collected detailed data over a period of two months. They involve around 4,500 users and contain the exact ﬁles shared by each user, together with any available metadata (e.g., artist for songs) and information about the nodes (e.g., connection speed). We also collected the queries initiated by t...

Shen-Tat Goh, Panos Kalnis, Spiridon Bakiras, Kian

Real-time Traffic

DASFAA 2005 | Database | Overall Communication Cost | Query Processing Protocol | ﬂooding-based Query Processing |

claim paper

Post Info
More Details (n/a)

Added	24 Jun 2010
Updated	24 Jun 2010
Type	Conference
Year	2005
Where	DASFAA
Authors	Shen-Tat Goh, Panos Kalnis, Spiridon Bakiras, Kian-Lee Tan

Comments (0)

Sciweavers

Real Datasets for File-Sharing Peer-to-Peer Systems

DASFAA 2005 | Database | Overall Communication Cost | Query Processing Protocol | ﬂooding-based Query Processing |

Explore & Download

Productivity Tools

Sciweavers