Peer-to-peer (p2p) systems such as Gnutella and KaZaa are routinely used by millions of people for sharing music and many other files over the Internet, and they account for a significant portion of the Internet traffic. The p2p traffic can be broken down into two categories: protocol messages for maintaining and searching the overlay p2p network, and data messages for downloading data files. This paper makes the following two contributions. First, we present a study of the locality in the collective Gnutella query protocol messages being initiated and forwarded by servents within the same organization. Second, we propose a transparent query caching scheme to reduce the bandwidth consumed by p2p traffic going in and out of the gateway of an organization. Our locality measurements show that there is significant locality in the collective queries going through a gateway forwarded by servents behind that gateway, and the locality increases with the population of those servents. Ou...
Sunil Patro, Y. Charlie Hu