Sciweavers

P2P
2010
IEEE

PeerDedupe: Insights into the Peer-Assisted Sampling Deduplication

As the rapid growth of digital data drives a worldwide storage crisis, data deduplication is playing an increasingly prominent role in data storage. Motivated by the problems of mainstream server-side deduplication schemes, there has recently been a trend toward introducing peer-assisted methods into deduplication systems. However, the topic remains poorly understood and lacks thorough research. In this paper, we conduct an in-depth, quantitative investigation of peer-assisted deduplication. Through measurements, we observe that inter-peer duplication accounts for a large proportion of total duplication and exhibits strong peer locality. Based on these observations, we propose PeerDedupe, a novel peer-assisted sampling deduplication approach. Experiments show that PeerDedupe can remove over 98% of duplication with each peer coordinating with no more than 5 other peers, and that it requires much less server RAM than existing approaches.
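The abstract does not spell out the algorithm, so the following is only a rough illustration of how sampling-based, peer-assisted deduplication could work in general. It is not the authors' method. It assumes fixed-size chunking, SHA-1 chunk fingerprints, modulus-based fingerprint sampling, and a simple "top-k peers by sampled overlap" rule. All function names and parameters (`fingerprints`, `sample`, `pick_partners`, `duplicates_removed`, `chunk_size`, `modulus`, `k`) are hypothetical.

```python
import hashlib
from typing import Dict, List, Set


def fingerprints(data: bytes, chunk_size: int = 4) -> Set[str]:
    """Split data into fixed-size chunks and return their SHA-1 fingerprints.

    Assumption: fixed-size chunking; the paper may use a different scheme.
    """
    return {
        hashlib.sha1(data[i:i + chunk_size]).hexdigest()
        for i in range(0, len(data), chunk_size)
    }


def sample(fps: Set[str], modulus: int = 4) -> Set[str]:
    """Keep only fingerprints whose integer value is divisible by `modulus`.

    Every peer applies the same rule, so a chunk shared by two peers is
    either in both samples or in neither. modulus=1 keeps everything.
    """
    return {fp for fp in fps if int(fp, 16) % modulus == 0}


def pick_partners(me: Set[str], peers: Dict[str, Set[str]],
                  k: int = 5, modulus: int = 4) -> List[str]:
    """Rank peers by sampled fingerprint overlap and keep at most k of them.

    Assumption: a plain top-k rule, used here only to mirror the abstract's
    "no more than 5 other peers"; peers with zero sampled overlap are dropped.
    """
    my_sample = sample(me, modulus)
    scores = {name: len(my_sample & sample(fps, modulus))
              for name, fps in peers.items()}
    ranked = sorted(scores, key=lambda name: (-scores[name], name))
    return [name for name in ranked if scores[name] > 0][:k]


def duplicates_removed(me: Set[str], peers: Dict[str, Set[str]],
                       partners: List[str]) -> Set[str]:
    """Return the local chunks already held by at least one chosen partner."""
    held: Set[str] = set()
    for name in partners:
        held |= peers[name]
    return me & held


# Toy data: four local chunks, three remote peers.
local = fingerprints(b"aaaabbbbccccdddd")
remote = {
    "p1": fingerprints(b"aaaabbbbxxxx"),
    "p2": fingerprints(b"ccccyyyy"),
    "p3": fingerprints(b"zzzz"),
}
partners = pick_partners(local, remote, k=5, modulus=1)
removed = duplicates_removed(local, remote, partners)
```

With `modulus=1` nothing is sampled away, so the toy run selects `p1` and `p2` and finds three of the four local chunks duplicated. A larger modulus shrinks the sets that peers exchange, at the cost of a noisier overlap estimate.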
Yuanjian Xing, Zhenhua Li, Yafei Dai
Added 14 Feb 2011
Updated 14 Feb 2011
Type Journal
Year 2010
Where P2P
Authors Yuanjian Xing, Zhenhua Li, Yafei Dai