Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

175

ECIR
2010
Springer

119views Information Technology» more ECIR 2010»

Text Clustering for Peer-to-Peer Networks with Probabilistic Guarantees

15 years 8 months ago

Text Clustering for Peer-to-Peer Networks with Probabilistic Guarantees

Download www.l3s.de

Text clustering is an established technique for improving quality in information retrieval, for both centralized and distributed environments. However, for highly distributed environments, such as peer-topeer networks, current clustering algorithms fail to scale. Our algorithm for peer-to-peer clustering achieves high scalability by using a probabilistic approach for assigning documents to clusters. It enables a peer to compare each of its documents only with very few selected clusters, without significant loss of clustering quality. The algorithm offers probabilistic guarantees for the correctness of each document assignment to a cluster. Extensive experimental evaluation with up to 100000 peers and 1 million documents demonstrates the scalability and effectiveness of the algorithm.

Odysseas Papapetrou, Wolf Siberski, Norbert Fuhr

Real-time Traffic

Current Clustering Algorithms | Distributed Environments | ECIR 2010 | Information Technology | Text Clustering |

claim paper

Related Content

» DKS N k f A Family of Low Communication Scalable and FaultTolerant Infrastructures for P2P...

» A Probabilistic ClusteringProjection Model for Discrete Data

» ClusterBased Failure Detection Service for LargeScale Ad Hoc Wireless Network Applications

» Slingshot TimeCritical Multicast for Clustered Applications

» Efficient clustering algorithms for selforganizing wireless sensor networks

» Power balanced coveragetime optimization for clustered wireless sensor networks

» Coveragetime optimization for clustered wireless sensor networks a powerbalancing approach

» Topic evolution and social interactions how authors effect research

» Approximation algorithms for clustering uncertain data

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2010
Where	ECIR
Authors	Odysseas Papapetrou, Wolf Siberski, Norbert Fuhr

Comments (0)