Sciweavers

Scalability for Clustering Algorithms Revisited
KDD
1998
ACM
14 years 23 days ago
Scaling Clustering Algorithms to Large Databases
Practical clustering algorithms require multiple data scans to achieve convergence. For large databases, these scans become prohibitively expensive. We present a scalable clusteri...
Paul S. Bradley, Usama M. Fayyad, Cory Reina
ICCS
2005
Springer
14 years 2 months ago
Generating Parallel Algorithms for Cluster and Grid Computing
We revisit and use the dependence transformation method to generate parallel algorithms suitable for cluster and grid computing. We illustrate this method in two applications: to o...
Ulisses Kendi Hayashida, Kunio Okuda, Jairo Panett...
ECIR
2010
Springer
13 years 10 months ago
Text Clustering for Peer-to-Peer Networks with Probabilistic Guarantees
Text clustering is an established technique for improving quality in information retrieval, for both centralized and distributed environments. However, for highly distributed envir...
Odysseas Papapetrou, Wolf Siberski, Norbert Fuhr
EMNLP
2008
13 years 10 months ago
Scalable Language Processing Algorithms for the Masses: A Case Study in Computing Word Co-occurrence Matrices with MapReduce
This paper explores the challenge of scaling up language processing algorithms to increasingly large datasets. While cluster computing has been available in commercial environment...
Jimmy J. Lin
SDM
2007
SIAM
13 years 10 months ago
HP2PC: Scalable Hierarchically-Distributed Peer-to-Peer Clustering
In distributed data mining models, adopting a flat node distribution model can affect scalability. To address the problem of modularity, flexibility and scalability, we propose...
Khaled M. Hammouda, Mohamed S. Kamel