Sciweavers

JDCTA
2010

A New Agglomerative Hierarchical Clustering Algorithm Implementation based on the Map Reduce Framework

13 years 6 months ago
A New Agglomerative Hierarchical Clustering Algorithm Implementation based on the Map Reduce Framework
Text clustering is one of the difficult and hot research fields in the text mining research. Combing Map Reduce framework and the neuron initialization method of VPSOM (vector pressing SelfOrganizing Model) algorithm, a new text clustering algorithm is presented. It divides the large text vector dataset into data blocks, each of which then processed in different distributed data node of Map Reduce framework with agglomerative hierarchical clustering algorithm. The experiment results indicate that the improved algorithm has a higher efficiency and a better accuracy.
Hui Gao, Jun Jiang, Li She, Yan Fu
Added 19 May 2011
Updated 19 May 2011
Type Journal
Year 2010
Where JDCTA
Authors Hui Gao, Jun Jiang, Li She, Yan Fu
Comments (0)