Sciweavers

» Distributed Data Clustering
IPPS
2010
IEEE
13 years 5 months ago
Improving MapReduce performance through data placement in heterogeneous Hadoop clusters
MapReduce has become an important distributed processing model for large-scale data-intensive applications like data mining and web indexing. Hadoop
Jiong Xie, Shu Yin, Xiaojun Ruan, Zhiyang Ding, Yu...
ACL
2003
13 years 8 months ago
Clustering Polysemic Subcategorization Frame Distributions Semantically
Previous research has demonstrated the utility of clustering in inducing semantic verb classes from undisambiguated corpus data. We describe a new approach which involves clusteri...
Anna Korhonen, Yuval Krymolowski, Zvika Marx
INCDM
2007
Springer
14 years 1 months ago
Clustering by Random Projections
Clustering algorithms for multidimensional numerical data must overcome special difficulties due to the irregularities of data distribution. We present a clustering algo...
Thierry Urruty, Chabane Djeraba, Dan A. Simovici
ASIAN
2005
Springer
14 years 28 days ago
ACB-R: An Adaptive Clustering-Based Data Replication Algorithm on a P2P Data-Store
Replication on geographically distributed, unreliable, P2P interconnecting nodes can offer high data availability and low network latency for replica access. The challenge is how ...
Junhu Zhang, Dongqing Yang, Shiwei Tang