Sciweavers

471 search results - page 40 / 95
» MapReduce: Simplified Data Processing on Large Clusters
Sort
View
WWW
2007
ACM
14 years 9 months ago
A new suffix tree similarity measure for document clustering
In this paper, we propose a new similarity measure to compute the pairwise similarity of text-based documents based on suffix tree document model. By applying the new suffix tree ...
Hung Chim, Xiaotie Deng
VLDB
2005
ACM
93views Database» more  VLDB 2005»
14 years 2 months ago
Revisiting Pipelined Parallelism in Multi-Join Query Processing
Multi-join queries are the core of any integration service that integrates data from multiple distributed data sources. Due to the large number of data sources and possibly high v...
Bin Liu, Elke A. Rundensteiner
ECML
2006
Springer
14 years 20 days ago
An Adaptive Kernel Method for Semi-supervised Clustering
Semi-supervised clustering uses the limited background knowledge to aid unsupervised clustering algorithms. Recently, a kernel method for semi-supervised clustering has been introd...
Bojun Yan, Carlotta Domeniconi
DIAGRAMS
2004
Springer
14 years 2 months ago
Geographic Projection of Cluster Composites
A composite cluster map displays a fuzzy categorisation of geographic areas. It combines information from several sources to provide a visualisation of the significance of cluster...
Peter Kleiweg, John Nerbonne, Leonie Bosveld
CLUSTER
2006
IEEE
14 years 3 months ago
Positioning Dynamic Storage Caches for Transient Data
Simulations, experiments and observatories are generating a deluge of scientific data. Even more staggering is the ever growing application demand to process and assimilate these...
Sudharshan S. Vazhkudai, Douglas Thain, Xiaosong M...