Sciweavers

1971 search results - page 46 / 395
» A clustering method that uses lossy aggregation of data
ICDE
2006
IEEE
14 years 1 month ago
Novelty-based Incremental Document Clustering for On-line Documents
Document clustering has been used as a core technique for managing vast amounts of data and providing needed information. In on-line environments, generally new information gains mo...
Sophoin Khy, Yoshiharu Ishikawa, Hiroyuki Kitagawa
IPPS
2007
IEEE
14 years 2 months ago
Towards A Better Understanding of Workload Dynamics on Data-Intensive Clusters and Grids
This paper presents a comprehensive statistical analysis of workloads collected on data-intensive clusters and Grids. The analysis is conducted at different levels, including Virt...
Hui Li, Lex Wolters
KDD
2000
ACM
13 years 11 months ago
Efficient clustering of high-dimensional data sets with application to reference matching
Many important problems involve clustering large datasets. Although naive implementations of clustering are computationally expensive, there are established efficient techniques f...
Andrew McCallum, Kamal Nigam, Lyle H. Ungar
ICML
2009
IEEE
14 years 8 months ago
Multi-assignment clustering for Boolean data
Conventional clustering methods typically assume that each data item belongs to a single cluster. This assumption does not hold in general. In order to overcome this limitation, w...
Andreas P. Streich, Mario Frank, David A. Basin, J...
CIKM
2006
Springer
13 years 9 months ago
Efficiently clustering transactional data with weighted coverage density
In this paper, we propose a fast, memory-efficient, and scalable clustering algorithm for analyzing transactional data. Our approach has three unique features. First, we use the c...
Hua Yan, Keke Chen, Ling Liu