Sciweavers

328 search results - page 2 / 66
» A Fast Clustering Algorithm to Cluster Very Large Categorica...
Sort
View
SIGMOD
2000
ACM
212views Database» more  SIGMOD 2000»
13 years 11 months ago
SQLEM: Fast Clustering in SQL using the EM Algorithm
Clustering is one of the most important tasks performed in Data Mining applications. This paper presents an e cient SQL implementation of the EM algorithm to perform clustering in...
Carlos Ordonez, Paul Cereghini
KDD
1999
ACM
166views Data Mining» more  KDD 1999»
13 years 11 months ago
CACTUS - Clustering Categorical Data Using Summaries
Clustering is an important data mining problem. Most of the earlier work on clustering focussed on numeric attributes which have a natural ordering on their attribute values. Rece...
Venkatesh Ganti, Johannes Gehrke, Raghu Ramakrishn...
SDM
2009
SIAM
114views Data Mining» more  SDM 2009»
14 years 4 months ago
GAD: General Activity Detection for Fast Clustering on Large Data.
In this paper, we propose GAD (General Activity Detection) for fast clustering on large scale data. Within this framework we design a set of algorithms for different scenarios: (...
Jiawei Han, Liangliang Cao, Sangkyum Kim, Xin Jin,...
ICDM
2003
IEEE
154views Data Mining» more  ICDM 2003»
14 years 22 days ago
MaPle: A Fast Algorithm for Maximal Pattern-based Clustering
Pattern-based clustering is important in many applications, such as DNA micro-array data analysis, automatic recommendation systems and target marketing systems. However, pattern-...
Jian Pei, Xiaoling Zhang, Moonjung Cho, Haixun Wan...
CAINE
2003
13 years 8 months ago
A Genetic Algorithm for Clustering on Very Large Data Sets
Clustering is the process of subdividing an input data set into a desired number of subgroups so that members of the same subgroup are similar and members of different subgroups h...
Jim Gasvoda, Qin Ding