Sciweavers

1038 search results - page 8 / 208
» A Genetic Algorithm for Clustering on Very Large Data Sets
Sort
View
ICPR
2006
IEEE
14 years 9 months ago
A Prototypes-Embedded Genetic K-means Algorithm
This paper presents a genetic algorithm (GA) for Kmeans clustering. Instead of the widely applied stringof-group-numbers encoding, we encode the prototypes of the clusters into th...
Hsin-Chia Fu, Hsin-Min Wang, Shih-Sian Cheng, Yi-H...
ADBIS
2007
Springer
143views Database» more  ADBIS 2007»
14 years 2 months ago
Aggregating Multiple Instances in Relational Database Using Semi-Supervised Genetic Algorithm-based Clustering Technique
In solving the classification problem in relational data mining, traditional methods, for example, the C4.5 and its variants, usually require data transformations from datasets sto...
Rayner Alfred, Dimitar Kazakov
SDM
2009
SIAM
114views Data Mining» more  SDM 2009»
14 years 5 months ago
GAD: General Activity Detection for Fast Clustering on Large Data.
In this paper, we propose GAD (General Activity Detection) for fast clustering on large scale data. Within this framework we design a set of algorithms for different scenarios: (...
Jiawei Han, Liangliang Cao, Sangkyum Kim, Xin Jin,...
KDD
2002
ACM
166views Data Mining» more  KDD 2002»
14 years 9 months ago
Frequent term-based text clustering
Text clustering methods can be used to structure large sets of text or hypertext documents. The well-known methods of text clustering, however, do not really address the special p...
Florian Beil, Martin Ester, Xiaowei Xu
DMIN
2006
142views Data Mining» more  DMIN 2006»
13 years 10 months ago
Parallel Hybrid Clustering using Genetic Programming and Multi-Objective Fitness with Density (PYRAMID)
Clustering is the process of locating patterns in large data sets. It is an active research area that provides value to scientific as well as business applications. Practical clust...
Junping Sun, William Sverdlik, Samir Tout