MaPle: A Fast Algorithm for Maximal Pattern-based Clustering



Pattern-based clustering is important in many applications, such as DNA micro-array data analysis, automatic recommendation systems and target marketing systems. However, pattern-based clustering in large databases is challenging. On the one hand, there can be a huge number of clusters, many of them redundant, which makes pattern-based clustering ineffective. On the other hand, previously proposed methods may not be efficient or scalable in mining large databases. In this paper, we study the problem of maximal pattern-based clustering. Redundant clusters are avoided completely by mining only the maximal pattern-based clusters. MaPle, an efficient and scalable mining algorithm, is developed. It conducts a depth-first, divide-and-conquer search and prunes unnecessary branches smartly. Our extensive performance study on both synthetic data sets and real data sets shows that maximal pattern-based clustering is effective. It reduces the number of clusters substantially. M...
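The abstract does not spell out what makes a cluster "pattern-based" or "maximal", so the following is only a minimal sketch under an assumption: it uses the δ-pCluster definition common in the pattern-based clustering literature, where a set of objects and attributes forms a cluster if every 2x2 submatrix has a pScore of at most δ. The function names, the toy matrix and the brute-force maximality check are hypothetical illustrations; this is not the MaPle search itself, which the abstract describes only as depth-first with pruning.

```python
from itertools import combinations

def p_score(data, x, y, a, b):
    """pScore of the 2x2 submatrix on objects x, y and attributes a, b."""
    return abs((data[x][a] - data[x][b]) - (data[y][a] - data[y][b]))

def is_delta_pcluster(data, objects, attributes, delta):
    """True if every 2x2 submatrix of (objects, attributes) has pScore <= delta."""
    return all(
        p_score(data, x, y, a, b) <= delta
        for x, y in combinations(objects, 2)
        for a, b in combinations(attributes, 2)
    )

def is_maximal(data, objects, attributes, delta):
    """True if the cluster is valid and no single extra object or attribute keeps it valid.

    Any subset of a delta-pCluster is also a delta-pCluster, so checking
    single-element extensions is enough to rule out every proper superset.
    """
    if not is_delta_pcluster(data, objects, attributes, delta):
        return False
    for o in range(len(data)):
        if o not in objects and is_delta_pcluster(data, objects + [o], attributes, delta):
            return False
    for a in range(len(data[0])):
        if a not in attributes and is_delta_pcluster(data, objects, attributes + [a], delta):
            return False
    return True

# Toy matrix (made up for illustration): rows are objects, columns are attributes.
data = [
    [1, 4, 2, 9],
    [3, 6, 4, 0],  # row 0 shifted by +2 on attributes 0..2
    [6, 9, 7, 5],  # row 0 shifted by +5 on attributes 0..2
    [2, 2, 9, 1],
]

coherent = is_delta_pcluster(data, [0, 1, 2], [0, 1, 2], delta=0)
incoherent = is_delta_pcluster(data, [0, 1, 2], [0, 1, 2, 3], delta=0)
maximal = is_maximal(data, [0, 1, 2], [0, 1, 2], delta=0)
redundant = is_maximal(data, [0, 1], [0, 1, 2], delta=0)
```

In this toy data, objects 0 to 2 follow the same shifted pattern on attributes 0 to 2, so that block is a cluster and is maximal, while the smaller block on objects 0 and 1 is a valid but redundant sub-cluster. Reporting only maximal clusters is what removes such redundancy.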

Jian Pei, Xiaoling Zhang, Moonjung Cho, Haixun Wang, Philip S. Yu


Data Mining | ICDM 2003 | Large Databases | Maximal Pattern-based Clusters | Pattern-based Clustering |


» FEMA A Fast Expectation Maximization Algorithm based on Grid and PCA

» New Algorithms for Fast Discovery of Association Rules

» Image Segmentation for Robots Fast Selfadapting Gaussian Mixture Model

» SCAN a structural clustering algorithm for networks

» Effective Cluster Assignment for Modulo Scheduling

» Possibilistic Approach to Biclustering An Application to Oligonucleotide Microarray Data A...

» Performance Evaluation of Shared Mesh Protection in WDM Networks

» Coclustering for crosssubject fiber tract analysis through diffusion tensor imaging

Post Info

Added	04 Jul 2010
Updated	04 Jul 2010
Type	Conference
Year	2003
Where	ICDM
Authors	Jian Pei, Xiaoling Zhang, Moonjung Cho, Haixun Wang, Philip S. Yu


Sciweavers
