On compressing frequent patterns



A major challenge in frequent-pattern mining is the sheer size of its mining results. To compress the frequent patterns, we propose to cluster frequent patterns with a tightness measure d (called d-cluster), and to select a representative pattern for each cluster. The problem of finding a minimum set of representative patterns is shown to be NP-hard. We develop two greedy methods, RPglobal and RPlocal. The former has a guaranteed compression bound but higher computational complexity. The latter sacrifices the theoretical bound but is far more efficient. Our performance study shows that the compression quality of RPlocal is very close to that of RPglobal, and that both can reduce the number of closed frequent patterns by almost two orders of magnitude. Furthermore, RPlocal mines even faster than FPClose [G. Grahne, J. Zhu, Efficiently using prefix-trees in mining frequent itemsets, in: Proc. IEEE ICDM Workshop on Frequent Itemset Mining Implementations (FIMI’03)], a very fast closed frequent-patt...
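The idea of picking few representatives for clusters of tightness d can be illustrated with a small greedy set-cover sketch. This is not the authors' RPglobal or RPlocal implementation. The abstract does not define the distance or the cover relation, so the code assumes a Jaccard distance between the sets of supporting transactions, and assumes that a representative covers a pattern when the pattern is a subset of it and lies within distance d. The function names and the toy data are hypothetical.

```python
def support_set(pattern, transactions):
    """Indices of the transactions that contain every item of the pattern."""
    return frozenset(i for i, t in enumerate(transactions) if pattern <= t)


def jaccard_distance(a, b):
    """Assumed distance: 1 - |a & b| / |a | b| on supporting-transaction sets."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def greedy_representatives(patterns, transactions, d):
    """Greedy set cover: repeatedly pick the pattern covering most uncovered ones."""
    tids = {p: support_set(p, transactions) for p in patterns}
    # Assumed cover relation: r covers p if p is a subset of r and dist <= d.
    covers = {
        r: {p for p in patterns
            if p <= r and jaccard_distance(tids[p], tids[r]) <= d}
        for r in patterns
    }
    uncovered = set(patterns)
    chosen = []
    while uncovered:
        # Ties are broken by pattern length, then item order, for determinism.
        best = max(
            patterns,
            key=lambda r: (len(covers[r] & uncovered), len(r), sorted(r)),
        )
        chosen.append(best)
        uncovered -= covers[best]  # every pattern covers itself, so this shrinks
    return chosen


# Hypothetical toy data: four transactions and four frequent patterns.
transactions = [frozenset("abc"), frozenset("abc"), frozenset("ab"), frozenset("ad")]
patterns = [frozenset("a"), frozenset("b"), frozenset("ab"), frozenset("abc")]

reps = greedy_representatives(patterns, transactions, d=0.4)
print([sorted(r) for r in reps])  # [['a', 'b', 'c'], ['a', 'b']]
```

On this toy input, two representatives stand in for four patterns. A larger d loosens the clusters and generally lets fewer representatives cover the same set, at the cost of a coarser summary.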

Dong Xin, Jiawei Han, Xifeng Yan, Hong Cheng


DKE 2007 | Frequent Patterns | Mining | Representative Patterns |


Post Info

Added	13 Dec 2010
Updated	13 Dec 2010
Type	Journal
Year	2007
Where	DKE
Authors	Dong Xin, Jiawei Han, Xifeng Yan, Hong Cheng
