Sciweavers

ICDE
2004
IEEE

An Efficient Algorithm for Mining Frequent Sequences by a New Strategy without Support Counting

Mining sequential patterns in large databases is an important research topic. The main challenge of mining sequential patterns is the high processing cost caused by the large amount of data. In this paper, we propose a new strategy called DIrect Sequence Comparison (abbreviated as DISC), which can find frequent sequences without having to compute the support counts of non-frequent sequences. The main difference between the DISC strategy and previous work is the way non-frequent sequences are pruned. Previous approaches are based on the anti-monotone property, which prunes non-frequent sequences according to the frequent sequences of shorter length. In contrast, the DISC strategy prunes non-frequent sequences according to other sequences of the same length. Moreover, we summarize three strategies used in previous work and design an efficient algorithm called DISC-all to take advantage of all four strategies. The experimental results show that the DISC-all a...
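For context, the anti-monotone pruning that the abstract attributes to previous work can be sketched as follows. This is a minimal illustration and not the DISC or DISC-all algorithm from the paper: it assumes each sequence is a tuple of single items (real sequence databases usually hold itemsets), and the helper names and the toy `database` are hypothetical.

```python
# Illustrative sketch of anti-monotone (Apriori-style) pruning.
# Assumption: a sequence is a tuple of single items; all names are hypothetical.

def is_subsequence(pattern, sequence):
    """Return True if `pattern` occurs in `sequence` in order (gaps allowed)."""
    remaining = iter(sequence)
    # `item in remaining` advances the iterator, so order is respected.
    return all(item in remaining for item in pattern)

def support(pattern, database):
    """Count the sequences in `database` that contain `pattern`."""
    return sum(is_subsequence(pattern, seq) for seq in database)

def drop_one_subsequences(candidate):
    """All subsequences obtained by deleting exactly one item."""
    return {candidate[:i] + candidate[i + 1:] for i in range(len(candidate))}

def prune_by_anti_monotone(candidates, frequent_shorter):
    """Keep only candidates whose shorter subsequences are all frequent.

    A candidate with any non-frequent (k-1)-subsequence cannot be frequent,
    so it is discarded without scanning the database.
    """
    return [c for c in candidates
            if drop_one_subsequences(c) <= frequent_shorter]

# Toy data, invented for illustration only.
database = [("a", "b", "c"), ("a", "c"), ("a", "b", "c"), ("b", "c")]
frequent_2 = {("a", "c"), ("b", "c")}  # 2-sequences with support >= 3
survivors = prune_by_anti_monotone([("a", "b", "c")], frequent_2)
```

Here the candidate `("a", "b", "c")` is discarded because its subsequence `("a", "b")` is not frequent. Pruning of this kind depends on the frequent sequences of length k-1, whereas the abstract describes DISC as pruning by comparison with other sequences of the same length.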
Ding-Ying Chiu, Yi-Hung Wu, Arbee L. P. Chen
Type Conference
Year 2004
Where ICDE