WAT: Finding Top-K Discords in Time Series Database

15 years 8 months ago

Download www.cse.cuhk.edu.hk

Finding discords in time series database is an important problem in a great variety of applications, such as space shuttle telemetry, mechanical industry, biomedicine, and ﬁnancial data analysis. However, most previous methods for this problem suffer from too many parameter settings which are difﬁcult for users. The best known approach to our knowledge that has comparatively fewer parameters still requires users to choose a word size for the compression of subsequences. In this paper, we propose a Haar wavelet and augmented trie based algorithm to mine the top-K discords from a time series database, which can dynamically determine the word size for compression. Due to the characteristics of Haar wavelet transform, our algorithm has greater pruning power than previous approaches. Through experiments with some annotated datasets, the effectiveness and efﬁciency of our algorithm are both attested.

Yingyi Bu, Oscar Tat-Wing Leung, Ada Wai-Chee Fu,

Real-time Traffic

Data Mining | Haar Wavelet | SDM 2007 | Time Series Database | Word Size |

claim paper

Post Info
More Details (n/a)

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2007
Where	SDM
Authors	Yingyi Bu, Oscar Tat-Wing Leung, Ada Wai-Chee Fu, Eamonn J. Keogh, Jian Pei, Sam Meshkin

Comments (0)

Sciweavers

WAT: Finding Top-K Discords in Time Series Database

Data Mining | Haar Wavelet | SDM 2007 | Time Series Database | Word Size |

Explore & Download

Productivity Tools

Sciweavers