Clustering of Time Series Subsequences is Meaningless: Implications for Previous and Future Research

14 years 6 months ago

Download www.cs.ucr.edu

Given the recent explosion of interest in streaming data and online algorithms, clustering of time series subsequences, extracted via a sliding window, has received much attention. In this work we make a surprising claim. Clustering of time series subsequences is meaningless. More concretely, clusters extracted from these time series are forced to obey a certain constraint that is pathologically unlikely to be satisfied by any dataset, and because of this, the clusters extracted by any clustering algorithm are essentially random. While this constraint can be intuitively demonstrated with a simple illustration and is simple to prove, it has never appeared in the literature. We can justify calling our claim surprising, since it invalidates the contribution of dozens of previously published papers. We will justify our claim with a theorem, illustrative examples, and a comprehensive set of experiments on reimplementations of previous work. Although the primary contribution of our work is ...

Eamonn J. Keogh, Jessica Lin, Wagner Truppel

Real-time Traffic

Data Mining | ICDM 2003 | Surprising Claim | Time Series | Time Series Subsequences |

claim paper

Post Info
More Details (n/a)

Added	04 Jul 2010
Updated	04 Jul 2010
Type	Conference
Year	2003
Where	ICDM
Authors	Eamonn J. Keogh, Jessica Lin, Wagner Truppel

Comments (0)

Sciweavers

Clustering of Time Series Subsequences is Meaningless: Implications for Previous and Future Research

Data Mining | ICDM 2003 | Surprising Claim | Time Series | Time Series Subsequences |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers