Self-Adaptive Anytime Stream Clustering


Clustering streaming data requires algorithms that can update clustering results as new data arrives. Because data arrives constantly, processing time is limited: clustering has to be performed in a single pass over the incoming data and within the possibly varying inter-arrival times of the stream. Memory is likewise limited, making it impossible to store all data. The challenge for clustering is therefore to maintain a current result that can be presented to the user at any given time. In this work, we propose a parameter-free algorithm that automatically adapts to the speed of the data stream. It makes the best use of the time available under the current constraints to provide a clustering of the objects seen up to that point. Our approach incorporates the age of the objects to reflect the greater importance of more recent data. Moreover, it can detect concept drift, novelty and outliers in the stream. For efficient and effective handling, we i...
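The idea of weighting objects by age while storing only a summary can be illustrated with a minimal sketch. This is not the paper's algorithm: the class name `DecayedMicroCluster`, the exponential fading function, and the `decay_rate` value are all assumptions chosen for illustration. It keeps only a decayed weight and a decayed linear sum per cluster, so each object is processed once and never stored.

```python
from dataclasses import dataclass, field


@dataclass
class DecayedMicroCluster:
    """Hypothetical cluster summary: decayed object count and linear sum."""
    dim: int
    decay_rate: float = 0.1  # assumed value, for illustration only
    weight: float = 0.0
    linear_sum: list = field(default_factory=list)
    last_update: float = 0.0

    def __post_init__(self):
        if not self.linear_sum:
            self.linear_sum = [0.0] * self.dim

    def _fade(self, now):
        # Assumed fading function: contributions halve every 1/decay_rate time units.
        factor = 2.0 ** (-self.decay_rate * (now - self.last_update))
        self.weight *= factor
        self.linear_sum = [v * factor for v in self.linear_sum]
        self.last_update = now

    def insert(self, point, now):
        # Single pass: age the summary, then absorb the new object with weight 1.
        self._fade(now)
        self.weight += 1.0
        self.linear_sum = [s + x for s, x in zip(self.linear_sum, point)]

    def centroid(self):
        # Weighted mean; recent objects dominate because older ones have faded.
        return [s / self.weight for s in self.linear_sum]
```

With `decay_rate=1.0`, an object inserted one time unit earlier counts half as much as a new one, so the centroid shifts toward recent data.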



Algorithms | Data Mining | ICDM 2009 | Incoming Data | Stream


Post Info

Added	18 Feb 2011
Updated	18 Feb 2011
Type	Conference
Year	2009
Where	ICDM
Authors	Philipp Kranen, Ira Assent, Corinna Baldauf, Thomas Seidl
