Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

326

ICDE
2009
IEEE

171views Database» more ICDE 2009»

A Framework for Clustering Massive-Domain Data Streams

16 years 8 months ago

A Framework for Clustering Massive-Domain Data Streams

Download www.usukita.org

In this paper, we will examine the problem of clustering massive domain data streams. Massive-domain data streams are those in which the number of possible domain values for each attribute are very large and cannot be easily tracked for clustering purposes. Some examples of such streams include IP-address streams, credit-card transaction streams, or streams of sales data over large numbers of items. In such cases, it is well known that even simple stream operations such as counting can be extremely difficult because of the difficulty in maintaining summary information over the different discrete values. The task of clustering is significantly more challenging in such cases, since the intermediate statistics for the different clusters cannot be maintained efficiently. In this paper, we propose a method for clustering massive-domain data streams with the use of sketches. We prove probabilistic results which show that a sketch-based clustering method can provide similar results to an infi...

Charu C. Aggarwal

Real-time Traffic

Credit-card Transaction Streams | Database | ICDE 2009 | Infinitespace Clustering Algorithm | IP-address Streams | Massive-domain Data Streams | Sketch-based Clustering Method |

claim paper

Related Content

» DCF An Efficient Data Stream Clustering Framework for Streaming Applications

» Temporal Structure Learning for Clustering Massive Data Streams in RealTime

» Visualising the Cluster Structure of Data Streams

» Weighted Clustering and Evolutionary Analysis of Hybrid Attributes Data Streams

» Detecting Changes in Unlabeled Data Streams Using Martingale

» MOA Massive Online Analysis a Framework for Stream Classification and Clustering

» An Ensemble of Classifiers for coping with Recurring Contexts in Data Streams

» A framework for mining evolving trends in Web data streams using dynamic learning and retr...

» A Framework for Clustering Uncertain Data Streams

Post Info
More Details (n/a)

Added	20 Oct 2009
Updated	20 Oct 2009
Type	Conference
Year	2009
Where	ICDE
Authors	Charu C. Aggarwal

Comments (0)