Sciweavers

PR
2007

Mining evolving data streams for frequent patterns

13 years 11 months ago
Mining evolving data streams for frequent patterns
A data stream is a potentially uninterrupted flow of data. Mining this flow makes it necessary to cope with uncertainty, as only a part of the stream can be stored. In this paper, we evaluate a statistical technique which biases the estimation of the support of patterns, so as to maximize either the precision or the recall, as chosen by the user, and limit the degradation of the other criterion. Theoretical results show that the technique is not far from the optimum, from the statistical standpoint. Experiments performed tend to demonstrate its potential, as it remains robust even under significant distribution drifts. Key words: Data streams, Concentration inequalities, Precision, Recall, Accuracy. Preprint submitted to Elsevier Science 8 November 2005
Pierre-Alain Laur, Richard Nock, Jean-Emile Sympho
Added 27 Dec 2010
Updated 27 Dec 2010
Type Journal
Year 2007
Where PR
Authors Pierre-Alain Laur, Richard Nock, Jean-Emile Symphor, Pascal Poncelet
Comments (0)