We present a new approach for dealing with distribution change and concept drift when learning from data sequences that may vary with time. We use sliding windows whose size, instead of being fixed a priori, is recomputed online according to the rate of change observed from the data in the window itself. This frees the user or programmer from having to guess a time-scale for change. In contrast to many related works, we provide rigorous guarantees of performance, in the form of bounds on the rates of false positives and false negatives. Using ideas from data stream algorithmics, we develop a time- and memory-efficient version of this algorithm, called ADWIN2. We show how to combine ADWIN2 with the Naïve Bayes (NB) predictor in two ways: one, using it to monitor the error rate of the current model and declare when revision is necessary; and two, placing it inside the NB predictor to maintain up-to-date estimates of the conditional probabilities in the data. We test our approach using synthetic ...
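For intuition, the following is a minimal sketch of the adaptive-window idea the abstract describes: grow the window with each arriving item, and shrink it from the oldest end whenever some split into two subwindows shows averages that are "distinct enough" at a given confidence level. The names (AdaptiveWindow, hoeffding_cut), the exact Hoeffding-style threshold, and the brute-force split check are our own illustrative assumptions, not the paper's ADWIN/ADWIN2 specification; in particular, ADWIN2 replaces this quadratic scan with a time- and memory-efficient structure.

```python
import math
from collections import deque

def hoeffding_cut(n0, mean0, n1, mean1, delta=0.01):
    """Hoeffding-style test: are the two subwindow means 'distinct enough'?
    Illustrative threshold only; the paper's exact cut condition may differ."""
    m = 1.0 / (1.0 / n0 + 1.0 / n1)  # harmonic-mean-style effective sample size
    eps_cut = math.sqrt(math.log(2.0 / delta) / (2.0 * m))
    return abs(mean0 - mean1) >= eps_cut

class AdaptiveWindow:
    """Sketch of a variable-size sliding window: grow on each new item,
    shrink from the left while some split shows a significant change in mean."""
    def __init__(self, delta=0.01):
        self.delta = delta
        self.window = deque()

    def add(self, x):
        self.window.append(x)
        self._shrink()

    def _shrink(self):
        # Consider every split W = W0 . W1; while any split's subwindow
        # means differ beyond the confidence threshold, drop the oldest item.
        changed = True
        while changed and len(self.window) > 1:
            changed = False
            items = list(self.window)
            n, total = len(items), sum(items)
            s0, n0 = 0.0, 0
            for x in items[:-1]:
                s0 += x
                n0 += 1
                n1 = n - n0
                mean0, mean1 = s0 / n0, (total - s0) / n1
                if hoeffding_cut(n0, mean0, n1, mean1, self.delta):
                    self.window.popleft()  # forget the oldest observation
                    changed = True
                    break

    def mean(self):
        return sum(self.window) / len(self.window) if self.window else 0.0
```

A drift monitor in the spirit of the first NB combination above could feed each prediction's 0/1 error into such a window and declare that model revision is needed whenever the window shrinks, i.e., whenever old error statistics are deemed inconsistent with recent ones.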