Single-pass online learning: performance, voting schemes and online feature selection

To learn concepts over massive data streams, it is essential to design inference and learning methods that operate in real time with limited memory. Online learning methods such as the perceptron or Winnow are naturally suited to stream processing; in practice, however, multiple passes over the same training data are required to achieve accuracy comparable to state-of-the-art batch learners. In this work we address the problem of training an online learner with a single pass over the data. We evaluate several existing methods and propose a new modification of Margin Balanced Winnow, which performs comparably to a linear SVM. We also explore the effect of averaging, a.k.a. voting, on online learning. Finally, we describe how the new Modified Margin Balanced Winnow algorithm can be naturally adapted to perform feature selection. This scheme performs comparably to widely used batch feature selection methods like information gain or Chi-square, with the advantage of being ab...
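
As a rough illustration of the single-pass setting and of averaging, the sketch below trains a plain averaged perceptron with one pass over a stream of examples. It is not the Modified Margin Balanced Winnow algorithm from the paper, whose update rule is not given in this abstract; the perceptron is used here only as the simplest of the online learners the abstract names. The function names, the dense list representation, and the toy data are all assumptions made for the example.

```python
from typing import Iterable, List, Tuple

# Hypothetical example type: (feature vector, label in {-1, +1}).
Example = Tuple[List[float], int]


def train_averaged_perceptron(
    stream: Iterable[Example], n_features: int
) -> Tuple[List[float], List[float]]:
    """Single pass over `stream`; returns (final weights, averaged weights)."""
    w = [0.0] * n_features      # current weight vector
    w_sum = [0.0] * n_features  # running sum of the weights after each example
    seen = 0
    for x, y in stream:         # each example is visited exactly once
        score = sum(wi * xi for wi, xi in zip(w, x))
        if y * score <= 0:      # mistake-driven: update only on an error
            w = [wi + y * xi for wi, xi in zip(w, x)]
        w_sum = [si + wi for si, wi in zip(w_sum, w)]
        seen += 1
    # Averaging: the mean of all intermediate weight vectors.
    w_avg = [si / seen for si in w_sum] if seen else list(w)
    return w, w_avg


def predict(w: List[float], x: List[float]) -> int:
    """Sign of the linear score, with ties broken toward -1."""
    return 1 if sum(wi * xi for wi, xi in zip(w, x)) > 0 else -1


# Toy stream (made up for illustration).
data = [([1.0, 0.0], 1), ([0.0, 1.0], -1), ([1.0, 0.2], 1), ([0.1, 1.0], -1)]
w_final, w_avg = train_averaged_perceptron(data, n_features=2)
```

Only two weight vectors are kept in memory regardless of stream length, which is what makes this kind of learner a fit for the limited-memory setting described above.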

Vitor R. Carvalho, William W. Cohen

Data Mining | KDD 2006 | Margin Balanced Winnow | Online Learning Methods | Winnow Algorithm |

Post Info

Added	30 Nov 2009
Updated	30 Nov 2009
Type	Conference
Year	2006
Where	KDD
Authors	Vitor R. Carvalho, William W. Cohen
