Robust bounds for classification via selective sampling

We introduce a new algorithm for binary classification in the selective sampling protocol. Our algorithm uses Regularized Least Squares (RLS) as its base classifier, and for this reason it can be run efficiently in any RKHS. Unlike previous margin-based semisupervised algorithms, our sampling condition hinges on a simultaneous upper bound on the bias and variance of the RLS estimate under a simple linear label noise model. This allows us to prove performance bounds that hold for an arbitrary sequence of instances. In particular, we show that our sampling strategy approximates the margin of the Bayes optimal classifier to any desired accuracy $\varepsilon$ by asking $O(d/\varepsilon^2)$ queries (in the RKHS case, $d$ is replaced by a suitable spectral quantity). While these are the standard rates in the fully supervised i.i.d. case, the best previously known result in our harder setting was $O(d^3/\varepsilon^4)$. Preliminary experiments show that some of our algorithms also exhibit good practical performance.
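
To make the protocol in the abstract concrete, here is a minimal sketch of a selective sampler built on RLS, in pure Python. It is not the paper's algorithm: the abstract does not state the exact sampling condition, so the query rule below (ask for the label when the variance-like quantity x^T A^{-1} x exceeds t^{-kappa}) is an assumption made for illustration. The names `SelectiveRLS`, `simulate`, `kappa`, and `reg` are likewise hypothetical. Labels are drawn from a linear noise model, P(y = +1 | x) = (1 + u.x)/2, with unit-norm u and x.

```python
import math
import random


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def unit(v):
    norm = math.sqrt(dot(v, v))
    return [a / norm for a in v]


class SelectiveRLS:
    """RLS classifier that updates only on the labels it chooses to query."""

    def __init__(self, dim, kappa=0.5, reg=1.0):
        self.kappa = kappa
        # Inverse of A = reg * I + sum of x x^T over queried instances.
        self.a_inv = [[(1.0 / reg) if i == j else 0.0 for j in range(dim)]
                      for i in range(dim)]
        self.b = [0.0] * dim  # sum of y * x over queried instances
        self.t = 0            # number of instances seen so far
        self.queries = 0

    def _a_inv_times(self, x):
        return [dot(row, x) for row in self.a_inv]

    def margin(self, x):
        # w.x with w = A^{-1} b; A^{-1} is symmetric, so this equals (A^{-1} x).b
        return dot(self._a_inv_times(x), self.b)

    def wants_label(self, x):
        # Assumed query rule (illustrative): query while x^T A^{-1} x is large.
        self.t += 1
        r = dot(x, self._a_inv_times(x))
        return r > self.t ** (-self.kappa)

    def update(self, x, y):
        # Sherman-Morrison rank-one update of A^{-1} after adding x x^T to A.
        a_inv_x = self._a_inv_times(x)
        denom = 1.0 + dot(x, a_inv_x)
        n = len(x)
        for i in range(n):
            for j in range(n):
                self.a_inv[i][j] -= a_inv_x[i] * a_inv_x[j] / denom
        self.b = [bi + y * xi for bi, xi in zip(self.b, x)]
        self.queries += 1


def simulate(n=500, dim=3, kappa=0.5, seed=0):
    """Run the sampler on synthetic data; return (queries, mistakes)."""
    rng = random.Random(seed)
    u = unit([rng.gauss(0, 1) for _ in range(dim)])
    learner = SelectiveRLS(dim, kappa)
    mistakes = 0
    for _ in range(n):
        x = unit([rng.gauss(0, 1) for _ in range(dim)])
        # Linear label noise: P(y = +1 | x) = (1 + u.x) / 2
        y = 1 if rng.random() < (1 + dot(u, x)) / 2 else -1
        pred = 1 if learner.margin(x) >= 0 else -1
        mistakes += int(pred != y)
        if learner.wants_label(x):
            learner.update(x, y)
    return learner.queries, mistakes


if __name__ == "__main__":
    queries, mistakes = simulate()
    print("queries:", queries, "mistakes:", mistakes)
```

In this sketch a larger `kappa` lowers the threshold faster, so more labels are requested. The Sherman-Morrison update keeps each round at O(d^2) cost without re-inverting the matrix.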

Nicolò Cesa-Bianchi, Claudio Gentile, Francesco Orabona

Bayes Optimal Classifier | ICML 2009 | Machine Learning | Regularized Least Squares | Selective Sampling Protocol |

Post Info

Added	17 Nov 2009
Updated	17 Nov 2009
Type	Conference
Year	2009
Where	ICML
Authors	Nicolò Cesa-Bianchi, Claudio Gentile, Francesco Orabona
