Search Sciweavers | Sciweavers

180 search results - page 5 / 36

» Improved bounds on the sample complexity of learning

click to vote

COLT
2010
Springer

177views Machine Learning» more COLT 2010»

Robust Selective Sampling from Single and Multiple Teachers

13 years 5 months ago

Download www.colt2010.org

We present a new online learning algorithm in the selective sampling framework, where labels must be actively queried before they are revealed. We prove bounds on the regret of ou...

Ofer Dekel, Claudio Gentile, Karthik Sridharan

claim paper

Read More »

click to vote

ATAL
2006
Springer

192views Intelligent Agents» more ATAL 2006»

A hierarchical approach to efficient reinforcement learning in deterministic domains

13 years 11 months ago

Download paul.rutgers.edu

Factored representations, model-based learning, and hierarchies are well-studied techniques for improving the learning efficiency of reinforcement-learning algorithms in large-sca...

Carlos Diuk, Alexander L. Strehl, Michael L. Littm...

claim paper

Read More »

click to vote

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

13 years 12 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

click to vote

IPL
2010

92views more IPL 2010»

Learning parities in the mistake-bound model

13 years 6 months ago

Download www.cs.technion.ac.il

We study the problem of learning parity functions that depend on at most k variables (kparities) attribute-eﬃciently in the mistake-bound model. We design a simple, deterministi...

Harry Buhrman, David García-Soriano, Arie M...

claim paper

Read More »

click to vote

ICML
2006
IEEE

130views Machine Learning» more ICML 2006»

Agnostic active learning

14 years 8 months ago

Download hunch.net

We state and analyze the first active learning algorithm which works in the presence of arbitrary forms of noise. The algorithm, A2 (for Agnostic Active), relies only upon the ass...

Maria-Florina Balcan, Alina Beygelzimer, John Lang...

claim paper

Read More »

« Prev « First page 5 / 36 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers