Search Sciweavers | Sciweavers

We combine the results of [13] and [8] and derive a continuous variant of a large class of drifting games. Our analysis furthers the understanding of the relationship between boos...

Yoav Freund, Manfred Opper

claim paper

Read More »

142

click to vote

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 11 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

169

click to vote

COLT
2000
Springer

132views Machine Learning» more COLT 2000»

Barrier Boosting

15 years 11 months ago

Download users.soe.ucsc.edu

Boosting algorithms like AdaBoost and Arc-GV are iterative strategies to minimize a constrained objective function, equivalent to Barrier algorithms. Based on this new understandi...

Gunnar Rätsch, Manfred K. Warmuth, Sebastian ...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers