Search Sciweavers | Sciweavers

86 search results - page 12 / 18

» Estimation and Approximation Bounds for Gradient-Based Reinf...

207

click to vote

UAI
2008

252views Artificial Intelligence» more UAI 2008»

Small Sample Inference for Generalization Error in Classification Using the CUD Bound

15 years 8 months ago

Download www.stat.lsa.umich.edu

Confidence measures for the generalization error are crucial when small training samples are used to construct classifiers. A common approach is to estimate the generalization err...

Eric Laber, Susan Murphy

claim paper

Read More »

229

click to vote

UAI
2008

242views Artificial Intelligence» more UAI 2008»

Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping

15 years 8 months ago

Download uai2008.cs.helsinki.fi

We consider the problem of efficiently learning optimal control policies and value functions over large state spaces in an online setting in which estimates must be available afte...

Richard S. Sutton, Csaba Szepesvári, Alborz...

claim paper

Read More »

215

Voted

ECML
2005
Springer

193views Machine Learning» more ECML 2005»

Natural Actor-Critic

16 years 17 days ago

Download www-clmc.usc.edu

This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...

Jan Peters, Sethu Vijayakumar, Stefan Schaal

claim paper

Read More »

189

click to vote

ICML
2010
IEEE

258views Machine Learning» more ICML 2010»

Feature Selection as a One-Player Game

15 years 8 months ago

Download www.lri.fr

This paper formalizes Feature Selection as a Reinforcement Learning problem, leading to a provably optimal though intractable selection policy. As a second contribution, this pape...

Romaric Gaudel, Michèle Sebag

claim paper

Read More »

183

click to vote

WSC
2004

99views Modeling And Simulation» more WSC 2004»

Function-Approximation-Based Importance Sampling for Pricing American Options

15 years 8 months ago

Download www.informs-sim.org

Monte Carlo simulation techniques that use function approximations have been successfully applied to approximately price multi-dimensional American options. However, for many pric...

Nomesh Bolia, Sandeep Juneja, Paul Glasserman

claim paper

Read More »

« Prev « First page 12 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers