Search Sciweavers | Sciweavers

86 search results - page 13 / 18

» Estimation and Approximation Bounds for Gradient-Based Reinf...

198

click to vote

JMLR
2012

187views Programming Languages» more JMLR 2012»

Bounding the Probability of Error for High Precision Optical Character Recognition

13 years 9 months ago

Download jmlr.csail.mit.edu

We consider a model for which it is important, early in processing, to estimate some variables with high precision, but perhaps at relatively low recall. If some variables can be ...

Gary B. Huang, Andrew Kae, Carl Doersch, Erik G. L...

claim paper

Read More »

198

click to vote

NIPS
2008

188views Information Technology» more NIPS 2008»

Bayesian Kernel Shaping for Learning Control

15 years 8 months ago

Download eprints.pascal-network.org

In kernel-based regression learning, optimizing each kernel individually is useful when the data density, curvature of regression surfaces (or decision boundaries) or magnitude of...

Jo-Anne Ting, Mrinal Kalakrishnan, Sethu Vijayakum...

claim paper

Read More »

160

click to vote

NIPS
1997

121views Information Technology» more NIPS 1997»

Generalized Prioritized Sweeping

15 years 8 months ago

Download www.cs.huji.ac.il

Prioritized sweeping is a model-based reinforcement learning method that attempts to focus an agent’s limited computational resources to achieve a good estimate of the value of ...

David Andre, Nir Friedman, Ronald Parr

claim paper

Read More »

167

click to vote

ICML
2006
IEEE

151views Machine Learning» more ICML 2006»

Efficient MAP approximation for dense energy functions

16 years 7 months ago

Download www.ri.cmu.edu

We present an efficient method for maximizing energy functions with first and second order potentials, suitable for MAP labeling estimation problems that arise in undirected graph...

Marius Leordeanu, Martial Hebert

claim paper

Read More »

215

click to vote

JMLR
2010

119views more JMLR 2010»

A Convergent Online Single Time Scale Actor Critic Algorithm

15 years 1 months ago

Download jmlr.csail.mit.edu

Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...

Dotan Di Castro, Ron Meir

claim paper

Read More »

« Prev « First page 13 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers