Search Sciweavers | Sciweavers

80 search results - page 13 / 16

» Efficient Reinforcement Learning Using Recursive Least-Squar...


ACL
2010

135 views, Computational Linguistics

Reading between the Lines: Learning to Map High-Level Instructions to Commands

15 years 16 days ago

Download ai.cs.washington.edu

In this paper, we address the task of mapping high-level instructions to sequences of commands in an external environment. Processing these instructions is challenging--they posit...

S. R. K. Branavan, Luke S. Zettlemoyer, Regina Bar...



ATAL
2010
Springer

146 views, Intelligent Agents

PAC-MDP learning with knowledge-based admissible models

15 years 2 months ago

Download www.aamas-conference.org

PAC-MDP algorithms approach the exploration-exploitation problem of reinforcement learning agents in an effective way which guarantees that with high probability, the algorithm pe...

Marek Grzes, Daniel Kudenko



CORR
2010
Springer

152 views, Education

Neuroevolutionary optimization

15 years 2 months ago

Download jmlr.csail.mit.edu

Temporal difference methods are theoretically grounded and empirically effective methods for addressing reinforcement learning problems. In most real-world reinforcement learning ...

Eva Volná



ICML
1998
IEEE

165 views, Machine Learning

Intra-Option Learning about Temporally Abstract Actions

16 years 3 months ago

Download www.cs.ualberta.ca

Intra-Option Learning about Temporally Abstract Actions. Richard S. Sutton, Department of Computer Science, University of Massachusetts, Amherst, MA 01003-4610, rich@cs.umass.edu; Doina Precup, D...

Richard S. Sutton, Doina Precup, Satinder P. Singh



ICML
2010
IEEE

222 views, Machine Learning

Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda

15 years 14 days ago

Download www.icml2010.org

Temporal difference (TD) algorithms are attractive for reinforcement learning due to their ease of implementation and use of "bootstrapped" return estimates to make effi...

Carlton Downey, Scott Sanner

