reinforcement learning methods

37

ACMSE
2010
ACM

111views Theoretical Computer Science» more ACMSE 2010»

Generating three binary addition algorithms using reinforcement programming

14 years 1 months ago

Reinforcement Programming (RP) is a new technique for automatically generating a computer program using reinforcement learning methods. This paper describes how RP learned to gene...

Spencer K. White, Tony R. Martinez, George L. Rudo...

claim paper

Read More »

48

click to vote

AR
2008

118views more AR 2008»

Efficient Behavior Learning Based on State Value Estimation of Self and Others

14 years 3 months ago

Download www.er.ams.eng.osaka-u.ac.jp

The existing reinforcement learning methods have been seriously suffering from the curse of dimension problem especially when they are applied to multiagent dynamic environments. ...

Yasutake Takahashi, Kentarou Noma, Minoru Asada

claim paper

Read More »

49

click to vote

ICMLA
2003

159views Machine Learning» more ICMLA 2003»

A Distributed Reinforcement Learning Approach to Pattern Inference in Go

14 years 4 months ago

Download mysite.verizon.net

— This paper shows that the distributed representation found in Learning Vector Quantization (LVQ) enables reinforcement learning methods to cope with a large decision search spa...

Myriam Abramson, Harry Wechsler

claim paper

Read More »

51

click to vote

NCI
2004

185views Neural Networks» more NCI 2004»

Hierarchical reinforcement learning with subpolicies specializing for learned subgoals

14 years 4 months ago

Download staff.science.uva.nl

This paper describes a method for hierarchical reinforcement learning in which high-level policies automatically discover subgoals, and low-level policies learn to specialize for ...

Bram Bakker, Jürgen Schmidhuber

claim paper

Read More »

45

click to vote

ICDM
2002
IEEE

105views Data Mining» more ICDM 2002»

Empirical Comparison of Various Reinforcement Learning Strategies for Sequential Targeted Marketing

14 years 8 months ago

Download www.weifan.info

We empirically evaluate the performance of various reinforcement learning methods in applications to sequential targeted marketing. In particular, we propose and evaluate a progre...

Naoki Abe, Edwin P. D. Pednault, Haixun Wang, Bian...

claim paper

Read More »

57

click to vote

IROS
2006
IEEE

113views Robotics» more IROS 2006»

Policy Gradient Methods for Robotics

14 years 9 months ago

Download www.cs.utah.edu

— The aquisition and improvement of motor skills and control policies for robotics from trial and error is of essential importance if robots should ever leave precisely pre-struc...

Jan Peters, Stefan Schaal

claim paper

Read More »

50

click to vote

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

15 years 4 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

42

click to vote

ICML
2004
IEEE

156views Machine Learning» more ICML 2004»

Learning to fly by combining reinforcement learning with behavioural cloning

15 years 4 months ago

Download ccc.inaoep.mx

Reinforcement learning deals with learning optimal or near optimal policies while interacting with the environment. Application domains with many continuous variables are difficul...

Eduardo F. Morales, Claude Sammut

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers