Search Sciweavers | Sciweavers

536 search results - page 22 / 108

» Residual Algorithms: Reinforcement Learning with Function Ap...

ICRA
2009
IEEE

259 views, Robotics

Constructing action set from basis functions for reinforcement learning of robot control

14 years 2 months ago

Download robotics.aist-nara.ac.jp

Abstract— Continuous action sets are used in many reinforcement learning (RL) applications in robot control since the control input is continuous. However, discrete action sets a...

Akihiko Yamaguchi, Jun Takamatsu, Tsukasa Ogasawar...

GECCO
2003
Springer

79 views, Optimization

Reinforcement Learning Estimation of Distribution Algorithm

14 years 26 days ago

Download www.iba.t.u-tokyo.ac.jp

Abstract. This paper proposes an algorithm for combinatorial optimizations that uses reinforcement learning and estimation of joint probability distribution of promising solutions ...

Topon Kumar Paul, Hitoshi Iba

JMLR
2010

119 views

A Convergent Online Single Time Scale Actor Critic Algorithm

13 years 2 months ago

Download jmlr.csail.mit.edu

Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...

Dotan Di Castro, Ron Meir

ICML
2010
IEEE

189 views, Machine Learning

Nonparametric Return Distribution Approximation for Reinforcement Learning

13 years 8 months ago

Download www.icml2010.org

Standard Reinforcement Learning (RL) aims to optimize decision-making rules in terms of the expected return. However, especially for risk-management purposes, other criteria such ...

Tetsuro Morimura, Masashi Sugiyama, Hisashi Kashim...

ICML
2008
IEEE

133 views, Machine Learning

An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning

14 years 8 months ago

Download www.cs.duke.edu

We show that linear value-function approximation is equivalent to a form of linear model approximation. We then derive a relationship between the model-approximation error and the...

Ronald Parr, Lihong Li, Gavin Taylor, Christopher ...
