Search Sciweavers | Sciweavers

51 search results - page 5 / 11

» Exponentiated Gradient Methods for Reinforcement Learning

196

click to vote

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

16 years 1 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

180

click to vote

ICML
2002
IEEE

133views Machine Learning» more ICML 2002»

Coordinated Reinforcement Learning

16 years 7 months ago

Download select.cs.cmu.edu

We present several new algorithms for multiagent reinforcement learning. A common feature of these algorithms is a parameterized, structured representation of a policy or value fu...

Carlos Guestrin, Michail G. Lagoudakis, Ronald Par...

claim paper

Read More »

157

click to vote

ICANN
2007
Springer

95views Neural Networks» more ICANN 2007»

Solving Deep Memory POMDPs with Recurrent Policy Gradients

16 years 1 months ago

Download www.idsia.ch

Abstract. This paper presents Recurrent Policy Gradients, a modelfree reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...

Daan Wierstra, Alexander Förster, Jan Peters,...

claim paper

Read More »

175

click to vote

ACL
2009

123views Computational Linguistics» more ACL 2009»

Reinforcement Learning for Mapping Instructions to Actions

15 years 4 months ago

Download www.aclweb.org

In this paper, we present a reinforcement learning approach for mapping natural language instructions to sequences of executable actions. We assume access to a reward function tha...

S. R. K. Branavan, Harr Chen, Luke S. Zettlemoyer,...

claim paper

Read More »

177

click to vote

ICRA
2009
IEEE

259views Robotics» more ICRA 2009»

Constructing action set from basis functions for reinforcement learning of robot control

16 years 1 months ago

Download robotics.aist-nara.ac.jp

Abstract— Continuous action sets are used in many reinforcement learning (RL) applications in robot control since the control input is continuous. However, discrete action sets a...

Akihiko Yamaguchi, Jun Takamatsu, Tsukasa Ogasawar...

claim paper

Read More »

« Prev « First page 5 / 11 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers