Sciweavers

1233 search results - page 181 / 247
» Feudal Reinforcement Learning
ICRA
2007
IEEE
Robotics
14 years 2 months ago
Value Function Approximation on Non-Linear Manifolds for Robot Motor Control
The least squares approach works efficiently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular an...
Masashi Sugiyama, Hirotaka Hachiya, Christopher To...
ICANN
2007
Springer
14 years 2 months ago
Solving Deep Memory POMDPs with Recurrent Policy Gradients
Abstract. This paper presents Recurrent Policy Gradients, a model-free reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...
Daan Wierstra, Alexander Förster, Jan Peters,...
CIMCA
2006
IEEE
14 years 2 months ago
Multi-Agent Coalition Formation for Long-Term Task or Mobile Network
Coalition formation is a process of forming a group to solve a problem through cooperation. With the rise of networking, each computing device can communicate over a network. We c...
Hsiu-Hui Lee, Chung-Hsien Chen
CIS
2005
Springer
14 years 1 months ago
An RLS-Based Natural Actor-Critic Algorithm for Locomotion of a Two-Linked Robot Arm
Recently, actor-critic methods have drawn much interest in the area of reinforcement learning, and several algorithms have been studied along the line of the actor-critic strategy...
Jooyoung Park, Jongho Kim, Daesung Kang
AMEC
2004
Springer
14 years 1 months ago
Three Automated Stock-Trading Agents: A Comparative Study
Abstract. This paper documents the development of three autonomous stock-trading agents within the framework of the Penn Exchange Simulator (PXS), a novel stock-trading simulator th...
Alexander A. Sherstov, Peter Stone