Sciweavers

1262 search results - page 212 / 253
» Reinforcement Learning: An Introduction
Sort
View
ICANN
2007
Springer
14 years 4 months ago
Solving Deep Memory POMDPs with Recurrent Policy Gradients
Abstract. This paper presents Recurrent Policy Gradients, a modelfree reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...
Daan Wierstra, Alexander Förster, Jan Peters,...
CIMCA
2006
IEEE
14 years 4 months ago
Multi-Agent Coalition Formation for Long-Term Task or Mobile Network
Coalition formation is a process to form a group and solve a problem via cooperation. Because of the rising of network, each computing device can communicate through network. We c...
Hsiu-Hui Lee, Chung-Hsien Chen
CIS
2005
Springer
14 years 3 months ago
An RLS-Based Natural Actor-Critic Algorithm for Locomotion of a Two-Linked Robot Arm
Recently, actor-critic methods have drawn much interests in the area of reinforcement learning, and several algorithms have been studied along the line of the actor-critic strategy...
Jooyoung Park, Jongho Kim, Daesung Kang
AMEC
2004
Springer
14 years 3 months ago
Three Automated Stock-Trading Agents: A Comparative Study
Abstract. This paper documents the development of three autonomous stocktrading agents within the framework of the Penn Exchange Simulator (PXS), a novel stock-trading simulator th...
Alexander A. Sherstov, Peter Stone
ATAL
2004
Springer
14 years 3 months ago
Adaptive, Distributed Control of Constrained Multi-Agent Systems
Product Distribution (PD) theory was recently developed as a framework for analyzing and optimizing distributed systems. In this paper we demonstrate its use for adaptive distribu...
Stefan Bieniawski, David Wolpert