Search Sciweavers | Sciweavers

1262 search results - page 212 / 253

» Reinforcement Learning: An Introduction

Voted

ICANN
2007
Springer

95views Neural Networks» more ICANN 2007»

Solving Deep Memory POMDPs with Recurrent Policy Gradients

15 years 9 months ago

Download www.idsia.ch

Abstract. This paper presents Recurrent Policy Gradients, a modelfree reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...

Daan Wierstra, Alexander Förster, Jan Peters,...

claim paper

Read More »

129

click to vote

CIMCA
2006
IEEE

164views Intelligent Agents» more CIMCA 2006»

Multi-Agent Coalition Formation for Long-Term Task or Mobile Network

15 years 9 months ago

Download digital.cs.usu.edu

Coalition formation is a process to form a group and solve a problem via cooperation. Because of the rising of network, each computing device can communicate through network. We c...

Hsiu-Hui Lee, Chung-Hsien Chen

claim paper

Read More »

140

Voted

CIS
2005
Springer

129views Applied Computing» more CIS 2005»

An RLS-Based Natural Actor-Critic Algorithm for Locomotion of a Two-Linked Robot Arm

15 years 9 months ago

Download www-clmc.usc.edu

Recently, actor-critic methods have drawn much interests in the area of reinforcement learning, and several algorithms have been studied along the line of the actor-critic strategy...

Jooyoung Park, Jongho Kim, Daesung Kang

claim paper

Read More »

142

Voted

AMEC
2004
Springer

243views Intelligent Agents» more AMEC 2004»

Three Automated Stock-Trading Agents: A Comparative Study

15 years 9 months ago

Download userweb.cs.utexas.edu

Abstract. This paper documents the development of three autonomous stocktrading agents within the framework of the Penn Exchange Simulator (PXS), a novel stock-trading simulator th...

Alexander A. Sherstov, Peter Stone

claim paper

Read More »

141

Voted

ATAL
2004
Springer

197views Intelligent Agents» more ATAL 2004»

Adaptive, Distributed Control of Constrained Multi-Agent Systems

15 years 9 months ago

Download collectives.stanford.edu

Product Distribution (PD) theory was recently developed as a framework for analyzing and optimizing distributed systems. In this paper we demonstrate its use for adaptive distribu...

Stefan Bieniawski, David Wolpert

claim paper

Read More »

« Prev « First page 212 / 253 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers