Sciweavers

NN
2010
Springer
Neural Networks
13 years 7 months ago
Parameter-exploring policy gradients
We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...
Frank Sehnke, Christian Osendorfer, Thomas Rü...
SIGDIAL
2010
13 years 7 months ago
Modeling Spoken Decision Making Dialogue and Optimization of its Dialogue Strategy
This paper presents a spoken dialogue framework that helps users in making decisions. Users often do not have a definite goal or criteria for selecting from a list of alternatives...
Teruhisa Misu, Komei Sugiura, Kiyonori Ohtake, Chi...
IWANN
1999
Springer
14 years 1 month ago
Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning
To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...
R. Matthew Kretchmar, Charles W. Anderson
UAI
2008
13 years 10 months ago
Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping
We consider the problem of efficiently learning optimal control policies and value functions over large state spaces in an online setting in which estimates must be available afte...
Richard S. Sutton, Csaba Szepesvári, Alborz...
ATAL
2009
Springer
13 years 6 months ago
Decentralized Learning in Wireless Sensor Networks
In this paper we use a reinforcement learning algorithm with the aim to increase the autonomous lifetime of a Wireless Sensor Network (WSN) and decrease latency in a decentralized...
Mihail Mihaylov, Karl Tuyls, Ann Nowé