Search Sciweavers | Sciweavers

473 search results - page 61 / 95

» Optimal policy switching algorithms for reinforcement learni...

166

click to vote

NN
2010
Springer

125views Neural Networks» more NN 2010»

Parameter-exploring policy gradients

15 years 4 months ago

Download www.kyb.mpg.de

We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...

Frank Sehnke, Christian Osendorfer, Thomas Rü...

claim paper

Read More »

216

click to vote

SIGDIAL
2010

137views Natural Language Processing» more SIGDIAL 2010»

Modeling Spoken Decision Making Dialogue and Optimization of its Dialogue Strategy

15 years 3 months ago

Download mastarpj.nict.go.jp

This paper presents a spoken dialogue framework that helps users in making decisions. Users often do not have a definite goal or criteria for selecting from a list of alternatives...

Teruhisa Misu, Komei Sugiura, Kiyonori Ohtake, Chi...

claim paper

Read More »

161

click to vote

IWANN
1999
Springer

115views Neural Networks» more IWANN 1999»

Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning

15 years 10 months ago

Download www.cs.colostate.edu

To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...

R. Matthew Kretchmar, Charles W. Anderson

claim paper

Read More »

191

click to vote

UAI
2008

242views Artificial Intelligence» more UAI 2008»

Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping

15 years 7 months ago

Download uai2008.cs.helsinki.fi

We consider the problem of efficiently learning optimal control policies and value functions over large state spaces in an online setting in which estimates must be available afte...

Richard S. Sutton, Csaba Szepesvári, Alborz...

claim paper

Read More »

165

click to vote

ATAL
2009
Springer

172views Intelligent Agents» more ATAL 2009»

Decentralized Learning in Wireless Sensor Networks

15 years 3 months ago

Download teamcore.usc.edu

In this paper we use a reinforcement learning algorithm with the aim to increase the autonomous lifetime of a Wireless Sensor Network (WSN) and decrease latency in a decentralized...

Mihail Mihaylov, Karl Tuyls, Ann Nowé

claim paper

Read More »

« Prev « First page 61 / 95 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers