Search Sciweavers | Sciweavers

2108 search results - page 87 / 422

» Tracking in Reinforcement Learning

NN
2002
Springer

113 views, Neural Networks

Control of exploitation-exploration meta-parameter in reinforcement learning

15 years 7 months ago

Download www.fil.ion.ucl.ac.uk

In reinforcement learning (RL), the duality between exploitation and exploration has long been an important issue. This paper presents a new method that controls the balance betwe...

Shin Ishii, Wako Yoshida, Junichiro Yoshimoto

ACL
2009

123 views, Computational Linguistics

Reinforcement Learning for Mapping Instructions to Actions

15 years 5 months ago

Download www.aclweb.org

In this paper, we present a reinforcement learning approach for mapping natural language instructions to sequences of executable actions. We assume access to a reward function tha...

S. R. K. Branavan, Harr Chen, Luke S. Zettlemoyer,...

ICASSP
2011
IEEE

204 views, Signal Processing

Bayesian reinforcement learning for POMDP-based dialogue systems

14 years 11 months ago

Download mirlab.org

Spoken dialogue systems are gaining popularity with improvements in speech recognition technologies. Dialogue systems can be modeled effectively using POMDPs, achieving improvemen...

ShaoWei Png, Joelle Pineau

GECCO
2006
Springer

159 views, Optimization

Standard and averaging reinforcement learning in XCS

15 years 11 months ago

Download www.cs.bham.ac.uk

This paper investigates reinforcement learning (RL) in XCS. First, it formally shows that XCS implements a method of generalized RL based on linear approximators, in which the usu...

Pier Luca Lanzi, Daniele Loiacono

ICML
2002
IEEE

138 views, Machine Learning

Reinforcement Learning and Shaping: Encouraging Intended Behaviors

16 years 8 months ago

Download www.grappa.univ-lille3.fr

We explore dynamic shaping to integrate our prior beliefs of the final policy into a conventional reinforcement learning system. Shaping provides a positive or negative artificial...

Adam Laud, Gerald DeJong
