Search Sciweavers | Sciweavers

81 search results - page 10 / 17

» The Optimal Reward Baseline for Gradient-Based Reinforcement...

182

click to vote

BMCV
2000
Springer

170views Computer Vision» more BMCV 2000»

Unsupervised Learning of Biologically Plausible Object Recognition Strategies

15 years 11 months ago

Download www.cs.colostate.edu

Recent psychological and neurological evidence suggests that biological object recognition is a process of matching sensed images to stored iconic memories. This paper presents a p...

Bruce A. Draper, Kyungim Baek

claim paper

Read More »

213

click to vote

ATAL
2008
Springer

133views Intelligent Agents» more ATAL 2008»

Transfer of task representation in reinforcement learning using policy-based proto-value functions

15 years 9 months ago

Download www.aamas-conference.org

Reinforcement Learning research is traditionally devoted to solve single-task problems. Therefore, anytime a new task is faced, learning must be restarted from scratch. Recently, ...

Eliseo Ferrante, Alessandro Lazaric, Marcello Rest...

claim paper

Read More »

156

click to vote

IJCNN
2006
IEEE

121views Neural Networks» more IJCNN 2006»

Learning a Rendezvous Task with Dynamic Joint Action Perception

16 years 1 months ago

Download axon.cs.byu.edu

Abstract— Groups of reinforcement learning agents interacting in a common environment often fail to learn optimal behaviors. Poor performance is particularly common in environmen...

Nancy Fulda, Dan Ventura

claim paper

Read More »

216

click to vote

ICRA
2008
IEEE

173views Robotics» more ICRA 2008»

Bayesian reinforcement learning in continuous POMDPs with application to robot navigation

16 years 1 months ago

Download www.cs.cmu.edu

— We consider the problem of optimal control in continuous and partially observable environments when the parameters of the model are not known exactly. Partially Observable Mark...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...

claim paper

Read More »

241

click to vote

INLG
2010
Springer

134views Natural Language Processing» more INLG 2010»

Hierarchical Reinforcement Learning for Adaptive Text Generation

15 years 5 months ago

Download www.aclweb.org

We present a novel approach to natural language generation (NLG) that applies hierarchical reinforcement learning to text generation in the wayfinding domain. Our approach aims to...

Nina Dethlefs, Heriberto Cuayáhuitl

claim paper

Read More »

« Prev « First page 10 / 17 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers