Sciweavers

2108 search results - page 110 / 422
» Tracking in Reinforcement Learning
Sort
View
ICML
2006
IEEE
14 years 4 months ago
Automatic basis function construction for approximate dynamic programming and reinforcement learning
We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...
Philipp W. Keller, Shie Mannor, Doina Precup
IAT
2005
IEEE
14 years 3 months ago
Self-Organizing Cognitive Agents and Reinforcement Learning in Multi-Agent Environment
This paper presents a self-organizing cognitive architecture, known as TD-FALCON, that learns to function through its interaction with the environment. TD-FALCON learns the value ...
Ah-Hwee Tan, Dan Xiao
ATAL
2007
Springer
14 years 4 months ago
Transfer via inter-task mappings in policy search reinforcement learning
The ambitious goal of transfer learning is to accelerate learning on a target task after training on a different, but related, source task. While many past transfer methods have f...
Matthew E. Taylor, Shimon Whiteson, Peter Stone
ECML
2007
Springer
13 years 11 months ago
Sequence Labeling with Reinforcement Learning and Ranking Algorithms
Many problems in areas such as Natural Language Processing, Information Retrieval, or Bioinformatic involve the generic task of sequence labeling. In many cases, the aim is to assi...
Francis Maes, Ludovic Denoyer, Patrick Gallinari
AAAI
2004
13 years 11 months ago
Performance Bounded Reinforcement Learning in Strategic Interactions
Despite increasing deployment of agent technologies in several business and industry domains, user confidence in fully automated agent driven applications is noticeably lacking. T...
Bikramjit Banerjee, Jing Peng