Sciweavers

» Using reinforcement learning to adapt an imitation task
IAT
2008
IEEE
13 years 7 months ago
Scaling Up Multi-agent Reinforcement Learning in Complex Domains
TD-FALCON (Temporal Difference - Fusion Architecture for Learning, COgnition, and Navigation) is a class of self-organizing neural networks that incorporates Temporal Difference (...
Dan Xiao, Ah-Hwee Tan
ICAI
2004
13 years 9 months ago
Action Inhibition
An explicit exploration strategy is necessary in reinforcement learning (RL) to balance the need to reduce the uncertainty associated with the expected outcome of an action and the...
Myriam Abramson
IDEAL
2007
Springer
14 years 1 month ago
Skill Combination for Reinforcement Learning
Recently researchers have introduced methods to develop reusable knowledge in reinforcement learning (RL). In this paper, we define simple principles to combine skills in reinforce...
Zhihui Luo, David A. Bell, Barry McCollum
ATAL
2005
Springer
14 years 1 month ago
Behavior transfer for value-function-based reinforcement learning
Temporal difference (TD) learning methods [22] have become popular reinforcement learning techniques in recent years. TD methods have had some experimental successes and have been...
Matthew E. Taylor, Peter Stone
WSDM
2012
ACM
12 years 3 months ago
Selecting actions for resource-bounded information extraction using reinforcement learning
Given a database with missing or uncertain content, our goal is to correct and fill the database by extracting specific information from a large corpus such as the Web, and to d...
Pallika H. Kanani, Andrew K. McCallum