Sciweavers

» Inter-Task Action Correlation for Reinforcement Learning Tas...
WSDM
2012
ACM
Selecting actions for resource-bounded information extraction using reinforcement learning
Given a database with missing or uncertain content, our goal is to correct and fill the database by extracting specific information from a large corpus such as the Web, and to d...
Pallika H. Kanani, Andrew K. McCallum
IROS
2008
IEEE
Dynamic correlation matrix based multi-Q learning for a multi-robot system
Multi-robot reinforcement learning is a very challenging area due to several issues, such as large state spaces, difficulty in reward assignment, nondeterministic action selecti...
Hongliang Guo, Yan Meng
SCAI
2008
Fast Learning in an Actor-Critic Architecture with Reward and Punishment
A reinforcement architecture is introduced that consists of three complementary learning systems with different generalization abilities. The ACTOR learns state-action as...
Christian Balkenius, Stefan Winberg