Search Sciweavers | Sciweavers

688 search results - page 28 / 138

» Using reinforcement learning to adapt an imitation task

146

click to vote

IAT
2008
IEEE

161views Intelligent Agents» more IAT 2008»

Scaling Up Multi-agent Reinforcement Learning in Complex Domains

15 years 4 months ago

Download www3.ntu.edu.sg

TD-FALCON (Temporal Difference - Fusion Architecture for Learning, COgnition, and Navigation) is a class of self-organizing neural networks that incorporates Temporal Difference (...

Dan Xiao, Ah-Hwee Tan

claim paper

Read More »

127

click to vote

ICAI
2004

116views Artificial Intelligence» more ICAI 2004»

Action Inhibition

15 years 6 months ago

Download mysite.verizon.net

An explicit exploration strategy is necessary in reinforcement learning (RL) to balance the need to reduce the uncertainty associated with the expected outcome of an action and the...

Myriam Abramson

claim paper

Read More »

145

click to vote

IDEAL
2007
Springer

127views Intelligent Agents» more IDEAL 2007»

Skill Combination for Reinforcement Learning

15 years 10 months ago

Download www.cs.qub.ac.uk

Recently researchers have introduced methods to develop reusable knowledge in reinforcement learning (RL). In this paper, we define simple principles to combine skills in reinforce...

Zhihui Luo, David A. Bell, Barry McCollum

claim paper

Read More »

133

click to vote

ATAL
2005
Springer

130views Intelligent Agents» more ATAL 2005»

Behavior transfer for value-function-based reinforcement learning

15 years 10 months ago

Download www.cs.huji.ac.il

Temporal difference (TD) learning methods [22] have become popular reinforcement learning techniques in recent years. TD methods have had some experimental successes and have been...

Matthew E. Taylor, Peter Stone

claim paper

Read More »

151

click to vote

WSDM
2012
ACM

214views Data Mining» more WSDM 2012»

Selecting actions for resource-bounded information extraction using reinforcement learning

14 years 7 days ago

Download people.cs.umass.edu

Given a database with missing or uncertain content, our goal is to correct and ﬁll the database by extracting speciﬁc information from a large corpus such as the Web, and to d...

Pallika H. Kanani, Andrew K. McCallum

claim paper

Read More »

« Prev « First page 28 / 138 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers