Sciweavers

132 search results - page 5 / 27
» Rewarding Behaviors
Sort
View
ATAL
2010
Springer
13 years 10 months ago
Combining manual feedback with subsequent MDP reward signals for reinforcement learning
As learning agents move from research labs to the real world, it is increasingly important that human users, including those without programming skills, be able to teach agents de...
W. Bradley Knox, Peter Stone
AAAI
2011
12 years 9 months ago
Optimal Rewards versus Leaf-Evaluation Heuristics in Planning Agents
Planning agents often lack the computational resources needed to build full planning trees for their environments. Agent designers commonly overcome this finite-horizon approxima...
Jonathan Sorg, Satinder P. Singh, Richard L. Lewis
ICRA
2008
IEEE
128views Robotics» more  ICRA 2008»
14 years 4 months ago
Intrinsically motivated hierarchical manipulation
— We present a framework for the programming of manipulation behavior by means of an intrinsic reward function that encourages the building of deep control knowledge. We show how...
Stephen Hart, Shijaj Sen, Roderic A. Grupen
SIGECOM
2009
ACM
114views ECommerce» more  SIGECOM 2009»
14 years 4 months ago
Policy teaching through reward function learning
Policy teaching considers a Markov Decision Process setting in which an interested party aims to influence an agent’s decisions by providing limited incentives. In this paper, ...
Haoqi Zhang, David C. Parkes, Yiling Chen
NIPS
2003
13 years 11 months ago
Eye Movements for Reward Maximization
Recent eye tracking studies in natural tasks suggest that there is a tight link between eye movements and goal directed motor actions. However, most existing models of human eye m...
Nathan Sprague, Dana H. Ballard