Search Sciweavers | Sciweavers

204

ATAL
2010
Springer

158views Intelligent Agents» more ATAL 2010»

Combining manual feedback with subsequent MDP reward signals for reinforcement learning

15 years 7 months ago

As learning agents move from research labs to the real world, it is increasingly important that human users, including those without programming skills, be able to teach agents de...

W. Bradley Knox, Peter Stone

claim paper

Read More »

164

click to vote

AAAI
2011

149views Intelligent Agents» more AAAI 2011»

Optimal Rewards versus Leaf-Evaluation Heuristics in Planning Agents

14 years 5 months ago

Download www.eecs.umich.edu

Planning agents often lack the computational resources needed to build full planning trees for their environments. Agent designers commonly overcome this ﬁnite-horizon approxima...

Jonathan Sorg, Satinder P. Singh, Richard L. Lewis

claim paper

Read More »

150

click to vote

ICRA
2008
IEEE

128views Robotics» more ICRA 2008»

Intrinsically motivated hierarchical manipulation

16 years 10 days ago

Download www-robotics.cs.umass.edu

— We present a framework for the programming of manipulation behavior by means of an intrinsic reward function that encourages the building of deep control knowledge. We show how...

Stephen Hart, Shijaj Sen, Roderic A. Grupen

claim paper

Read More »

137

click to vote

SIGECOM
2009
ACM

114views ECommerce» more SIGECOM 2009»

Policy teaching through reward function learning

16 years 12 days ago

Download www.eecs.harvard.edu

Policy teaching considers a Markov Decision Process setting in which an interested party aims to inﬂuence an agent’s decisions by providing limited incentives. In this paper, ...

Haoqi Zhang, David C. Parkes, Yiling Chen

claim paper

Read More »

158

click to vote

NIPS
2003

268views Information Technology» more NIPS 2003»

Eye Movements for Reward Maximization

15 years 7 months ago

Download books.nips.cc

Recent eye tracking studies in natural tasks suggest that there is a tight link between eye movements and goal directed motor actions. However, most existing models of human eye m...

Nathan Sprague, Dana H. Ballard

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers