The interaction of representations and planning objectives for decision-theoretic planning tasks

14 years 4 days ago

Download idm-lab.org

We study decision-theoretic planning or reinforcement learning in the presence of traps such as steep slopes for outdoor robots or staircases for indoor robots. In this case, achieving the goal from the start is often the primary objective while minimizing the travel time is only of secondary importance. We study how this planning objective interacts with possible representations of the planning tasks, namely whether to use a discount factor that is one or smaller than one and whether to use the action-penalty or the goal-reward representation. We show that the action-penalty representation without discounting guarantees that the plan that maximizes the expected reward also achieves the goal from the start (provided that this is possible) but neither the action-penalty representation with discounting nor the goal-reward representation with discounting have this property. We then show exactly when this trapping phenomenon occurs, using a novel interpretation of discounting, namely that...

Sven Koenig, Yaxin Liu

Real-time Traffic

Action-penalty Representation | Discounting | Goal-reward Representation | JETAI 2002 |

claim paper

Post Info
More Details (n/a)

Added	22 Dec 2010
Updated	22 Dec 2010
Type	Journal
Year	2002
Where	JETAI
Authors	Sven Koenig, Yaxin Liu

Comments (0)

Sciweavers

The interaction of representations and planning objectives for decision-theoretic planning tasks

Action-penalty Representation | Discounting | Goal-reward Representation | JETAI 2002 |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers