We identify two fundamental links between CBR and an adaptive agent that learns by trial and error without a model of its environment. The first link concerns the most efficient exploitation of the experience the agent has collected by interacting with its environment, while the second relates to the acquisition and representation of a suitable behavior policy. Combining the two, we develop a state-action value function approximation mechanism that relies on case-based, approximate transition graphs and forms the basis on which the agent improves its behavior. We evaluate our approach empirically on dynamic control tasks.
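The abstract does not detail how the case-based, approximate transition graph backs the state-action value estimates, so the following is only a minimal sketch of one plausible reading: stored transition cases are linked to their nearest stored successors, value-iteration-style sweeps run over that graph, and queries are answered by nearest-neighbour retrieval. All class and method names, the distance measure, and the k-nearest-neighbour linking rule are assumptions made for illustration, not the paper's actual mechanism.

```python
import numpy as np

class CaseBasedQApproximator:
    """Illustrative sketch (not the paper's method): Q-value estimates
    derived from a case base of observed transitions via an approximate
    transition graph and value-iteration-style sweeps."""

    def __init__(self, gamma=0.95, k_neighbors=3):
        self.gamma = gamma
        self.k = k_neighbors
        self.cases = []      # list of (state, action, reward, next_state)
        self.q_values = []   # one Q estimate per stored case

    def add_case(self, state, action, reward, next_state):
        # Each interaction with the environment is stored as a case.
        self.cases.append((np.asarray(state, float), action,
                           float(reward), np.asarray(next_state, float)))
        self.q_values.append(0.0)

    def _successors(self, next_state):
        # Indices of the k cases whose stored state is closest to the
        # successor state; these become the outgoing edges in the graph.
        dists = [np.linalg.norm(s - next_state) for s, _, _, _ in self.cases]
        return np.argsort(dists)[:self.k]

    def build_and_solve(self, sweeps=50):
        # Approximate transition graph: case i -> cases nearest to its successor.
        graph = [self._successors(ns) for _, _, _, ns in self.cases]
        # Value-iteration-style sweeps over the graph update case-level Q values.
        for _ in range(sweeps):
            for i, (_, _, r, _) in enumerate(self.cases):
                best_next = max(self.q_values[j] for j in graph[i])
                self.q_values[i] = r + self.gamma * best_next

    def query(self, state, action):
        # A query is answered by the nearest stored case with the same action.
        candidates = [(np.linalg.norm(s - state), q)
                      for (s, a, _, _), q in zip(self.cases, self.q_values)
                      if a == action]
        return min(candidates)[1] if candidates else 0.0
```

Under these assumptions, the agent would act greedily with respect to `query(state, action)` over its available actions, rebuilding and re-solving the graph as new cases accumulate.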