Imitating Inscrutable Enemies: Learning from Stochastic Policy Observation, Retrieval and Reuse

14 years 5 months ago

Download www.cse.lehigh.edu

In this paper we study the topic of CBR systems learning from observations in which those observations can be represented as stochastic policies. We describe a general framework which encompasses three steps: (1) it observes agents performing actions, elicits stochastic policies representing the agents’ strategies and retains these policies as cases. (2) The agent analyzes the environment and retrieves a suitable stochastic policy. (3) The agent then executes the retrieved stochastic policy, which results in the agent mimicking the previously observed agent. We implement our framework in a system called JuKeCB that observes and mimics players playing games. We present the results of three sets of experiments designed to evaluate our framework. The first experiment demonstrates that JuKeCB performs well when trained against a variety of fixed strategy opponents. The second experiment demonstrates that JuKeCB can also, after training, win against an opponent with a dynamic strategy. Th...

Kellen Gillespie, Justin Karneeb, Stephen Lee-Urba

Real-time Traffic

Automated Reasoning | Elicits Stochastic Policies | ICCBR 2010 | Stochastic Policies | Stochastic Policy |

claim paper

Post Info
More Details (n/a)

Added	15 Aug 2010
Updated	15 Aug 2010
Type	Conference
Year	2010
Where	ICCBR
Authors	Kellen Gillespie, Justin Karneeb, Stephen Lee-Urban, Héctor Muñoz-Avila

Comments (0)

Sciweavers

Imitating Inscrutable Enemies: Learning from Stochastic Policy Observation, Retrieval and Reuse

Automated Reasoning | Elicits Stochastic Policies | ICCBR 2010 | Stochastic Policies | Stochastic Policy |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers