Counter Example for Q-Bucket-Brigade Under Prediction Problem

14 years 8 months ago

Download www.cs.bham.ac.uk

Aiming to clarify the convergence or divergence conditions for Learning Classiﬁer System (LCS), this paper explores: (1) an extreme condition where the reinforcement process of LCS diverges; and (2) methods to avoid such divergence. Based on our previous work that showed equivalence between LCS’s reinforcement process and Reinforcement Learning (RL) with Function approximation (FA) method, we present a counter-example for LCS with Q-bucketbrigade based on the 11-state star problem, a counterexample originally proposed to show the divergence of Qlearning with linear FA. Furthermore, the empirical results applying the counter-example to LCS veriﬁed the results predicted from the theory: (1) LCS with Q-bucket-brigade diverged under the prediction problem, where the action selection policy was ﬁxed; and (2) such divergence was avoided by using implicit-bucket-brigade or applying residual gradient algorithm to Q-bucket-brigade. Categories and Subject Descriptors I.2.6 [Artiﬁcial ...

Atsushi Wada, Keiki Takadama, Katsunori Shimohara

Real-time Traffic

IWLCS 2005 | LCS’s Reinforcement Process | Machine Learning | Reinforcement Learning | Reinforcement Process |

claim paper

Post Info
More Details (n/a)

Added	28 Jun 2010
Updated	28 Jun 2010
Type	Conference
Year	2005
Where	IWLCS
Authors	Atsushi Wada, Keiki Takadama, Katsunori Shimohara

Comments (0)

Sciweavers

Counter Example for Q-Bucket-Brigade Under Prediction Problem

IWLCS 2005 | LCS’s Reinforcement Process | Machine Learning | Reinforcement Learning | Reinforcement Process |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers