Control Double Inverted Pendulum by Reinforcement Learning with Double CMAC Network

To accelerate reinforcement learning, many types of function approximation are used to represent the state value. However, function approximation reduces the accuracy of the state value and makes convergence more difficult. To address the tradeoff between generalization and accuracy in reinforcement learning, we represent the state-action value with two CMAC networks that have different generalization parameters. The accuracy CMAC network can represent values exactly, which achieves precise control in states around the target area. The generalization CMAC network can extend experience to unknown areas and guide the learning of the accuracy CMAC network. The proposed algorithm effectively avoids the dilemma of trading off generalization against accuracy. Simulation results for the control of a double inverted pendulum are presented to show the effectiveness of the proposed algorithm.
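The effect of the generalization parameter can be illustrated with a small tile-coding sketch. This is not the authors' implementation: the `CMAC` class, the one-dimensional input, the tile widths, and the learning rate are all assumptions chosen for illustration, and the abstract does not say how the two networks are combined, so the sketch only contrasts how far a single training sample spreads in a fine-tiled ("accuracy") network versus a coarse-tiled ("generalization") network.

```python
import numpy as np


class CMAC:
    """Minimal tile-coding CMAC over a 1-D input in [lo, hi) (illustrative only)."""

    def __init__(self, n_tilings, tile_width, lo=-1.0, hi=1.0, lr=0.5):
        self.n_tilings = n_tilings
        self.tile_width = tile_width
        self.lo = lo
        self.lr = lr
        n_tiles = int(np.ceil((hi - lo) / tile_width)) + 1
        self.w = np.zeros((n_tilings, n_tiles))
        # Each tiling is shifted by a different fraction of the tile width.
        self.offsets = np.arange(n_tilings) * tile_width / n_tilings

    def _active(self, x):
        # Index of the single active tile in every tiling.
        return ((x - self.lo + self.offsets) // self.tile_width).astype(int)

    def predict(self, x):
        idx = self._active(x)
        return self.w[np.arange(self.n_tilings), idx].sum()

    def update(self, x, target):
        # Spread the correction equally over the active tiles.
        idx = self._active(x)
        err = target - self.predict(x)
        self.w[np.arange(self.n_tilings), idx] += self.lr * err / self.n_tilings


# Hypothetical settings: narrow tiles for accuracy, wide tiles for generalization.
accuracy = CMAC(n_tilings=4, tile_width=0.05)
generalization = CMAC(n_tilings=4, tile_width=0.5)

# Train both networks on a single sample: value 1.0 at state 0.0.
for _ in range(50):
    accuracy.update(0.0, 1.0)
    generalization.update(0.0, 1.0)

print("at trained state:", accuracy.predict(0.0), generalization.predict(0.0))
print("at nearby state: ", accuracy.predict(0.2), generalization.predict(0.2))
```

In this sketch both networks fit the trained state, but only the coarse network carries a nonzero estimate over to the unvisited neighbour at 0.2, which is the kind of extension to unknown areas the abstract attributes to the generalization network. The paper works with state-action values for a double inverted pendulum, which would need a multi-dimensional input rather than the scalar used here.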

Siwei Luo, Yu Zheng, Ziang Lv

Accuracy CMAC Network | CMAC Networks | Computer Vision | Generalization CMAC Network | ICPR 2006

» Combining Reinforcement Learning with a Local Control Algorithm

» Exploring the T-Maze: Evolving Learning-Like Robot Behaviors Using CTRNNs

» Transfer of Neuroevolved Controllers in Unstable Domains

Post Info

Added	09 Nov 2009
Updated	09 Nov 2009
Type	Conference
Year	2006
Where	ICPR
Authors	Siwei Luo, Yu Zheng, Ziang Lv

Sciweavers
