Sciweavers

ICPR
2006
IEEE
15 years 1 months ago
Control Double Inverted Pendulum by Reinforcement Learning with Double CMAC Network
To accelerate the learning of reinforcement learning, many types of function approximation are used to represent state value. However function approximation reduces the accuracy o...
Siwei Luo, Yu Zheng, Ziang Lv