Sciweavers

1799 search results - page 70 / 360
» Filtered Reinforcement Learning
Sort
View
AAAI
2007
13 years 11 months ago
A Reinforcement Learning Algorithm with Polynomial Interaction Complexity for Only-Costly-Observable MDPs
An Unobservable MDP (UMDP) is a POMDP in which there are no observations. An Only-Costly-Observable MDP (OCOMDP) is a POMDP which extends an UMDP by allowing a particular costly a...
Roy Fox, Moshe Tennenholtz
ATAL
2008
Springer
13 years 11 months ago
Analysis of an evolutionary reinforcement learning method in a multiagent domain
Many multiagent problems comprise subtasks which can be considered as reinforcement learning (RL) problems. In addition to classical temporal difference methods, evolutionary algo...
Jan Hendrik Metzen, Mark Edgington, Yohannes Kassa...
ICPR
2006
IEEE
14 years 10 months ago
Control Double Inverted Pendulum by Reinforcement Learning with Double CMAC Network
To accelerate the learning of reinforcement learning, many types of function approximation are used to represent state value. However function approximation reduces the accuracy o...
Siwei Luo, Yu Zheng, Ziang Lv
ICRA
2005
IEEE
140views Robotics» more  ICRA 2005»
14 years 2 months ago
Fast Reinforcement Learning for Vision-guided Mobile Robots
— This paper presents a new reinforcement learning algorithm for accelerating acquisition of new skills by real mobile robots, without requiring simulation. It speeds up Q-learni...
Tomás Martínez-Marín, Tom Duc...
ATAL
2008
Springer
13 years 11 months ago
Efficient multi-agent reinforcement learning through automated supervision
Multi-Agent Reinforcement Learning (MARL) algorithms suffer from slow convergence and even divergence, especially in large-scale systems. In this work, we develop a supervision fr...
Chongjie Zhang, Sherief Abdallah, Victor R. Lesser