Search Sciweavers | Sciweavers

1799 search results - page 70 / 360

» Filtered Reinforcement Learning

144

click to vote

AAAI
2007

68views Intelligent Agents» more AAAI 2007»

A Reinforcement Learning Algorithm with Polynomial Interaction Complexity for Only-Costly-Observable MDPs

15 years 9 months ago

Download www.aaai.org

An Unobservable MDP (UMDP) is a POMDP in which there are no observations. An Only-Costly-Observable MDP (OCOMDP) is a POMDP which extends an UMDP by allowing a particular costly a...

Roy Fox, Moshe Tennenholtz

claim paper

Read More »

224

click to vote

ATAL
2008
Springer

176views Intelligent Agents» more ATAL 2008»

Analysis of an evolutionary reinforcement learning method in a multiagent domain

15 years 9 months ago

Download www.aamas-conference.org

Many multiagent problems comprise subtasks which can be considered as reinforcement learning (RL) problems. In addition to classical temporal difference methods, evolutionary algo...

Jan Hendrik Metzen, Mark Edgington, Yohannes Kassa...

claim paper

Read More »

156

click to vote

ICPR
2006
IEEE

260views computer vision» more ICPR 2006»

Control Double Inverted Pendulum by Reinforcement Learning with Double CMAC Network

16 years 8 months ago

Download ee2.chit.edu.tw

To accelerate the learning of reinforcement learning, many types of function approximation are used to represent state value. However function approximation reduces the accuracy o...

Siwei Luo, Yu Zheng, Ziang Lv

claim paper

Read More »

179

click to vote

ICRA
2005
IEEE

140views Robotics» more ICRA 2005»

Fast Reinforcement Learning for Vision-guided Mobile Robots

16 years 24 days ago

Download aass.oru.se

— This paper presents a new reinforcement learning algorithm for accelerating acquisition of new skills by real mobile robots, without requiring simulation. It speeds up Q-learni...

Tomás Martínez-Marín, Tom Duc...

claim paper

Read More »

292

click to vote

ATAL
2008
Springer

136views Intelligent Agents» more ATAL 2008»

Efficient multi-agent reinforcement learning through automated supervision

15 years 9 months ago

Download www.cs.umass.edu

Multi-Agent Reinforcement Learning (MARL) algorithms suffer from slow convergence and even divergence, especially in large-scale systems. In this work, we develop a supervision fr...

Chongjie Zhang, Sherief Abdallah, Victor R. Lesser

claim paper

Read More »

« Prev « First page 70 / 360 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers