Sciweavers

343 search results - page 14 / 69
» Action discovery for reinforcement learning
Sort
View
AUSAI
2008
Springer
13 years 9 months ago
Partial Order Hierarchical Reinforcement Learning
In this paper the notion of a partial-order plan is extended to task-hierarchies. We introduce the concept of a partial-order taskhierarchy that decomposes a problem using multi-ta...
Bernhard Hengst
AAAI
2007
13 years 10 months ago
A Reinforcement Learning Algorithm with Polynomial Interaction Complexity for Only-Costly-Observable MDPs
An Unobservable MDP (UMDP) is a POMDP in which there are no observations. An Only-Costly-Observable MDP (OCOMDP) is a POMDP which extends an UMDP by allowing a particular costly a...
Roy Fox, Moshe Tennenholtz
KES
2004
Springer
14 years 1 months ago
Coordination in Multiagent Reinforcement Learning Systems
This paper presents a novel method for on-line coordination in multiagent reinforcement learning systems. In this method a reinforcement-learning agent learns to select its action ...
M. A. S. Kamal, Junichi Murata
TFS
2011
239views Education» more  TFS 2011»
13 years 2 months ago
Systems Control With Generalized Probabilistic Fuzzy-Reinforcement Learning
—Reinforcement learning (RL) is a valuable learning method when the systems require a selection of control actions whose consequences emerge over long periods for which input– ...
William M. Hinojosa, Samia Nefti, Uzay Kaymak
NIPS
2003
13 years 9 months ago
Approximate Planning in POMDPs with Macro-Actions
Recent research has demonstrated that useful POMDP solutions do not require consideration of the entire belief space. We extend this idea with the notion of temporal abstraction. ...
Georgios Theocharous, Leslie Pack Kaelbling