Search Sciweavers | Sciweavers

343 search results - page 14 / 69

» Action discovery for reinforcement learning

AUSAI
2008
Springer

105 views | Artificial Intelligence

Partial Order Hierarchical Reinforcement Learning

13 years 9 months ago

Download www.cse.unsw.edu.au

In this paper, the notion of a partial-order plan is extended to task-hierarchies. We introduce the concept of a partial-order task-hierarchy that decomposes a problem using multi-ta...

Bernhard Hengst

AAAI
2007

68 views | Intelligent Agents

A Reinforcement Learning Algorithm with Polynomial Interaction Complexity for Only-Costly-Observable MDPs

13 years 10 months ago

Download www.aaai.org

An Unobservable MDP (UMDP) is a POMDP in which there are no observations. An Only-Costly-Observable MDP (OCOMDP) is a POMDP that extends a UMDP by allowing a particular costly a...

Roy Fox, Moshe Tennenholtz

KES
2004
Springer

165 views | Information Technology

Coordination in Multiagent Reinforcement Learning Systems

14 years 1 month ago

Download cig.ees.kyushu-u.ac.jp

This paper presents a novel method for online coordination in multiagent reinforcement learning systems. In this method, a reinforcement-learning agent learns to select its action ...

M. A. S. Kamal, Junichi Murata

TFS
2011

239 views | Education

Systems Control With Generalized Probabilistic Fuzzy-Reinforcement Learning

13 years 2 months ago

Download www.triteq.com

Reinforcement learning (RL) is a valuable learning method when the systems require a selection of control actions whose consequences emerge over long periods for which input– ...

William M. Hinojosa, Samia Nefti, Uzay Kaymak

NIPS
2003

148 views | Information Technology

Approximate Planning in POMDPs with Macro-Actions

13 years 9 months ago

Download books.nips.cc

Recent research has demonstrated that useful POMDP solutions do not require consideration of the entire belief space. We extend this idea with the notion of temporal abstraction. ...

Georgios Theocharous, Leslie Pack Kaelbling
