Search Sciweavers | Sciweavers

102 search results - page 12 / 21

» MDPs with Non-Deterministic Policies

177

click to vote

NCI
2004

185views Neural Networks» more NCI 2004»

Hierarchical reinforcement learning with subpolicies specializing for learned subgoals

15 years 7 months ago

Download staff.science.uva.nl

This paper describes a method for hierarchical reinforcement learning in which high-level policies automatically discover subgoals, and low-level policies learn to specialize for ...

Bram Bakker, Jürgen Schmidhuber

claim paper

Read More »

180

click to vote

UAI
1998

99views Artificial Intelligence» more UAI 1998»

Flexible Decomposition Algorithms for Weakly Coupled Markov Decision Problems

15 years 7 months ago

Download reference.kfupm.edu.sa

This paper presents two new approaches to decomposing and solving large Markov decision problems (MDPs), a partial decoupling method and a complete decoupling method. In these app...

Ronald Parr

claim paper

Read More »

153

click to vote

AIPS
2009

97views Artificial Intelligence» more AIPS 2009»

Minimal Sufficient Explanations for Factored Markov Decision Processes

15 years 7 months ago

Download www.cs.uwaterloo.ca

Explaining policies of Markov Decision Processes (MDPs) is complicated due to their probabilistic and sequential nature. We present a technique to explain policies for factored MD...

Omar Zia Khan, Pascal Poupart, James P. Black

claim paper

Read More »

157

click to vote

AIPS
2009

126views Artificial Intelligence» more AIPS 2009»

Automatic Derivation of Memoryless Policies and Finite-State Controllers Using Classical Planners

15 years 7 months ago

Download www.tecn.upf.es

Finite-state and memoryless controllers are simple action selection mechanisms widely used in domains such as videogames and mobile robotics. Memoryless controllers stand for func...

Blai Bonet, Héctor Palacios, Hector Geffner

claim paper

Read More »

172

click to vote

AAAI
2008

123views Intelligent Agents» more AAAI 2008»

Towards Faster Planning with Continuous Resources in Stochastic Domains

15 years 8 months ago

Download www.aaai.org

Agents often have to construct plans that obey resource limits for continuous resources whose consumption can only be characterized by probability distributions. While Markov Deci...

Janusz Marecki, Milind Tambe

claim paper

Read More »

« Prev « First page 12 / 21 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers