Search Sciweavers | Sciweavers

87 search results - page 9 / 18

» A policy iteration algorithm for Markov decision processes s...

click to vote

IJCAI
2001

185views Artificial Intelligence» more IJCAI 2001»

Symbolic Dynamic Programming for First-Order MDPs

13 years 9 months ago

Download www.cs.toronto.edu

We present a dynamic programming approach for the solution of first-order Markov decisions processes. This technique uses an MDP whose dynamics is represented in a variant of the ...

Craig Boutilier, Raymond Reiter, Bob Price

claim paper

Read More »

click to vote

TSMC
2011

258views more TSMC 2011»

Cross-Entropy Optimization of Control Policies With Adaptive Basis Functions

13 years 2 months ago

Download www.montefiore.ulg.ac.be

—This paper introduces an algorithm for direct search of control policies in continuous-state discrete-action Markov decision processes. The algorithm looks for the best closed-l...

Lucian Busoniu, Damien Ernst, Bart De Schutter, Ro...

claim paper

Read More »

click to vote

ICML
2003
IEEE

121views Machine Learning» more ICML 2003»

Planning in the Presence of Cost Functions Controlled by an Adversary

14 years 8 months ago

Download www.cs.cmu.edu

We investigate methods for planning in a Markov Decision Process where the cost function is chosen by an adversary after we fix our policy. As a running example, we consider a rob...

H. Brendan McMahan, Geoffrey J. Gordon, Avrim Blum

claim paper

Read More »

click to vote

ATAL
2006
Springer

109views Intelligent Agents» more ATAL 2006»

On the relationship between MDPs and the BDI architecture

13 years 11 months ago

Download www.sci.brooklyn.cuny.edu

In this paper we describe the initial results of an investigation into the relationship between Markov Decision Processes (MDPs) and Belief-Desire-Intention (BDI) architectures. W...

Gerardo I. Simari, Simon Parsons

claim paper

Read More »

click to vote

AAAI
2010

172views Intelligent Agents» more AAAI 2010»

Using Bisimulation for Policy Transfer in MDPs

13 years 9 months ago

Download www.cs.mcgill.ca

Knowledge transfer has been suggested as a useful approach for solving large Markov Decision Processes. The main idea is to compute a decision-making policy in one environment and...

Pablo Samuel Castro, Doina Precup

claim paper

Read More »

« Prev « First page 9 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers