Search Sciweavers | Sciweavers

332 search results - page 16 / 67

» Ranking policies in discrete Markov decision processes

click to vote

AAAI
1997

119views Intelligent Agents» more AAAI 1997»

Structured Solution Methods for Non-Markovian Decision Processes

13 years 9 months ago

Download www.cs.toronto.edu

Markov Decision Processes (MDPs), currently a popular method for modeling and solving decision theoretic planning problems, are limited by the Markovian assumption: rewards and dy...

Fahiem Bacchus, Craig Boutilier, Adam J. Grove

claim paper

Read More »

click to vote

AAAI
2008

144views Intelligent Agents» more AAAI 2008»

A Variance Analysis for POMDP Policy Evaluation

13 years 10 months ago

Download www.cs.mcgill.ca

Partially Observable Markov Decision Processes have been studied widely as a model for decision making under uncertainty, and a number of methods have been developed to find the s...

Mahdi Milani Fard, Joelle Pineau, Peng Sun

claim paper

Read More »

click to vote

NIPS
2003

196views Information Technology» more NIPS 2003»

Approximate Policy Iteration with a Policy Language Bias

13 years 9 months ago

Download www.jair.org

We study an approach to policy selection for large relational Markov Decision Processes (MDPs). We consider a variant of approximate policy iteration (API) that replaces the usual...

Alan Fern, Sung Wook Yoon, Robert Givan

claim paper

Read More »

click to vote

ATAL
2009
Springer

146views Intelligent Agents» more ATAL 2009»

Online exploration in least-squares policy iteration

14 years 2 months ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

claim paper

Read More »

click to vote

ICML
2005
IEEE

133views Machine Learning» more ICML 2005»

A theoretical analysis of Model-Based Interval Estimation

14 years 8 months ago

Download paul.rutgers.edu

Several algorithms for learning near-optimal policies in Markov Decision Processes have been analyzed and proven efficient. Empirical results have suggested that Model-based Inter...

Alexander L. Strehl, Michael L. Littman

claim paper

Read More »

« Prev « First page 16 / 67 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers