Search Sciweavers | Sciweavers

202 search results - page 3 / 41

» Comments on the Origin and Application of Markov Decision Pr...

IJCAI
2007

201 views, Artificial Intelligence

Using Linear Programming for Bayesian Exploration in Markov Decision Processes

13 years 10 months ago

Download www.cs.mcgill.ca

A key problem in reinforcement learning is finding a good balance between the need to explore the environment and the need to gain rewards by exploiting existing knowledge. Much ...

Pablo Samuel Castro, Doina Precup

MOR
2007

109 views

Solution and Forecast Horizons for Infinite-Horizon Nonhomogeneous Markov Decision Processes

13 years 8 months ago

Download www-personal.umich.edu

We consider the problem of solving a nonhomogeneous infinite horizon Markov Decision Process (MDP) problem in the general case of potentially multiple optimal first period polic...

Torpong Cheevaprawatdomrong, Irwin E. Schochetman,...

AIPS
2004

142 views, Artificial Intelligence

Heuristic Refinements of Approximate Linear Programming for Factored Continuous-State Markov Decision Processes

13 years 10 months ago

Download www.cs.pitt.edu

Approximate linear programming (ALP) offers a promising framework for solving large factored Markov decision processes (MDPs) with both discrete and continuous states. Successful ...

Branislav Kveton, Milos Hauskrecht

ALT
2008
Springer

141 views, Machine Learning

Online Regret Bounds for Markov Decision Processes with Deterministic Transitions

14 years 5 months ago

Download personal.unileoben.ac.at

Abstract. We consider an upper confidence bound algorithm for Markov decision processes (MDPs) with deterministic transitions. For this algorithm we derive upper bounds on the onl...

Ronald Ortner

NFM
2011

225 views, Formal Methods

Synthesis for PCTL in Parametric Markov Decision Processes

13 years 3 months ago

Download www.veriware.org

Abstract. In parametric Markov Decision Processes (PMDPs), transition probabilities are not fixed, but are given as functions over a set of parameters. A PMDP denotes a family of ...

Ernst Moritz Hahn, Tingting Han, Lijun Zhang
