Search Sciweavers | Sciweavers

185 search results - page 4 / 37

» Simulation-Based Optimization Algorithms for Finite-Horizon ...

175

Voted

AIPS
2011

233views Artificial Intelligence» more AIPS 2011»

Sample-Based Planning for Continuous Action Markov Decision Processes

14 years 7 months ago

Download www.chrismansley.com

In this paper, we present a new algorithm that integrates recent advances in solving continuous bandit problems with sample-based rollout methods for planning in Markov Decision P...

Christopher R. Mansley, Ari Weinstein, Michael L. ...

claim paper

Read More »

120

Voted

TALG
2010

73views more TALG 2010»

Discounted deterministic Markov decision processes and discounted all-pairs shortest paths

15 years 1 months ago

Download omadani.net

We present two new algorithms for ﬁnding optimal strategies for discounted, inﬁnite-horizon, Deterministic Markov Decision Processes (DMDP). The ﬁrst one is an adaptation of...

Omid Madani, Mikkel Thorup, Uri Zwick

claim paper

Read More »

128

click to vote

ICML
2006
IEEE

144views Machine Learning» more ICML 2006»

Probabilistic inference for solving discrete and continuous state Markov Decision Processes

16 years 4 months ago

Download eprints.pascal-network.org

Inference in Markov Decision Processes has recently received interest as a means to infer goals of an observed action, policy recognition, and also as a tool to compute policies. ...

Marc Toussaint, Amos J. Storkey

claim paper

Read More »

147

click to vote

ICML
2006
IEEE

156views Machine Learning» more ICML 2006»

Learning the structure of Factored Markov Decision Processes in reinforcement learning problems

16 years 4 months ago

Download animatlab.lip6.fr

Recent decision-theoric planning algorithms are able to find optimal solutions in large problems, using Factored Markov Decision Processes (fmdps). However, these algorithms need ...

Thomas Degris, Olivier Sigaud, Pierre-Henri Wuille...

claim paper

Read More »

113

Voted

ALT
2008
Springer

141views Machine Learning» more ALT 2008»

Online Regret Bounds for Markov Decision Processes with Deterministic Transitions

16 years 12 days ago

Download personal.unileoben.ac.at

Abstract. We consider an upper conﬁdence bound algorithm for Markov decision processes (MDPs) with deterministic transitions. For this algorithm we derive upper bounds on the onl...

Ronald Ortner

claim paper

Read More »

« Prev « First page 4 / 37 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers