Search Sciweavers | Sciweavers

515 search results - page 8 / 103

» Approximating Markov Processes by Averaging

142

click to vote

AAAI
1998

129views Intelligent Agents» more AAAI 1998»

Solving Very Large Weakly Coupled Markov Decision Processes

15 years 5 months ago

Download www.cs.toronto.edu

We present a technique for computing approximately optimal solutions to stochastic resource allocation problems modeled as Markov decision processes (MDPs). We exploit two key pro...

Nicolas Meuleau, Milos Hauskrecht, Kee-Eung Kim, L...

claim paper

Read More »

151

click to vote

ICML
2006
IEEE

143views Machine Learning» more ICML 2006»

Fast direct policy evaluation using multiscale analysis of Markov diffusion processes

16 years 5 months ago

Download www.cs.umass.edu

Policy evaluation is a critical step in the approximate solution of large Markov decision processes (MDPs), typically requiring O(|S|3 ) to directly solve the Bellman system of |S...

Mauro Maggioni, Sridhar Mahadevan

claim paper

Read More »

147

click to vote

ECAI
2000
Springer

90views Artificial Intelligence» more ECAI 2000»

Efficient Asymptotic Approximation in Temporal Difference Learning

15 years 8 months ago

Download www.inra.fr

Abstract. TD(

Frédérick Garcia, Florent Serre

claim paper

Read More »

131

click to vote

CORR
2010
Springer

112views Education» more CORR 2010»

Efficient Approximation of Optimal Control for Markov Games

15 years 4 months ago

Download react.cs.uni-sb.de

The success of probabilistic model checking for discrete-time Markov decision processes and continuous-time Markov chains has led to rich academic and industrial applications. The ...

Markus Rabe, Sven Schewe, Lijun Zhang

claim paper

Read More »

173

click to vote

JMLR
2010

189views more JMLR 2010»

Adaptive Step-size Policy Gradients with Average Reward Metric

14 years 11 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

claim paper

Read More »

« Prev « First page 8 / 103 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers