stationary distribution

158

ORL
2007

70views more ORL 2007»

Linear dependence of stationary distributions in ergodic Markov decision processes

15 years 6 months ago

In ergodic MDPs we consider stationary distributions of policies that coincide in all but n states, in which one of two possible actions is chosen. We give conditions and formulas...

Ronald Ortner

claim paper

Read More »

167

click to vote

CORR
2010
Springer

110views Education» more CORR 2010»

Mixing Time and Stationary Expected Social Welfare of Logit Dynamics

15 years 6 months ago

Download www.dia.unisa.it

We study logit dynamics [3] for strategic games. At every stage of the game a player is selected uniformly at random and she is assumed to play according to a noisy best-response ...

Vincenzo Auletta, Diodato Ferraioli, Francesco Pas...

claim paper

Read More »

168

Voted

ICML
2010
IEEE

167views Machine Learning» more ICML 2010»

Finite-Sample Analysis of LSTD

15 years 7 months ago

Download hal.inria.fr

In this paper we consider the problem of policy evaluation in reinforcement learning, i.e., learning the value function of a fixed policy, using the least-squares temporal-differe...

Alessandro Lazaric, Mohammad Ghavamzadeh, Ré...

claim paper

Read More »

180

click to vote

WSC
1998

107views Modeling And Simulation» more WSC 1998»

Stopping Criterion for a Simulation-Based Optimization Method

15 years 8 months ago

Download www.informs-sim.org

We consider a new simulation-based optimization method called the Nested Partitions (NP) method. This method generates a Markov chain and solving the optimization problem is equiv...

Sigurdur Ólafsson, Leyuan Shi

claim paper

Read More »

214

click to vote

DAGSTUHL
2006

137views Software Engineering» more DAGSTUHL 2006»

How fast does the stationary distribution of the Markov chain modelling EAs concentrate on the homogeneous populations for small

15 years 8 months ago

Download drops.dagstuhl.de

One of the main difficulties faced when analyzing Markov chains modelling evolutionary algorithms is that their cardinality grows quite fast. A reasonable way to deal with this iss...

Boris Mitavskiy, Jonathan E. Rowe

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers