Search Sciweavers | Sciweavers

24

GECCO
2008
Springer

183views Optimization» more GECCO 2008»

13 years 8 months ago

This paper investigates how the Univariate Marginal Distribution Algorithm (UMDA) behaves in non-stationary environments when engaging in sampling and selection strategies designe...

Carlos M. Fernandes, Cláudio F. Lima, Agost...

claim paper

Read More »

25

click to vote

WSC
2007

123views Modeling And Simulation» more WSC 2007»

Stochastic trust region gradient-free method (strong): a new response-surface-based algorithm in simulation optimization

13 years 9 months ago

Download www.informs-sim.org

Response Surface Methodology (RSM) is a metamodelbased optimization method. Its strategy is to explore small subregions of the parameter space in succession instead of attempting ...

Kuo-Hao Chang, L. Jeff Hong, Hong Wan

claim paper

Read More »

30

click to vote

UAI
2000

136views Artificial Intelligence» more UAI 2000»

Fast Planning in Stochastic Games

13 years 8 months ago

Download www.cis.upenn.edu

Stochastic games generalize Markov decision processes MDPs to a multiagent setting by allowing the state transitions to depend jointly on all player actions, and having rewards de...

Michael J. Kearns, Yishay Mansour, Satinder P. Sin...

claim paper

Read More »

27

click to vote

ATAL
2010
Springer

115views Intelligent Agents» more ATAL 2010»

Self-organization for coordinating decentralized reinforcement learning

13 years 8 months ago

Download www.cs.umass.edu

Decentralized reinforcement learning (DRL) has been applied to a number of distributed applications. However, one of the main challenges faced by DRL is its convergence. Previous ...

Chongjie Zhang, Victor R. Lesser, Sherief Abdallah

claim paper

Read More »

20

click to vote

CORR
2010
Springer

127views Education» more CORR 2010»

Mean field for Markov Decision Processes: from Discrete to Continuous Optimization

13 years 7 months ago

Download infoscience.epfl.ch

We study the convergence of Markov Decision Processes made of a large number of objects to optimization problems on ordinary differential equations (ODE). We show that the optimal...

Nicolas Gast, Bruno Gaujal, Jean-Yves Le Boudec

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers