Search Sciweavers | Sciweavers

The Markov chain approximation method is a widely used, relatively easy to use, and efficient family of methods for the bulk of stochastic control problems in continuous time, for...

Harold J. Kushner

claim paper

Read More »

click to vote

EOR
2008

88views more EOR 2008»

Selection of a correlated equilibrium in Markov stopping games

13 years 11 months ago

Download www.math.s.chiba-u.ac.jp

This paper deals with an extension of the concept of correlated strategies to Markov stopping games. The Nash equilibrium approach to solving nonzero-sum stopping games may give m...

David M. Ramsey, Krzysztof Szajowski

claim paper

Read More »

click to vote

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

Relational temporal difference learning

14 years 11 months ago

Download cll.stanford.edu

We introduce relational temporal difference learning as an effective approach to solving multi-agent Markov decision problems with large state spaces. Our algorithm uses temporal ...

Nima Asgharbeygi, David J. Stracuzzi, Pat Langley

claim paper

Read More »

« Prev « First page 1 / 234 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers