Search Sciweavers | Sciweavers

337 search results - page 39 / 68

» Mean-Variance Optimization in Markov Decision Processes

CCE
2004

162views Software Engineering» more CCE 2004»

An algorithmic framework for improving heuristic solutions: Part II. A new version of the stochastic traveling salesman problem

13 years 7 months ago

Download www.che.gatech.edu

The algorithmic framework developed for improving heuristic solutions of the new version of deterministic TSP [Choi et al., 2002] is extended to the stochastic case. To verify the...

Jaein Choi, Jay H. Lee, Matthew J. Realff

claim paper

Read More »

click to vote

JMLR
2010

125views more JMLR 2010»

Variational methods for Reinforcement Learning

13 years 2 months ago

Download jmlr.csail.mit.edu

We consider reinforcement learning as solving a Markov decision process with unknown transition distribution. Based on interaction with the environment, an estimate of the transit...

Thomas Furmston, David Barber

claim paper

Read More »

click to vote

ATAL
2008
Springer

138views Intelligent Agents» more ATAL 2008»

Reinforcement learning for DEC-MDPs with changing action sets and partially ordered dependencies

13 years 9 months ago

Download ml.informatik.uni-freiburg.de

Decentralized Markov decision processes are frequently used to model cooperative multi-agent systems. In this paper, we identify a subclass of general DEC-MDPs that features regul...

Thomas Gabel, Martin A. Riedmiller

claim paper

Read More »

click to vote

INFOCOM
2011
IEEE

222views Communications» more INFOCOM 2011»

A dynamic relay selection scheme for mobile users in wireless relay networks

12 years 11 months ago

Download www3.ntu.edu.sg

—Cooperative communication has attracted dramatic attention in the last few years due to its advantage in mitigating channel fading. Despite much effort that has been made in the...

Yifan Li, Ping Wang, Dusit Niyato, Weihua Zhuang

claim paper

Read More »

click to vote

AUTOMATICA
2008

74views more AUTOMATICA 2008»

Policy iteration based feedback control

13 years 7 months ago

Download www.cfins.au.tsinghua.edu.cn

It is well known that stochastic control systems can be viewed as Markov decision processes (MDPs) with continuous state spaces. In this paper, we propose to apply the policy iter...

Kan-Jian Zhang, Yan-Kai Xu, Xi Chen, Xi-Ren Cao

claim paper

Read More »

« Prev « First page 39 / 68 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers