Search Sciweavers | Sciweavers

332 search results - page 46 / 67

» Ranking policies in discrete Markov decision processes

NIPS
2001

144 views, Information Technology

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

13 years 9 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

IJCAI
2001

174 views, Artificial Intelligence

Complexity of Probabilistic Planning under Average Rewards

13 years 9 months ago

Download www.informatik.uni-freiburg.de

A general and expressive model of sequential decision making under uncertainty is provided by the Markov decision processes (MDPs) framework. Complex applications with very large ...

Jussi Rintanen

IROS
2006
IEEE

121 views, Robotics

Planning and Acting in Uncertain Environments using Probabilistic Inference

14 years 2 months ago

Download www.cs.washington.edu

An important problem in robotics is planning and selecting actions for goal-directed behavior in noisy uncertain environments. The problem is typically addressed within the fra...

Deepak Verma, Rajesh P. N. Rao

AIPS
2008

111 views, Artificial Intelligence

Multiagent Planning Under Uncertainty with Stochastic Communication Delays

13 years 10 months ago

Download www.aaai.org

We consider the problem of cooperative multiagent planning under uncertainty, formalized as a decentralized partially observable Markov decision process (Dec-POMDP). Unfortunately...

Matthijs T. J. Spaan, Frans A. Oliehoek, Nikos A. ...

EOR
2006

106 views

Optimal dynamic assignment of a flexible worker on an open production line with specialists

13 years 8 months ago

Download users.iems.northwestern.edu

This paper models and analyzes serial production lines with specialists at each station and a single, cross-trained floating worker who can work at any station. We formulate Marko...

Linn I. Sennott, Mark P. Van Oyen, Seyed M. R. Ira...

