Sciweavers

771 search results - page 89 / 155
» Markov Decision Processes with Arbitrary Reward Processes
Sort
View
ATAL
2008
Springer
15 years 6 months ago
Reinforcement learning for DEC-MDPs with changing action sets and partially ordered dependencies
Decentralized Markov decision processes are frequently used to model cooperative multi-agent systems. In this paper, we identify a subclass of general DEC-MDPs that features regul...
Thomas Gabel, Martin A. Riedmiller
AAAI
2006
15 years 5 months ago
Using Homomorphisms to Transfer Options across Continuous Reinforcement Learning Domains
We examine the problem of Transfer in Reinforcement Learning and present a method to utilize knowledge acquired in one Markov Decision Process (MDP) to bootstrap learning in a mor...
Vishal Soni, Satinder P. Singh
EOR
2006
106views more  EOR 2006»
15 years 4 months ago
Optimal dynamic assignment of a flexible worker on an open production line with specialists
This paper models and analyzes serial production lines with specialists at each station and a single, cross-trained floating worker who can work at any station. We formulate Marko...
Linn I. Sennott, Mark P. Van Oyen, Seyed M. R. Ira...
JAIR
2008
107views more  JAIR 2008»
15 years 4 months ago
Planning with Durative Actions in Stochastic Domains
Probabilistic planning problems are typically modeled as a Markov Decision Process (MDP). MDPs, while an otherwise expressive model, allow only for sequential, non-durative action...
Mausam, Daniel S. Weld
JAIR
2006
160views more  JAIR 2006»
15 years 4 months ago
Anytime Point-Based Approximations for Large POMDPs
The Partially Observable Markov Decision Process has long been recognized as a rich framework for real-world planning and control problems, especially in robotics. However exact s...
Joelle Pineau, Geoffrey J. Gordon, Sebastian Thrun