Sciweavers

332 search results - page 47 / 67
» Ranking policies in discrete Markov decision processes
Sort
View
AAAI
2012
11 years 10 months ago
Kernel-Based Reinforcement Learning on Representative States
Markov decision processes (MDPs) are an established framework for solving sequential decision-making problems under uncertainty. In this work, we propose a new method for batchmod...
Branislav Kveton, Georgios Theocharous
ATAL
2008
Springer
13 years 10 months ago
Reinforcement learning for DEC-MDPs with changing action sets and partially ordered dependencies
Decentralized Markov decision processes are frequently used to model cooperative multi-agent systems. In this paper, we identify a subclass of general DEC-MDPs that features regul...
Thomas Gabel, Martin A. Riedmiller
AAAI
2006
13 years 9 months ago
Using Homomorphisms to Transfer Options across Continuous Reinforcement Learning Domains
We examine the problem of Transfer in Reinforcement Learning and present a method to utilize knowledge acquired in one Markov Decision Process (MDP) to bootstrap learning in a mor...
Vishal Soni, Satinder P. Singh
WCNC
2010
IEEE
13 years 12 months ago
Credit-Based Spectrum Sharing for Cognitive Mobile Multihop Relay Networks
Abstract—In cognitive mobile multihop relay (CMMR) network, the mobile user as the primary user is allocated with the channel for transmitting data. Relay station as the secondar...
Dusit Niyato, Ping Wang
CPAIOR
2008
Springer
13 years 9 months ago
Amsaa: A Multistep Anticipatory Algorithm for Online Stochastic Combinatorial Optimization
The one-step anticipatory algorithm (1s-AA) is an online algorithm making decisions under uncertainty by ignoring future non-anticipativity constraints. It makes near-optimal decis...
Luc Mercier, Pascal Van Hentenryck