Search Sciweavers | Sciweavers

332 search results - page 47 / 67

» Ranking policies in discrete Markov decision processes

click to vote

AAAI
2012

205views Intelligent Agents» more AAAI 2012»

Kernel-Based Reinforcement Learning on Representative States

11 years 10 months ago

Download www.bkveton.com

Markov decision processes (MDPs) are an established framework for solving sequential decision-making problems under uncertainty. In this work, we propose a new method for batchmod...

Branislav Kveton, Georgios Theocharous

claim paper

Read More »

click to vote

ATAL
2008
Springer

138views Intelligent Agents» more ATAL 2008»

Reinforcement learning for DEC-MDPs with changing action sets and partially ordered dependencies

13 years 10 months ago

Download ml.informatik.uni-freiburg.de

Decentralized Markov decision processes are frequently used to model cooperative multi-agent systems. In this paper, we identify a subclass of general DEC-MDPs that features regul...

Thomas Gabel, Martin A. Riedmiller

claim paper

Read More »

click to vote

AAAI
2006

108views Intelligent Agents» more AAAI 2006»

Using Homomorphisms to Transfer Options across Continuous Reinforcement Learning Domains

13 years 9 months ago

Download www.eecs.umich.edu

We examine the problem of Transfer in Reinforcement Learning and present a method to utilize knowledge acquired in one Markov Decision Process (MDP) to bootstrap learning in a mor...

Vishal Soni, Satinder P. Singh

claim paper

Read More »

click to vote

WCNC
2010
IEEE

168views Computer Networks» more WCNC 2010»

Credit-Based Spectrum Sharing for Cognitive Mobile Multihop Relay Networks

13 years 12 months ago

Download www3.ntu.edu.sg

Abstract—In cognitive mobile multihop relay (CMMR) network, the mobile user as the primary user is allocated with the channel for transmitting data. Relay station as the secondar...

Dusit Niyato, Ping Wang

claim paper

Read More »

click to vote

CPAIOR
2008
Springer

198views Operations Research» more CPAIOR 2008»

Amsaa: A Multistep Anticipatory Algorithm for Online Stochastic Combinatorial Optimization

13 years 9 months ago

Download cs.brown.edu

The one-step anticipatory algorithm (1s-AA) is an online algorithm making decisions under uncertainty by ignoring future non-anticipativity constraints. It makes near-optimal decis...

Luc Mercier, Pascal Van Hentenryck

claim paper

Read More »

« Prev « First page 47 / 67 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers