Search Sciweavers | Sciweavers

119 search results - page 12 / 24

» A Markov Reward Model Checker

145

click to vote

CDC
2009
IEEE

169views Control Systems» more CDC 2009»

Parametric regret in uncertain Markov decision processes

15 years 10 months ago

Download www.cim.mcgill.ca

— We consider decision making in a Markovian setup where the reward parameters are not known in advance. Our performance criterion is the gap between the performance of the best ...

Huan Xu, Shie Mannor

claim paper

Read More »

172

Voted

ICASSP
2008
IEEE

163views Signal Processing» more ICASSP 2008»

Link throughput of multi-channel opportunistic access with limited sensing

16 years 15 days ago

Download www.ece.ucdavis.edu

—We aim to characterize the maximum link throughput of a multi-channel opportunistic communication system. The states of these channels evolve as independent and identically dist...

Keqin Liu, Qing Zhao

claim paper

Read More »

165

click to vote

GLOBECOM
2008
IEEE

133views Communications» more GLOBECOM 2008»

Foresighted Resource Reciprocation Strategies in P2P Networks

16 years 15 days ago

Download medianetlab.ee.ucla.edu

—We consider peer-to-peer (P2P) networks, where multiple peers are interested in sharing content. While sharing resources, autonomous and self-interested peers need to make decis...

Hyunggon Park, Mihaela van der Schaar

claim paper

Read More »

137

click to vote

AIPS
2000

129views Artificial Intelligence» more AIPS 2000»

Representations of Decision-Theoretic Planning Tasks

15 years 7 months ago

Download www.aaai.org

Goal-directed Markov Decision Process models (GDMDPs) are good models for many decision-theoretic planning tasks. They have been used in conjunction with two different reward stru...

Sven Koenig, Yaxin Liu

claim paper

Read More »

188

click to vote

CORR
2010
Springer

143views Education» more CORR 2010»

The Non-Bayesian Restless Multi-Armed Bandit: a Case of Near-Logarithmic Regret

15 years 2 months ago

Download www.ece.ucdavis.edu

In the classic Bayesian restless multi-armed bandit (RMAB) problem, there are N arms, with rewards on all arms evolving at each time as Markov chains with known parameters. A play...

Wenhan Dai, Yi Gai, Bhaskar Krishnamachari, Qing Z...

claim paper

Read More »

« Prev « First page 12 / 24 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers