Search Sciweavers | Sciweavers

534 search results - page 3 / 107

» Markov Reward Approach to Performability and Reliability Ana...

JMLR
2010

Adaptive Step-size Policy Gradients with Average Reward Metric

15 years 2 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

CORR
2010
Springer

Online Learning in Opportunistic Spectrum Access: A Restless Bandit Approach

15 years 2 months ago

Download www.eecs.umich.edu

We consider an opportunistic spectrum access (OSA) problem where the time-varying condition of each channel (e.g., as a result of random fading or certain primary users' activ...

Cem Tekin, Mingyan Liu

ECML
2005
Springer

Active Learning in Partially Observable Markov Decision Processes

16 years 29 days ago

Download www.cs.mcgill.ca

This paper examines the problem of finding an optimal policy for a Partially Observable Markov Decision Process (POMDP) when the model is not known or is only poorly specified. W...

Robin Jaulmes, Joelle Pineau, Doina Precup

CDC
2009
IEEE

Arbitrarily modulated Markov decision processes

16 years 4 days ago

Download www.cim.mcgill.ca

We consider decision-making problems in Markov decision processes where both the rewards and the transition probabilities vary in an arbitrary (e.g., nonstationary) fashion. We...

Jia Yuan Yu, Shie Mannor

INFOCOM
2008
IEEE

QoS Performance Analysis of Cognitive Radio-Based Virtual Wireless Networks

16 years 1 months ago

Download www.irisa.fr

Cognitive radio presents a new approach to wireless spectrum utilization and management. In this work, the potential performance improvement gained by applying cognitive radio t...

Brent Ishibashi, Nizar Bouabdallah, Raouf Boutaba
