optimal policy | Sciweavers

159

TR
2010

126views Hardware» more TR 2010»

Optimal Maintenance Strategies for Wind Turbine Systems Under Stochastic Weather Conditions

15 years 1 months ago

Abstract--We examine optimal repair strategies for wind turbines operated under stochastic weather conditions. In-situ sensors installed at wind turbines produce useful information...

Eunshin Byon, Lewis Ntaimo, Yu Ding

claim paper

Read More »

209

click to vote

TASE
2011
IEEE

226views Software Engineering» more TASE 2011»

Dynamic Pricing and Inventory Control in a Make-to-Stock Queue With Information on the Production Status

15 years 1 months ago

Download www.se.cuhk.edu.hk

: This paper addresses the dynamic pricing problem of a single-item, make-to-stock production system. Demand arrives according to Poisson processes with changeable arrival rate dep...

Liuxin Chen, Youhua Chen, Zhan Pang

claim paper

Read More »

184

click to vote

CORR
2011
Springer

175views Education» more CORR 2011»

Adaptive Channel Recommendation for Dynamic Spectrum Access

15 years 1 months ago

Download home.ie.cuhk.edu.hk

—We propose a dynamic spectrum access scheme where secondary users recommend “good” channels to each other and access accordingly. We formulate the problem as an average rewa...

Xu Chen, Jianwei Huang, Husheng Li

claim paper

Read More »

241

click to vote

AI
2002
Springer

171views Artificial Intelligence» more AI 2002»

Multiagent learning using a variable learning rate

15 years 6 months ago

Download www.cs.cmu.edu

Learning to act in a multiagent environment is a difficult problem since the normal definition of an optimal policy no longer applies. The optimal policy at any moment depends on ...

Michael H. Bowling, Manuela M. Veloso

claim paper

Read More »

182

click to vote

TIT
2008

110views more TIT 2008»

Optimal Cross-Layer Scheduling of Transmissions Over a Fading Multiaccess Channel

15 years 6 months ago

Download ece.iisc.ernet.in

We consider the problem of several users transmitting packets to a base station, and study an optimal scheduling formulation involving three communication layers, namely, the mediu...

Munish Goyal, Anurag Kumar, Vinod Sharma

claim paper

Read More »

198

click to vote

TCOM
2008

128views more TCOM 2008»

Cross-Layer Rate and Power Adaptation Strategies for IR-HARQ Systems over Fading Channels with Memory: A SMDP-Based Approach

15 years 6 months ago

Download www.ece.ubc.ca

Abstract--Incremental-redundancy hybrid automatic repeatrequest (IR-HARQ) schemes are proposed in several wireless standards for increased throughput-efficiency and greater reliabi...

Ashok K. Karmokar, Dejan V. Djonin, Vijay K. Bharg...

claim paper

Read More »

143

click to vote

AUTOMATICA
2006

92views more AUTOMATICA 2006»

Dynamic brand-image-based production location decisions

15 years 6 months ago

Download www.mba.biu.ac.il

In this paper, we study the dynamic production location decisions of a manufacturer of a certain branded product. Considering brand-image as a form of goodwill, we extend the well...

Gila E. Fruchter, Eugene D. Jaffe, Israel D. Neben...

claim paper

Read More »

189

click to vote

ICML
2010
IEEE

231views Machine Learning» more ICML 2010»

Toward Off-Policy Learning Control with Function Approximation

15 years 7 months ago

Download www.sztaki.hu

We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...

Hamid Reza Maei, Csaba Szepesvári, Shalabh ...

claim paper

Read More »

176

click to vote

UAI
1998

109views Artificial Intelligence» more UAI 1998»

An Anytime Algorithm for Decision Making under Uncertainty

15 years 7 months ago

Download www.cs.ubc.ca

We present an anytime algorithm which computes policies for decision problems represented as multi-stage influence diagrams. Our algorithm constructs policies incrementally, start...

Michael C. Horsch, David Poole

claim paper

Read More »

184

click to vote

NIPS
1998

164views Information Technology» more NIPS 1998»

Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms

15 years 7 months ago

Download www.cis.upenn.edu

In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers