optimal policy | Sciweavers

104

NIPS
2001

101views Information Technology» more NIPS 2001»

The Emergence of Multiple Movement Units in the Presence of Noise and Feedback Delay

15 years 2 months ago

Tangential hand velocity profiles of rapid human arm movements often appear as sequences of several bell-shaped acceleration-deceleration phases called submovements or movement un...

Michael Kositsky, Andrew G. Barto

claim paper

Read More »

106

click to vote

NIPS
2008

171views Information Technology» more NIPS 2008»

MDPs with Non-Deterministic Policies

15 years 2 months ago

Download www.cs.mcgill.ca

Markov Decision Processes (MDPs) have been extensively studied and used in the context of planning and decision-making, and many methods exist to find the optimal policy for probl...

Mahdi Milani Fard, Joelle Pineau

claim paper

Read More »

85

Voted

ESANN
2007

122views Neural Networks» more ESANN 2007»

The Recurrent Control Neural Network

15 years 2 months ago

Download www.dice.ucl.ac.be

This paper presents our Recurrent Control Neural Network (RCNN), which is a model-based approach for a data-eﬃcient modelling and control of reinforcement learning problems in di...

Anton Maximilian Schäfer, Steffen Udluft, Han...

claim paper

Read More »

147

click to vote

INFOCOM
1991
IEEE

145views Communications» more INFOCOM 1991»

Queueing Performance with Impatient Customers

15 years 4 months ago

Download catt.poly.edu

customer which exceeds its deadline will either leave the queue without service or stay in the queue to get unsucWe consider the problem of scheduling impatient CUS- cessful servic...

Zheng-Xue Zhao, Shivendra S. Panwar, Donald F. Tow...

claim paper

Read More »

113

click to vote

ESA
2006
Springer

136views Algorithms» more ESA 2006»

Approximation in Preemptive Stochastic Online Scheduling

15 years 4 months ago

Download www.mpi-inf.mpg.de

Abstract. We present a first constant performance guarantee for preemptive stochastic scheduling to minimize the sum of weighted completion times. For scheduling jobs with release ...

Nicole Megow, Tjark Vredeveld

claim paper

Read More »

110

Voted

INFOCOM
2000
IEEE

91views Communications» more INFOCOM 2000»

Optimal Streaming of Layered Video

15 years 5 months ago

Download cis.poly.edu

Abstract—This paper presents a model and theory for streaming layered video. We model the bandwidth available to the streaming application as a stochastic process whose statistic...

Despina Saparilla, Keith W. Ross

claim paper

Read More »

101

click to vote

ATAL
2003
Springer

126views Intelligent Agents» more ATAL 2003»

Constructing optimal policies for agents with constrained architectures

15 years 5 months ago

Download www.eecs.umich.edu

Optimal behavior is a very desirable property of autonomous agents and, as such, has received much attention over the years. However, making optimal decisions and executing optima...

Dmitri A. Dolgov, Edmund H. Durfee

claim paper

Read More »

113

Voted

INFOCOM
2003
IEEE

149views Communications» more INFOCOM 2003»

Power Constrained and Delay Optimal Policies for Scheduling Transmission over a Fading Channel

15 years 6 months ago

Download www.ieee-infocom.org

ACT We consider an optimal power and rate scheduling problem for a single user transmitting to a base station on a fading wireless link with the objective of minimizing the mean de...

Munish Goyal, Anurag Kumar, Vinod Sharma

claim paper

Read More »

115

Voted

GLOBECOM
2006
IEEE

160views Communications» more GLOBECOM 2006»

Adaptive Learning of Transmission Control Policies for MIMO Fading Channels under Delay Constraint

15 years 6 months ago

Download www.ece.ubc.ca

— This paper addresses learning based adaptive resource allocation for wireless MIMO channels with Markovian fading. The problem is posed as Constrained Markov Decision Process w...

Dejan V. Djonin, Vikram Krishnamurthy

claim paper

Read More »

116

Voted

ICN
2007
Springer

97views Computer Networks» more ICN 2007»

Heuristic Approach of Optimal Code Allocation in High Speed Downlink Packet Access Networks

15 years 6 months ago

Download www.sce.carleton.ca

— In this paper, we use the Markov Decision Process (MDP) technique to ﬁnd the optimal code allocation policy in High-Speed Downlink Packet Access (HSDPA) networks. A discrete ...

Hussein Al-Zubaidy, Jerome Talim, Ioannis Lambadar...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers