Sciweavers

567 search results - page 44 / 114
» Regularized Policy Iteration
TIT
2008
13 years 7 months ago
Optimal Cross-Layer Scheduling of Transmissions Over a Fading Multiaccess Channel
We consider the problem of several users transmitting packets to a base station, and study an optimal scheduling formulation involving three communication layers, namely, the mediu...
Munish Goyal, Anurag Kumar, Vinod Sharma
QUESTA
2000
13 years 7 months ago
On the value function of a priority queue with an application to a controlled polling model
We give a closed-form expression for the discounted weighted queue length and switching costs of a two-class single-server queueing model under a preemptive priority rule. These e...
Ger Koole, Philippe Nain
JMLR
2010
13 years 2 months ago
Finite-sample Analysis of Bellman Residual Minimization
We consider the Bellman residual minimization approach for solving discounted Markov decision problems, where we assume that a generative model of the dynamics and rewards is avai...
Odalric-Ambrym Maillard, Rémi Munos, Alessa...
CIA
2007
Springer
14 years 1 month ago
Multi-agent Learning Dynamics: A Survey
In this paper we compare state-of-the-art multi-agent reinforcement learning algorithms in a wide variety of games. We consider two types of algorithms: value iteration a...
H. Jaap van den Herik, Daniel Hennes, Michael Kais...
ICTAI
2006
IEEE
14 years 1 month ago
A New Hybrid GA-MDP Algorithm For The Frequency Assignment Problem
We propose a novel algorithm called GA-MDP for solving the frequency assignment problem. GA-MDP inherits the spirit of genetic algorithms with an adaptation of Markov Decision Proc...
Lhassane Idoumghar, René Schott