Sciweavers

3049 search results - page 42 / 610
» On the Convergence of Bound Optimization Algorithms
Sort
View
NIPS
2008
13 years 10 months ago
Regularized Policy Iteration
In this paper we consider approximate policy-iteration-based reinforcement learning algorithms. In order to implement a flexible function approximation scheme we propose the use o...
Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...
TSP
2010
13 years 3 months ago
Linear precoder design through cut-off rate maximization in MIMO-OFDM coded systems with imperfect CSIT
This paper proposes a linear transmitter design that aims at minimizing the packet error rate (PER) using partial channel state information at the transmitter (CSIT). The design is...
Francesc Rey, Meritxell Lamarca, Gregori Vá...
ICIP
2007
IEEE
14 years 10 months ago
Two-Step Algorithms for Linear Inverse Problems with Non-Quadratic Regularization
Iterative shrinkage/thresholding (IST) algorithms have been recently proposed to handle high-dimensional convex optimization problems arising in image inverse problems (namely dec...
José M. Bioucas-Dias, Mário A. T. Fi...
INFOCOM
2005
IEEE
14 years 2 months ago
Optimal utility based multi-user throughput allocation subject to throughput constraints
— We consider the problem of scheduling multiple users sharing a time-varying wireless channel. (As an example, this is a model of scheduling in 3G wireless technologies, such as...
Matthew Andrews, Lijun Qian, Alexander L. Stolyar
ICML
2009
IEEE
14 years 9 months ago
Efficient learning algorithms for changing environments
We study online learning in an oblivious changing environment. The standard measure of regret bounds the difference between the cost of the online learner and the best decision in...
Elad Hazan, C. Seshadhri