Sciweavers

377 search results - page 11 / 76
» Convergence of Stochastic Iterative Dynamic Programming Algo...
Sort
View
CORR
2010
Springer
119views Education» more  CORR 2010»
13 years 7 months ago
Dynamic Policy Programming
In this paper, we consider the problem of planning and learning in the infinite-horizon discounted-reward Markov decision problems. We propose a novel iterative direct policysearc...
Mohammad Gheshlaghi Azar, Hilbert J. Kappen
CORR
2007
Springer
94views Education» more  CORR 2007»
13 years 7 months ago
Paging and Registration in Cellular Networks: Jointly Optimal Policies and an Iterative Algorithm
— This paper explores optimization of paging and registration policies in cellular networks. Motion is modeled as a discrete-time Markov process, and minimization of the discount...
Bruce Hajek, Kevin Mitzel, Sichao Yang
CDC
2009
IEEE
119views Control Systems» more  CDC 2009»
14 years 15 days ago
Linear Parameter Varying Iterative Learning Control
— In this paper an Iterative Learning Control (ILC) algorithm is proposed for a certain class of Linear Parameter Varying (LPV) systems whose dynamics change between iterations. ...
Mark Edward John Butcher, Alireza Karimi
CORR
2010
Springer
66views Education» more  CORR 2010»
13 years 7 months ago
Computing the speed of convergence of ergodic averages and pseudorandom points in computable dynamical systems
A pseudorandom point in an ergodic dynamical system over a computable metric space is a point which is computable but its dynamics has the same statistical behavior of a typical po...
Stefano Galatolo, Mathieu Hoyrup, Cristobal Rojas
JMLR
2012
11 years 10 months ago
Multi Kernel Learning with Online-Batch Optimization
In recent years there has been a lot of interest in designing principled classification algorithms over multiple cues, based on the intuitive notion that using more features shou...
Francesco Orabona, Jie Luo, Barbara Caputo