Sciweavers

567 search results - page 23 / 114
» Regularized Policy Iteration
Sort
View
ISAAC
2010
Springer
243views Algorithms» more  ISAAC 2010»
13 years 5 months ago
Lower Bounds for Howard's Algorithm for Finding Minimum Mean-Cost Cycles
Howard's policy iteration algorithm is one of the most widely used algorithms for finding optimal policies for controlling Markov Decision Processes (MDPs). When applied to we...
Thomas Dueholm Hansen, Uri Zwick
CORR
2010
Springer
119views Education» more  CORR 2010»
13 years 7 months ago
Dynamic Policy Programming
In this paper, we consider the problem of planning and learning in the infinite-horizon discounted-reward Markov decision problems. We propose a novel iterative direct policysearc...
Mohammad Gheshlaghi Azar, Hilbert J. Kappen
ECCV
2008
Springer
14 years 9 months ago
A Generative Shape Regularization Model for Robust Face Alignment
In this paper, we present a robust face alignment system that is capable of dealing with exaggerating expressions, large occlusions, and a wide variety of image noises. The robustn...
Leon Gu, Takeo Kanade
ICIP
2007
IEEE
14 years 9 months ago
Two-Step Algorithms for Linear Inverse Problems with Non-Quadratic Regularization
Iterative shrinkage/thresholding (IST) algorithms have been recently proposed to handle high-dimensional convex optimization problems arising in image inverse problems (namely dec...
José M. Bioucas-Dias, Mário A. T. Fi...
MFCS
2009
Springer
14 years 2 months ago
Synchronization of Regular Automata
Functional graph grammars are finite devices which generate the class of regular automata. We recall the notion of synchronization by grammars, and for any given grammar we consid...
Didier Caucal