Sciweavers

510 search results - page 23 / 102
» Gradient Estimation Revitalized
Sort
View
ECML
2007
Springer
14 years 4 months ago
Policy Gradient Critics
We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...
Daan Wierstra, Jürgen Schmidhuber
BILDMED
2011
313views Algorithms» more  BILDMED 2011»
13 years 1 months ago
Automatic Multi-modal ToF/CT Organ Surface Registration
Abstract. In the field of image-guided liver surgery (IGLS), the initial registration of the intra-operative organ surface with preoperative tomographic image data is performed on...
Kerstin Müller, Sebastian Bauer, Jakob Wasza,...
ICML
2009
IEEE
14 years 10 months ago
Predictive representations for policy gradient in POMDPs
We consider the problem of estimating the policy gradient in Partially Observable Markov Decision Processes (POMDPs) with a special class of policies that are based on Predictive ...
Abdeslam Boularias, Brahim Chaib-draa
IPMI
2005
Springer
14 years 10 months ago
3D Active Shape Models Using Gradient Descent Optimization of Description Length
Abstract. Active Shape Models are a popular method for segmenting three-dimensional medical images. To obtain the required landmark correspondences, various automatic approaches ha...
Tobias Heimann, Ivo Wolf, Tomos G. Williams, Hans-...
IPPS
2006
IEEE
14 years 3 months ago
On the performance of parallel normalized explicit preconditioned conjugate gradient type methods
A new class of parallel normalized preconditioned conjugate gradient type methods in conjunction with normalized approximate inverses algorithms, based on normalized approximate f...
George A. Gravvanis, Konstantinos M. Giannoutakis