Sciweavers

510 search results - page 23 / 102
» Gradient Estimation Revitalized
Sort
View
ECML
2007
Springer
15 years 8 months ago
Policy Gradient Critics
We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...
Daan Wierstra, Jürgen Schmidhuber
187
Voted
BILDMED
2011
313views Algorithms» more  BILDMED 2011»
14 years 6 months ago
Automatic Multi-modal ToF/CT Organ Surface Registration
Abstract. In the field of image-guided liver surgery (IGLS), the initial registration of the intra-operative organ surface with preoperative tomographic image data is performed on...
Kerstin Müller, Sebastian Bauer, Jakob Wasza,...
122
Voted
ICML
2009
IEEE
16 years 3 months ago
Predictive representations for policy gradient in POMDPs
We consider the problem of estimating the policy gradient in Partially Observable Markov Decision Processes (POMDPs) with a special class of policies that are based on Predictive ...
Abdeslam Boularias, Brahim Chaib-draa
126
Voted
IPMI
2005
Springer
16 years 3 months ago
3D Active Shape Models Using Gradient Descent Optimization of Description Length
Abstract. Active Shape Models are a popular method for segmenting three-dimensional medical images. To obtain the required landmark correspondences, various automatic approaches ha...
Tobias Heimann, Ivo Wolf, Tomos G. Williams, Hans-...
113
Voted
IPPS
2006
IEEE
15 years 8 months ago
On the performance of parallel normalized explicit preconditioned conjugate gradient type methods
A new class of parallel normalized preconditioned conjugate gradient type methods in conjunction with normalized approximate inverses algorithms, based on normalized approximate f...
George A. Gravvanis, Konstantinos M. Giannoutakis