Sciweavers

915 search results - page 17 / 183
» Convergence of Iterations
Sort
View
ICML
2010
IEEE
13 years 8 months ago
Convergence of Least Squares Temporal Difference Methods Under General Conditions
We consider approximate policy evaluation for finite state and action Markov decision processes (MDP) in the off-policy learning context and with the simulation-based least square...
Huizhen Yu
IJCV
2006
142views more  IJCV 2006»
13 years 7 months ago
Geometry and Convergence Analysis of Algorithms for Registration of 3D Shapes
The computation of a rigid body transformation which optimally aligns a set of measurement points with a surface and related registration problems are studied from the viewpoint o...
Helmut Pottmann, Qi-Xing Huang, Yong-Liang Yang, S...
ICML
2000
IEEE
14 years 8 months ago
Convergence Problems of General-Sum Multiagent Reinforcement Learning
Stochastic games are a generalization of MDPs to multiple agents, and can be used as a framework for investigating multiagent learning. Hu and Wellman (1998) recently proposed a m...
Michael H. Bowling
ECML
2004
Springer
14 years 28 days ago
Convergence and Divergence in Standard and Averaging Reinforcement Learning
Although tabular reinforcement learning (RL) methods have been proved to converge to an optimal policy, the combination of particular conventional reinforcement learning techniques...
Marco Wiering
ICIP
2009
IEEE
14 years 8 months ago
Edge-preserving Nonlinear Iterative Image Resampling Method
In this paper, an edge-preserving nonlinear iterative regularization-based image resampling method for a single noise-free image is proposed. Several aspects of the resampling alg...