Sciweavers

101 search results - page 7 / 21
» Convergence of Gradient Dynamics with a Variable Learning Ra...
Sort
View
AUTOMATICA
2006
76views more  AUTOMATICA 2006»
13 years 7 months ago
Nonlinear robust performance analysis using complex-step gradient approximation
In this paper, the complex-step method is applied in the setting of numerical optimisation problems involving dynamical systems modelled as nonlinear differential equations. The m...
Jongrae Kim, Declan G. Bates, Ian Postlethwaite
KDD
2010
ACM
245views Data Mining» more  KDD 2010»
13 years 9 months ago
Learning incoherent sparse and low-rank patterns from multiple tasks
We consider the problem of learning incoherent sparse and lowrank patterns from multiple tasks. Our approach is based on a linear multi-task learning formulation, in which the spa...
Jianhui Chen, Ji Liu, Jieping Ye
ICANN
2010
Springer
13 years 7 months ago
Multi-Dimensional Deep Memory Atari-Go Players for Parameter Exploring Policy Gradients
Abstract. Developing superior artificial board-game players is a widelystudied area of Artificial Intelligence. Among the most challenging games is the Asian game of Go, which, des...
Mandy Grüttner, Frank Sehnke, Tom Schaul, J&u...
MVA
2002
195views Computer Vision» more  MVA 2002»
13 years 7 months ago
Improved Adaptive Mixture Learning for Robust Video Background Modeling
2 Related Works Gaussian mixtures are often used for data modeling in many real-time applications such as video background modeling and speaker direction tracking. The real-time a...
Dar-Shyang Lee
JMLR
2006
124views more  JMLR 2006»
13 years 7 months ago
Policy Gradient in Continuous Time
Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...
Rémi Munos