Search Sciweavers | Sciweavers

101 search results - page 7 / 21

» Convergence of Gradient Dynamics with a Variable Learning Ra...

150

click to vote

AUTOMATICA
2006

76views more AUTOMATICA 2006»

Nonlinear robust performance analysis using complex-step gradient approximation

15 years 7 months ago

Download jkim-pc.aero.gla.ac.uk

In this paper, the complex-step method is applied in the setting of numerical optimisation problems involving dynamical systems modelled as nonlinear differential equations. The m...

Jongrae Kim, Declan G. Bates, Ian Postlethwaite

claim paper

Read More »

192

click to vote

KDD
2010
ACM

245views Data Mining» more KDD 2010»

Learning incoherent sparse and low-rank patterns from multiple tasks

15 years 9 months ago

Download www.public.asu.edu

We consider the problem of learning incoherent sparse and lowrank patterns from multiple tasks. Our approach is based on a linear multi-task learning formulation, in which the spa...

Jianhui Chen, Ji Liu, Jieping Ye

claim paper

Read More »

172

click to vote

ICANN
2010
Springer

164views Neural Networks» more ICANN 2010»

Multi-Dimensional Deep Memory Atari-Go Players for Parameter Exploring Policy Gradients

15 years 7 months ago

Download www.idsia.ch

Abstract. Developing superior artificial board-game players is a widelystudied area of Artificial Intelligence. Among the most challenging games is the Asian game of Go, which, des...

Mandy Grüttner, Frank Sehnke, Tom Schaul, J&u...

claim paper

Read More »

206

click to vote

MVA
2002

195views Computer Vision» more MVA 2002»

Improved Adaptive Mixture Learning for Robust Video Background Modeling

15 years 6 months ago

Download www.cvl.iis.u-tokyo.ac.jp

2 Related Works Gaussian mixtures are often used for data modeling in many real-time applications such as video background modeling and speaker direction tracking. The real-time a...

Dar-Shyang Lee

claim paper

Read More »

214

click to vote

JMLR
2006

124views more JMLR 2006»

Policy Gradient in Continuous Time

15 years 6 months ago

Download hal.inria.fr

Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...

Rémi Munos

claim paper

Read More »

« Prev « First page 7 / 21 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers