Search Sciweavers | Sciweavers

» Convergence of Gradient Dynamics with a Variable Learning Ra...

ECAI
2008
Springer

Exploiting locality of interactions using a policy-gradient approach in multiagent learning

15 years 8 months ago

Download gaips.inesc-id.pt

In this paper, we propose a policy gradient reinforcement learning algorithm to address transition-independent Dec-POMDPs. This approach aims at implicitly exploiting the locality...

Francisco S. Melo

FOCM
2008

Online Gradient Descent Learning Algorithms

15 years 6 months ago

Download www.cs.ucl.ac.uk

This paper considers the least-square online gradient descent algorithm in a reproducing kernel Hilbert space (RKHS) without explicit regularization. We present a novel capacity i...

Yiming Ying, Massimiliano Pontil

CORR
2012
Springer

Smoothing Proximal Gradient Method for General Structured Sparse Learning

14 years 2 months ago

Download www.cs.cmu.edu

We study the problem of learning high dimensional regression models regularized by a structured-sparsity-inducing penalty that encodes prior structural information on either input...

Xi Chen, Qihang Lin, Seyoung Kim, Jaime G. Carbone...

JMLR
2010

Dual Averaging Methods for Regularized Stochastic Learning and Online Optimization

15 years 1 months ago

Download jmlr.csail.mit.edu

We consider regularized stochastic learning and online optimization problems, where the objective function is the sum of two convex terms: one is the loss function of the learning...

Lin Xiao

ICML
2009
IEEE

Predictive representations for policy gradient in POMDPs

16 years 7 months ago

Download damas.ift.ulaval.ca

We consider the problem of estimating the policy gradient in Partially Observable Markov Decision Processes (POMDPs) with a special class of policies that are based on Predictive ...

Abdeslam Boularias, Brahim Chaib-draa
