Search Sciweavers | Sciweavers

101 search results - page 3 / 21

» Convergence of Gradient Dynamics with a Variable Learning Ra...

NIPS
1993

Information Technology

Optimal Stochastic Search and Adaptive Momentum

13 years 8 months ago

Download www.bme.ogi.edu

Stochastic optimization algorithms typically use learning rate schedules that behave asymptotically as μ(t) = μ₀/t. The ensemble dynamics (Leen and Moody, 1993) for such algorithms ...

Todd K. Leen, Genevieve B. Orr

AI
2002
Springer

Artificial Intelligence

Multiagent learning using a variable learning rate

13 years 6 months ago

Download www.cs.cmu.edu

Learning to act in a multiagent environment is a difficult problem since the normal definition of an optimal policy no longer applies. The optimal policy at any moment depends on ...

Michael H. Bowling, Manuela M. Veloso

COLT
2000
Springer

Machine Learning

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

13 years 11 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process (POMDP), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

MOC
2002

Directional Newton methods in n variables

13 years 6 months ago

Download benisrael.net

Directional Newton methods for functions f of n variables are shown to converge, under standard assumptions, to a solution of f(x) = 0. The rate of convergence is quadratic, for ne...

Yuri Levin, Adi Ben-Israel

JMLR
2006

Learning Coordinate Covariances via Gradients

13 years 7 months ago

Download jmlr.csail.mit.edu

We introduce an algorithm that learns gradients from samples in the supervised learning framework. An error analysis is given for the convergence of the gradient estimated by the ...

Sayan Mukherjee, Ding-Xuan Zhou
