Search Sciweavers | Sciweavers

311 search results - page 14 / 63

» Gradient Convergence in Gradient methods with Errors

104

Voted

FOCM
2006

50views more FOCM 2006»

Online Learning Algorithms

15 years 3 months ago

Download ttic.uchicago.edu

In this paper, we study an online learning algorithm in Reproducing Kernel Hilbert Spaces (RKHS) and general Hilbert spaces. We present a general form of the stochastic gradient m...

Steve Smale, Yuan Yao

claim paper

Read More »

128

Voted

JMLR
2006

143views more JMLR 2006»

Geometric Variance Reduction in Markov Chains: Application to Value Function and Gradient Estimation

15 years 3 months ago

Download www.aaai.org

We study a sequential variance reduction technique for Monte Carlo estimation of functionals in Markov Chains. The method is based on designing sequential control variates using s...

Rémi Munos

claim paper

Read More »

148

Voted

CEC
2008
IEEE

109views Artificial Intelligence» more CEC 2008»

A study on constrained MA using GA and SQP: Analytical vs. finite-difference gradients

15 years 10 months ago

Download ntu-cg.ntu.edu.sg

— Many deterministic algorithms in the context of constrained optimization require the ﬁrst-order derivatives, or the gradient vectors, of the objective and constraint function...

Stephanus Daniel Handoko, Chee Keong Kwoh, Yew-Soo...

claim paper

Read More »

130

Voted

ARC
2008
Springer

115views Hardware» more ARC 2008»

A High Throughput FPGA-based Floating Point Conjugate Gradient Implementation

15 years 5 months ago

Download cas.ee.ic.ac.uk

As Field Programmable Gate Arrays (FPGAs) have reached capacities beyond millions of equivalent gates, it becomes possible to accelerate floating-point scientific computing applica...

Antonio Roldao Lopes, George A. Constantinides

claim paper

Read More »

129

Voted

ICML
2009
IEEE

148views Machine Learning» more ICML 2009»

Predictive representations for policy gradient in POMDPs

16 years 4 months ago

Download damas.ift.ulaval.ca

We consider the problem of estimating the policy gradient in Partially Observable Markov Decision Processes (POMDPs) with a special class of policies that are based on Predictive ...

Abdeslam Boularias, Brahim Chaib-draa

claim paper

Read More »

« Prev « First page 14 / 63 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers