Search Sciweavers | Sciweavers

309 search results - page 16 / 62

» Smooth Optimization with Approximate Gradient

ICML
2000
IEEE

126 views, Machine Learning

Reinforcement Learning in POMDP's via Direct Gradient Ascent

16 years 7 months ago

Download reference.kfupm.edu.sa

This paper discusses theoretical and experimental aspects of gradient-based approaches to the direct optimization of policy performance in controlled POMDPs. We introduce GPOMDP, a...

Jonathan Baxter, Peter L. Bartlett

GLOBECOM
2009
IEEE

129 views, Communications

Semi-Blind Gradient-Newton CMA and SDD Algorithm for MIMO Space-Time Equalisation

16 years 1 month ago

Download users.ecs.soton.ac.uk

Semi-blind space-time equalisation is considered for dispersive multiple-input multiple-output systems that employ high-throughput quadrature amplitude modulation signalling. A...

S. Chen, Lajos Hanzo, H.-T. Cheng

GECCO
2004
Springer

122 views, Optimization

Gradient-Based Learning Updates Improve XCS Performance in Multistep Problems

16 years 2 days ago

Download www.cs.york.ac.uk

This paper introduces a gradient-based reward prediction update mechanism to the XCS classifier system as applied in neural-network type learning and function approximation mechani...

Martin V. Butz, David E. Goldberg, Pier Luca Lanzi

ICIP
2000
IEEE

194 views, Image Processing

Curve Evolution, Boundary-Value Stochastic Processes, the Mumford-Shah Problem, and Missing Data Applications

16 years 8 months ago

Download www.ece.gatech.edu

We present an estimation-theoretic approach to curve evolution for the Mumford-Shah problem. By viewing an active contour as the set of discontinuities in the Mumford-Shah problem...

Andy Tsai, Anthony J. Yezzi, Alan S. Willsky

NIPS
2001

144 views, Information Technology

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 8 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...
