Search Sciweavers | Sciweavers

» Kernelized value function approximation for reinforcement le...

ICML
2000
IEEE

Eligibility Traces for Off-Policy Policy Evaluation

16 years 3 months ago

Download www.cs.ualberta.ca

Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...

Doina Precup, Richard S. Sutton, Satinder P. Singh

WWW
2007
ACM

A kernel based structure matching for web services search

16 years 3 months ago

Download www2007.org

This paper describes a kernel based Web Services (abbreviated as service) matching mechanism for service discovery and integration. The matching mechanism tries to exploit the lat...

Yu Jianjun, Guo Shengmin, Su Hao, Zhang Hui, Xu Ke

NIPS
1993

Using Local Trajectory Optimizers to Speed Up Global Optimization in Dynamic Programming

15 years 3 months ago

Download www.cs.cmu.edu

Dynamic programming provides a methodology to develop planners and controllers for nonlinear systems. However, general dynamic programming is computationally intractable. We have ...

Christopher G. Atkeson

ICML
1998
IEEE

Intra-Option Learning about Temporally Abstract Actions

16 years 3 months ago

Download www.cs.ualberta.ca

Intra-Option Learning about Temporally Abstract Actions. Richard S. Sutton, Department of Computer Science, University of Massachusetts, Amherst, MA 01003-4610, rich@cs.umass.edu. Doina Precup, D...

Richard S. Sutton, Doina Precup, Satinder P. Singh

MLCW
2005
Springer

Estimating Predictive Variances with Kernel Ridge Regression

15 years 8 months ago

Download eprints.pascal-network.org

In many regression tasks, in addition to an accurate estimate of the conditional mean of the target distribution, an indication of the predictive uncertainty is also required. Ther...

Gavin C. Cawley, Nicola L. C. Talbot, Olivier Chap...
