Search Sciweavers | Sciweavers

121 search results - page 14 / 25

» Toward Off-Policy Learning Control with Function Approximati...

188

click to vote

ML
1998
ACM

131views Machine Learning» more ML 1998»

Learning from Examples and Membership Queries with Structured Determinations

15 years 6 months ago

Download web.engr.oregonstate.edu

It is well known that prior knowledge or bias can speed up learning, at least in theory. It has proved di cult to make constructive use of prior knowledge, so that approximately c...

Prasad Tadepalli, Stuart J. Russell

claim paper

Read More »

156

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 8 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

338

click to vote

ECAI
2010
Springer

339views Artificial Intelligence» more ECAI 2010»

Bayesian Monte Carlo for the Global Optimization of Expensive Functions

15 years 7 months ago

Download www.sps.ele.tue.nl

In the last decades enormous advances have been made possible for modelling complex (physical) systems by mathematical equations and computer algorithms. To deal with very long run...

Perry Groot, Adriana Birlutiu, Tom Heskes

claim paper

Read More »

166

Voted

ICANN
2003
Springer

114views Neural Networks» more ICANN 2003»

Unsupervised Learning of a Kinematic Arm Model

15 years 12 months ago

Download www.heikohoffmann.de

Abstract. An abstract recurrent neural network trained by an unsupervised method is applied to the kinematic control of a robot arm. The network is a novel extension of the Neural ...

Heiko Hoffmann, Ralf Möller

claim paper

Read More »

208

click to vote

JMLR
2010

119views more JMLR 2010»

A Convergent Online Single Time Scale Actor Critic Algorithm

15 years 1 months ago

Download jmlr.csail.mit.edu

Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...

Dotan Di Castro, Ron Meir

claim paper

Read More »

« Prev « First page 14 / 25 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers