Search Sciweavers | Sciweavers

236 search results - page 16 / 48

» Bias and Variance Approximation in Value Function Estimates

132

click to vote

ESANN
2004

90views Neural Networks» more ESANN 2004»

High-accuracy value-function approximation with neural networks applied to the acrobot

15 years 5 months ago

Download remi.coulom.free.fr

Several reinforcement-learning techniques have already been applied to the Acrobot control problem, using linear function approximators to estimate the value function. In this pape...

Rémi Coulom

claim paper

Read More »

151

Voted

ML
1998
ACM

131views Machine Learning» more ML 1998»

Learning from Examples and Membership Queries with Structured Determinations

15 years 4 months ago

Download web.engr.oregonstate.edu

It is well known that prior knowledge or bias can speed up learning, at least in theory. It has proved di cult to make constructive use of prior knowledge, so that approximately c...

Prasad Tadepalli, Stuart J. Russell

claim paper

Read More »

170

click to vote

ATAL
2005
Springer

181views Intelligent Agents» more ATAL 2005»

Improving reinforcement learning function approximators via neuroevolution

15 years 10 months ago

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson

claim paper

Read More »

168

click to vote

WEBDB
2004
Springer

202views Database» more WEBDB 2004»

Mining Approximate Functional Dependencies and Concept Similarities to Answer Imprecise Queries

15 years 9 months ago

Download rakaposhi.eas.asu.edu

Current approaches for answering queries with imprecise constraints require users to provide distance metrics and importance measures for attributes of interest. In this paper we ...

Ullas Nambiar, Subbarao Kambhampati

claim paper

Read More »

164

click to vote

RSS
2007

176views Robotics» more RSS 2007»

Active Policy Learning for Robot Planning and Exploration under Uncertainty

15 years 5 months ago

Download www.roboticsproceedings.org

Abstract— This paper proposes a simulation-based active policy learning algorithm for ﬁnite-horizon, partially-observed sequential decision processes. The algorithm is tested i...

Ruben Martinez-Cantin, Nando de Freitas, Arnaud Do...

claim paper

Read More »

« Prev « First page 16 / 48 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers