Search Sciweavers | Sciweavers

567 search results - page 10 / 114

» Learning All Subfunctions of a Function

ICMLA
2010

Multi-Agent Inverse Reinforcement Learning

15 years 4 months ago

Download ftp.cs.wisc.edu

Learning the reward function of an agent by observing its behavior is termed inverse reinforcement learning and has applications in learning from demonstration or apprenticeship l...

Sriraam Natarajan, Gautam Kunapuli, Kshitij Judah,...

ICML
2001
IEEE

Off-Policy Temporal Difference Learning with Function Approximation

16 years 7 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

PAMI
2011

Semi-Supervised Learning via Regularized Boosting Working on Multiple Semi-Supervised Assumptions

15 years 1 month ago

Download www.cs.man.ac.uk

Semi-supervised learning concerns the problem of learning in the presence of labeled and unlabeled data. Several boosting algorithms have been extended to semi-supervised learni...

Ke Chen, Shihai Wang

IWANN
1999
Springer

Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning

15 years 11 months ago

Download www.cs.colostate.edu

To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...

R. Matthew Kretchmar, Charles W. Anderson

ICML
1998
IEEE

Q2: Memory-Based Active Learning for Optimizing Noisy Continuous Functions

16 years 7 months ago

Download www.ri.cmu.edu

This paper introduces a new algorithm, Q2, for optimizing the expected output of a multi-input noisy continuous function. Q2 is designed to need only a few experiments, it avoids stron...

Andrew W. Moore, Jeff G. Schneider, Justin A. Boya...
