Search Sciweavers | Sciweavers

688 search results - page 83 / 138

» Using reinforcement learning to adapt an imitation task

ML
2002
ACM

178 views, Machine Learning

Metric-Based Methods for Adaptive Model Selection and Regularization

13 years 7 months ago

Download www.cs.cmu.edu

We present a general approach to model selection and regularization that exploits unlabeled data to adaptively control hypothesis complexity in supervised learning tasks. The idea ...

Dale Schuurmans, Finnegan Southey

ICML
2005
IEEE

121 views, Machine Learning

Combining model-based and instance-based learning for first order regression

14 years 8 months ago

Download www.cs.kuleuven.ac.be

Combining Model-Based and Instance-Based Learning for First Order Regression (Extended Abstract). Kurt Driessens (a), Saso Dzeroski (b). (a) Department of Computer Science, University of Waikato, Hamilton, New Zealand (kurtd@waikato.ac.nz); (b) Departm...

Kurt Driessens, Saso Dzeroski

ECML
2007
Springer

192 views, Machine Learning

Policy Gradient Critics

14 years 1 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

NIPS
2008

166 views, Information Technology

An Empirical Analysis of Domain Adaptation Algorithms for Genomic Sequence Analysis

13 years 9 months ago

Download www.fml.tuebingen.mpg.de

We study the problem of domain transfer for a supervised classification task in mRNA splicing. We consider a number of recent domain transfer methods from machine learning, includ...

Gabriele Schweikert, Christian Widmer, Bernhard Sc...

ICRA
2007
IEEE

155 views, Robotics

Value Function Approximation on Non-Linear Manifolds for Robot Motor Control

14 years 2 months ago

Download sugiyama-www.cs.titech.ac.jp

The least squares approach works efficiently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular an...

Masashi Sugiyama, Hirotaka Hachiya, Christopher To...
