Search Sciweavers | Sciweavers

779 search results - page 81 / 156

» Reinforcement Using Supervised Learning for Policy Generaliz...

131

Voted

IROS
2008
IEEE

144views Robotics» more IROS 2008»

Learning nonparametric policies by imitation

15 years 8 months ago

Download www.cs.washington.edu

— A long cherished goal in artiﬁcial intelligence has been the ability to endow a robot with the capacity to learn and generalize skills from watching a human teacher. Such an ...

David B. Grimes, Rajesh P. N. Rao

claim paper

Read More »

137

click to vote

JMLR
2006

124views more JMLR 2006»

Policy Gradient in Continuous Time

15 years 2 months ago

Download hal.inria.fr

Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...

Rémi Munos

claim paper

Read More »

149

click to vote

NECO
2011

182views Computer Networks» more NECO 2011»

Least Squares Estimation Without Priors or Supervision

14 years 9 months ago

Download www.cns.nyu.edu

Selection of an optimal estimator typically relies on either supervised training samples (pairs of measurements and their associated true values), or a prior probability model for...

Martin Raphan, Eero P. Simoncelli

claim paper

Read More »

click to vote

ICML
2008
IEEE

120views Machine Learning» more ICML 2008»

Exploration scavenging

16 years 3 months ago

Download hunch.net

We examine the problem of evaluating a policy in the contextual bandit setting using only observations collected during the execution of another policy. We show that policy evalua...

John Langford, Alexander L. Strehl, Jennifer Wortm...

claim paper

Read More »

136

click to vote

KDD
2006
ACM

115views Data Mining» more KDD 2006»

Supervised probabilistic principal component analysis

16 years 2 months ago

Download wwwbrauer.informatik.tu-muenchen.de

Principal component analysis (PCA) has been extensively applied in data mining, pattern recognition and information retrieval for unsupervised dimensionality reduction. When label...

Shipeng Yu, Kai Yu, Volker Tresp, Hans-Peter Krieg...

claim paper

Read More »

« Prev « First page 81 / 156 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers