Search Sciweavers | Sciweavers

779 search results - page 73 / 156

» Reinforcement Using Supervised Learning for Policy Generaliz...

click to vote

CVPR
2009
IEEE

637views Computer Vision» more CVPR 2009»

Regularized Multi-Class Semi-Supervised Boosting

15 years 3 months ago

Download www.ymer.org

Many semi-supervised learning algorithms only deal with binary classification. Their extension to the multi-class problem is usually obtained by repeatedly solving a set of bina...

Amir Saffari, Christian Leistner, Horst Bischof

posted by leistner

Read More »

click to vote

NIPS
1997

121views Information Technology» more NIPS 1997»

Generalized Prioritized Sweeping

13 years 9 months ago

Download www.cs.huji.ac.il

Prioritized sweeping is a model-based reinforcement learning method that attempts to focus an agent’s limited computational resources to achieve a good estimate of the value of ...

David Andre, Nir Friedman, Ronald Parr

claim paper

Read More »

click to vote

PKDD
2010
Springer

164views Data Mining» more PKDD 2010»

Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations

13 years 5 months ago

Download users.ics.tkk.fi

Partially observable Markov decision processes (POMDPs) are widely used for planning under uncertainty. In many applications, the huge size of the POMDP state space makes straightf...

Joni Pajarinen, Jaakko Peltonen, Ari Hottinen, Mik...

claim paper

Read More »

click to vote

ATAL
2007
Springer

180views Intelligent Agents» more ATAL 2007»

Confidence-based policy learning from demonstration using Gaussian mixture models

13 years 12 months ago

Download www.cs.cmu.edu

We contribute an approach for interactive policy learning through expert demonstration that allows an agent to actively request and effectively represent demonstration examples. I...

Sonia Chernova, Manuela M. Veloso

claim paper

Read More »

click to vote

DAGM
2004
Springer

109views Image Processing» more DAGM 2004»

Learning from Labeled and Unlabeled Data Using Random Walks

14 years 1 months ago

Download research.microsoft.com

We consider the general problem of learning from labeled and unlabeled data. Given a set of points, some of them are labeled, and the remaining points are unlabeled. The goal is to...

Dengyong Zhou, Bernhard Schölkopf

claim paper

Read More »

« Prev « First page 73 / 156 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers