Search Sciweavers | Sciweavers

65 search results - page 10 / 13

» Graph Laplacian based transfer learning in reinforcement lea...

click to vote

IWCLS
2007
Springer

176views Machine Learning» more IWCLS 2007»

On Lookahead and Latent Learning in Simple LCS

14 years 1 months ago

Download www.psychologie.uni-wuerzburg.de

Learning Classifier Systems use evolutionary algorithms to facilitate rule- discovery, where rule fitness is traditionally payoff based and assigned under a sharing scheme. Most c...

Larry Bull

claim paper

Read More »

click to vote

ECML
2003
Springer

87views Machine Learning» more ECML 2003»

Self-evaluated Learning Agent in Multiple State Games

14 years 24 days ago

Download www.ai.sanken.osaka-u.ac.jp

Abstract. Most of multi-agent reinforcement learning algorithms aim to converge to a Nash equilibrium, but a Nash equilibrium does not necessarily mean a desirable result. On the o...

Koichi Moriyama, Masayuki Numao

claim paper

Read More »

click to vote

ICRA
2010
IEEE

143views Robotics» more ICRA 2010»

Apprenticeship learning via soft local homomorphisms

13 years 6 months ago

Download damas.ift.ulaval.ca

Abstract— We consider the problem of apprenticeship learning when the expert’s demonstration covers only a small part of a large state space. Inverse Reinforcement Learning (IR...

Abdeslam Boularias, Brahim Chaib-draa

claim paper

Read More »

click to vote

ICCV
2007
IEEE

125views Computer Vision» more ICCV 2007»

Contextual Distance for Data Perception

14 years 1 months ago

Download research.microsoft.com

Structural perception of data plays a fundamental role in pattern analysis and machine learning. In this paper, we develop a new structural perception of data based on local conte...

Deli Zhao, Zhouchen Lin, Xiaoou Tang

claim paper

Read More »

click to vote

PKDD
2010
Springer

164views Data Mining» more PKDD 2010»

Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations

13 years 5 months ago

Download users.ics.tkk.fi

Partially observable Markov decision processes (POMDPs) are widely used for planning under uncertainty. In many applications, the huge size of the POMDP state space makes straightf...

Joni Pajarinen, Jaakko Peltonen, Ari Hottinen, Mik...

claim paper

Read More »

« Prev « First page 10 / 13 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers