Search Sciweavers | Sciweavers

275 search results - page 32 / 55

» Learning equivalent action choices from demonstration

124

click to vote

ICML
1996
IEEE

162views Machine Learning» more ICML 1996»

Learning Evaluation Functions for Large Acyclic Domains

16 years 1 months ago

Download www.ri.cmu.edu

Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...

Justin A. Boyan, Andrew W. Moore

claim paper

Read More »

Voted

ICML
2008
IEEE

162views Machine Learning» more ICML 2008»

Automatic discovery and transfer of MAXQ hierarchies

16 years 1 months ago

Download pages.cs.wisc.edu

We present an algorithm, HI-MAT (Hierarchy Induction via Models And Trajectories), that discovers MAXQ task hierarchies by applying dynamic Bayesian network models to a successful...

Neville Mehta, Soumya Ray, Prasad Tadepalli, Thoma...

claim paper

Read More »

118

Voted

ICRA
2009
IEEE

227views Robotics» more ICRA 2009»

Adaptive autonomous control using online value iteration with gaussian processes

15 years 7 months ago

Download www-personal.acfr.usyd.edu.au

— In this paper, we present a novel approach to controlling a robotic system online from scratch based on the reinforcement learning principle. In contrast to other approaches, o...

Axel Rottmann, Wolfram Burgard

claim paper

Read More »

133

Voted

IWLCS
2005
Springer

161views Machine Learning» more IWLCS 2005»

Counter Example for Q-Bucket-Brigade Under Prediction Problem

15 years 6 months ago

Download www.cs.bham.ac.uk

Aiming to clarify the convergence or divergence conditions for Learning Classiﬁer System (LCS), this paper explores: (1) an extreme condition where the reinforcement process of ...

Atsushi Wada, Keiki Takadama, Katsunori Shimohara

claim paper

Read More »

113

Voted

KCAP
2009
ACM

171views Information Technology» more KCAP 2009»

Interactively shaping agents via human reinforcement: the TAMER framework

15 years 7 months ago

Download userweb.cs.utexas.edu

As computational learning agents move into domains that incur real costs (e.g., autonomous driving or ﬁnancial investment), it will be necessary to learn good policies without n...

W. Bradley Knox, Peter Stone

claim paper

Read More »

« Prev « First page 32 / 55 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers