Search Sciweavers | Sciweavers

179 search results - page 12 / 36

» Learning Relational Navigation Policies

270

click to vote

Publication

154views

Preference elicitation and inverse reinforcement learning

14 years 9 months ago

Download arxiv.org

We state the problem of inverse reinforcement learning in terms of preference elicitation, resulting in a principled (Bayesian) statistical formulation. This generalises previous w...

Constantin Rothkopf, Christos Dimitrakakis

posted by olethros

Read More »

193

click to vote

ATAL
2008
Springer

151views Intelligent Agents» more ATAL 2008»

Graph Laplacian based transfer learning in reinforcement learning

15 years 9 months ago

Download www.ifaamas.org

The aim of transfer learning is to accelerate learning in related domains. In reinforcement learning, many different features such as a value function and a policy can be transfer...

Yi-Ting Tsao, Ke-Ting Xiao, Von-Wun Soo

claim paper

Read More »

179

click to vote

AIPS
2008

153views Artificial Intelligence» more AIPS 2008»

Learning Relational Decision Trees for Guiding Heuristic Planning

15 years 9 months ago

Download www.plg.inf.uc3m.es

The current evaluation functions for heuristic planning are expensive to compute. In numerous domains these functions give good guidance on the solution, so it worths the computat...

Tomás de la Rosa, Sergio Jiménez, Da...

claim paper

Read More »

226

click to vote

NIPS
2001

206views Information Technology» more NIPS 2001»

Model-Free Least-Squares Policy Iteration

15 years 8 months ago

Download www.cs.duke.edu

We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...

Michail G. Lagoudakis, Ronald Parr

claim paper

Read More »

186

click to vote

ICML
2005
IEEE

133views Machine Learning» more ICML 2005»

A theoretical analysis of Model-Based Interval Estimation

16 years 8 months ago

Download paul.rutgers.edu

Several algorithms for learning near-optimal policies in Markov Decision Processes have been analyzed and proven efficient. Empirical results have suggested that Model-based Inter...

Alexander L. Strehl, Michael L. Littman

claim paper

Read More »

« Prev « First page 12 / 36 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers