Search Sciweavers | Sciweavers

178 search results - page 7 / 36

» Probabilistic policy reuse in a reinforcement learning agent

click to vote

AAAI
2008

207views Intelligent Agents» more AAAI 2008»

Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation

13 years 11 months ago

Download sugiyama-www.cs.titech.ac.jp

Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past, which is an essential problem for physically grounded AI as experiments are us...

Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiya...

claim paper

Read More »

click to vote

PKDD
2009
Springer

181views Data Mining» more PKDD 2009»

Active Learning for Reward Estimation in Inverse Reinforcement Learning

14 years 3 months ago

Download users.isr.ist.utl.pt

Abstract. Inverse reinforcement learning addresses the general problem of recovering a reward function from samples of a policy provided by an expert/demonstrator. In this paper, w...

Manuel Lopes, Francisco S. Melo, Luis Montesano

claim paper

Read More »

click to vote

AAAI
2007

142views Intelligent Agents» more AAAI 2007»

Temporal Difference and Policy Search Methods for Reinforcement Learning: An Empirical Comparison

13 years 11 months ago

Download staff.science.uva.nl

Reinforcement learning (RL) methods have become popular in recent years because of their ability to solve complex tasks with minimal feedback. Both genetic algorithms (GAs) and te...

Matthew E. Taylor, Shimon Whiteson, Peter Stone

claim paper

Read More »

click to vote

ROBOCUP
2007
Springer

153views Robotics» more ROBOCUP 2007»

Model-Based Reinforcement Learning in a Complex Domain

14 years 2 months ago

Download userweb.cs.utexas.edu

Reinforcement learning is a paradigm under which an agent seeks to improve its policy by making learning updates based on the experiences it gathers through interaction with the en...

Shivaram Kalyanakrishnan, Peter Stone, Yaxin Liu

claim paper

Read More »

click to vote

AAAI
2010

191views Intelligent Agents» more AAAI 2010»

Relative Entropy Policy Search

13 years 10 months ago

Download www.kyb.tuebingen.mpg.de

Policy search is a successful approach to reinforcement learning. However, policy improvements often result in the loss of information. Hence, it has been marred by premature conv...

Jan Peters, Katharina Mülling, Yasemin Altun

claim paper

Read More »

« Prev « First page 7 / 36 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers