Search Sciweavers | Sciweavers

109 search results - page 4 / 22

» Policy teaching through reward function learning

163

click to vote

IIE
2007

63views more IIE 2007»

Investigation of Q-Learning in the Context of a Virtual Learning Environment

15 years 6 months ago

Download www.mii.lt

We investigate the possibility to apply a known machine learning algorithm of Q-learning in the domain of a Virtual Learning Environment (VLE). It is important in this problem doma...

Dalia Baziukaite

claim paper

Read More »

230

click to vote

AAMAS
2010
Springer

251views Intelligent Agents» more AAMAS 2010»

Teaching a pet-robot to understand user feedback through interactive virtual training tasks

15 years 6 months ago

Download www.robotopia.de

Abstract In this paper, we present a human-robot teaching framework that uses "virtual" games as a means for adapting a robot to its user through natural interaction in a...

Anja Austermann, Seiji Yamada

claim paper

Read More »

190

click to vote

IJCAI
2007

254views Artificial Intelligence» more IJCAI 2007»

Bayesian Inverse Reinforcement Learning

15 years 8 months ago

Download www.ijcai.org

Inverse Reinforcement Learning (IRL) is the problem of learning the reward function underlying a Markov Decision Process given the dynamics of the system and the behaviour of an e...

Deepak Ramachandran, Eyal Amir

claim paper

Read More »

174

click to vote

NECO
2010

97views more NECO 2010»

Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning

15 years 5 months ago

Download www.kyb.tuebingen.mpg.de

Most conventional Policy Gradient Reinforcement Learning (PGRL) algorithms neglect (or do not explicitly make use of) a term in the average reward gradient with respect to the pol...

Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto...

claim paper

Read More »

183

Voted

ICML
2002
IEEE

146views Machine Learning» more ICML 2002»

Hierarchically Optimal Average Reward Reinforcement Learning

16 years 7 months ago

Download www.cs.ualberta.ca

Two notions of optimality have been explored in previous work on hierarchical reinforcement learning (HRL): hierarchical optimality, or the optimal policy in the space defined by ...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

« Prev « First page 4 / 22 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers