Sciweavers

3643 search results - page 77 / 729
» Learning Submodular Functions
Sort
View
PKDD
2009
Springer
181views Data Mining» more  PKDD 2009»
15 years 11 months ago
Active Learning for Reward Estimation in Inverse Reinforcement Learning
Abstract. Inverse reinforcement learning addresses the general problem of recovering a reward function from samples of a policy provided by an expert/demonstrator. In this paper, w...
Manuel Lopes, Francisco S. Melo, Luis Montesano
ICML
1998
IEEE
16 years 5 months ago
The MAXQ Method for Hierarchical Reinforcement Learning
This paper presents a new approach to hierarchical reinforcement learning based on the MAXQ decomposition of the value function. The MAXQ decomposition has both a procedural seman...
Thomas G. Dietterich
IJCAI
2007
15 years 5 months ago
Bayesian Inverse Reinforcement Learning
Inverse Reinforcement Learning (IRL) is the problem of learning the reward function underlying a Markov Decision Process given the dynamics of the system and the behaviour of an e...
Deepak Ramachandran, Eyal Amir
ICML
2003
IEEE
15 years 9 months ago
The Significance of Temporal-Difference Learning in Self-Play Training TD-Rummy versus EVO-rummy
Reinforcement learning has been used for training game playing agents. The value function for a complex game must be approximated with a continuous function because the number of ...
Clifford Kotnik, Jugal K. Kalita
ECTEL
2006
Springer
15 years 8 months ago
Community Based Software Development - the Case of Movelex
Abstract. The paper provides an overview of the elaboration, testing and improvement of Movelex, a complex virtual learning environment (VLE) supporting the establishment of self-r...
Kornél Varga, Andrea Kárpáti