Sciweavers

178 search results - page 6 / 36
» Probabilistic policy reuse in a reinforcement learning agent

Publication
12 years 11 months ago
Preference elicitation and inverse reinforcement learning
We state the problem of inverse reinforcement learning in terms of preference elicitation, resulting in a principled (Bayesian) statistical formulation. This generalises previous w...
Constantin Rothkopf, Christos Dimitrakakis
AAAI
2011
12 years 8 months ago
Coordinated Multi-Agent Reinforcement Learning in Networked Distributed POMDPs
In many multi-agent applications such as distributed sensor nets, a network of agents act collaboratively under uncertainty and local interactions. Networked Distributed POMDP (ND...
Chongjie Zhang, Victor R. Lesser
ROBOCUP
2005
Springer
14 years 2 months ago
Simultaneous Learning to Acquire Competitive Behaviors in Multi-agent System Based on Modular Learning System
The existing reinforcement learning approaches have been suffering from the policy alternation of others in multiagent dynamic environments. A typical example is a case of RoboCup...
Yasutake Takahashi, Kazuhiro Edazawa, Kentarou Nom...
ICML
2000
IEEE
14 years 9 months ago
Eligibility Traces for Off-Policy Policy Evaluation
Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...
Doina Precup, Richard S. Sutton, Satinder P. Singh
ECML
2004
Springer
14 years 2 months ago
Batch Reinforcement Learning with State Importance
We investigate the problem of using function approximation in reinforcement learning where the agent’s policy is represented as a classifier mapping states to actions....
Lihong Li, Vadim Bulitko, Russell Greiner