Sciweavers

» Building Relational World Models for Reinforcement Learning
ICML
2005
IEEE
14 years 8 months ago
Reinforcement learning with Gaussian processes
Gaussian Process Temporal Difference (GPTD) learning offers a Bayesian solution to the policy evaluation problem of reinforcement learning. In this paper we extend the GPTD framew...
Yaakov Engel, Shie Mannor, Ron Meir
NIPS
2003
13 years 9 months ago
Learning a World Model and Planning with a Self-Organizing, Dynamic Neural System
We present a connectionist architecture that can learn a model of the relations between perceptions and actions and use this model for behavior planning. State representations are...
Marc Toussaint
ECML
2005
Springer
14 years 1 month ago
Model-Based Online Learning of POMDPs
Learning to act in an unknown partially observable domain is a difficult variant of the reinforcement learning paradigm. Research in the area has focused on model-free m...
Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony
CORR
2011
Springer
12 years 11 months ago
Accelerating Reinforcement Learning through Implicit Imitation
Imitation can be viewed as a means of enhancing learning in multiagent environments. It augments an agent’s ability to learn useful behaviors by making intelligent use of the kn...
Craig Boutilier, Bob Price
JDCTA
2010
13 years 2 months ago
Learning and Decision Making in Human During a Game of Matching Pennies
To gain insights into the neural basis of such adaptive decision-making processes, we investigated the nature of learning process in humans playing a competitive game with binary ...
Jianfeng Hu, Xiaofeng Li, Jinghai Yin