Sciweavers

4544 search results - page 159 / 909
» Reinforcement Learning with Time
Sort
View
ECAI
2008
Springer
13 years 12 months ago
Exploiting locality of interactions using a policy-gradient approach in multiagent learning
In this paper, we propose a policy gradient reinforcement learning algorithm to address transition-independent Dec-POMDPs. This approach aims at implicitly exploiting the locality...
Francisco S. Melo
ICCBR
2005
Springer
14 years 3 months ago
Opportunities for CBR in Learning by Doing
In this paper we partially describe JV2 M, a metaphorical simulation of the Java Virtual Machine where students can learn Java language compilation and reinforce object-oriented pr...
Pedro Pablo Gómez-Martín, Marco Anto...
DAGM
2007
Springer
14 years 2 months ago
Efficient Learning of Neural Networks with Evolutionary Algorithms
Abstract. In this article we present EANT2, a method that creates neural networks (NNs) by evolutionary reinforcement learning. The structure of NNs is developed using mutation ope...
Nils T. Siebel, Jochen Krause, Gerald Sommer
ECAL
2007
Springer
14 years 4 months ago
Guided Self-organisation for Autonomous Robot Development
Abstract. The paper presents a method to guide the self-organised development of behaviours of autonomous robots. In earlier publications we demonstrated how to use the homeokinesi...
Georg Martius, J. Michael Herrmann, Ralf Der
ICML
2005
IEEE
14 years 11 months ago
Bayesian sparse sampling for on-line reward optimization
We present an efficient "sparse sampling" technique for approximating Bayes optimal decision making in reinforcement learning, addressing the well known exploration vers...
Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D...