Sciweavers

ATAL
2004
Springer
14 years 5 months ago
A Pheromone-Based Utility Model for Collaborative Foraging
Multi-agent research often borrows from biology, where remarkable examples of collective intelligence may be found. One interesting example is ant colonies’ use of pheromones as...
Liviu Panait, Sean Luke
LAMAS
2005
Springer
14 years 6 months ago
Multi-agent Relational Reinforcement Learning
In this paper we report on using a relational state space in multi-agent reinforcement learning. There is growing evidence in the Reinforcement Learning research community that a r...
Tom Croonenborghs, Karl Tuyls, Jan Ramon, Maurice ...
IWLCS
2005
Springer
14 years 6 months ago
Counter Example for Q-Bucket-Brigade Under Prediction Problem
Aiming to clarify the convergence or divergence conditions for Learning Classifier System (LCS), this paper explores: (1) an extreme condition where the reinforcement process of ...
Atsushi Wada, Keiki Takadama, Katsunori Shimohara
ICCBR
2005
Springer
14 years 6 months ago
CBR for State Value Function Approximation in Reinforcement Learning
CBR is one of the techniques that can be applied to the task of approximating a function over high-dimensional, continuous spaces. In Reinforcement Learning systems a learning agen...
Thomas Gabel, Martin A. Riedmiller
ICANN
2005
Springer
14 years 6 months ago
Reinforcement Learning in MirrorBot
For this special session of EU projects in the area of NeuroIT, we will review the progress of the MirrorBot project with special emphasis on its relation to reinforcement learning...
Cornelius Weber, David Muse, Mark Elshaw, Stefan W...
GECCO
2005
Springer
155views Optimization» more  GECCO 2005»
14 years 6 months ago
Co-evolving recurrent neurons learn deep memory POMDPs
Recurrent neural networks are theoretically capable of learning complex temporal sequences, but training them through gradient-descent is too slow and unstable for practical use i...
Faustino J. Gomez, Jürgen Schmidhuber
GECCO
2005
Springer
111views Optimization» more  GECCO 2005»
14 years 6 months ago
XCS with eligibility traces
The development of the XCS Learning Classifier System has produced a robust and stable implementation that performs competitively in direct-reward environments. Although investig...
Jan Drugowitsch, Alwyn Barry
ECML
2005
Springer
14 years 6 months ago
Towards Finite-Sample Convergence of Direct Reinforcement Learning
Abstract. While direct, model-free reinforcement learning often performs better than model-based approaches in practice, only the latter have yet supported theoretical guarantees f...
Shiau Hong Lim, Gerald DeJong
AUSAI
2005
Springer
14 years 6 months ago
Global Versus Local Constructive Function Approximation for On-Line Reinforcement Learning
: In order to scale to problems with large or continuous state-spaces, reinforcement learning algorithms need to be combined with function approximation techniques. The majority of...
Peter Vamplew, Robert Ollington
ATAL
2005
Springer
14 years 6 months ago
Coordinated exploration in multi-agent reinforcement learning: an application to load-balancing
This paper is concerned with how multi-agent reinforcement learning algorithms can practically be applied to real-life problems. Recently, a new coordinated multi-agent exploratio...
Katja Verbeeck, Ann Nowé, Karl Tuyls