Sciweavers

417 search results - page 37 / 84
» The Dynamics of Reinforcement Learning in Cooperative Multia...
Sort
View
NIPS
1996
13 years 9 months ago
Multidimensional Triangulation and Interpolation for Reinforcement Learning
Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...
Scott Davies
ISNN
2007
Springer
14 years 1 months ago
Online Dynamic Value System for Machine Learning
A novel online dynamic value system for machine learning is proposed in this paper. The proposed system has a dual network structure: data processing network (DPN) and information ...
Haibo He, Janusz A. Starzyk
ECAI
1992
Springer
13 years 11 months ago
Towards a Cooperation Knowledge Level For Collaborative Problem Solving
The cooperation knowledge level is a new computer level specifically for multi-agent problem solvers which describes rich and explicit models of common social phenomena. A cooperat...
Nicholas R. Jennings
AAAI
2008
13 years 10 months ago
Perpetual Learning for Non-Cooperative Multiple Agents
This paper examines, by argument, the dynamics of sequences of behavioural choices made, when non-cooperative restricted-memory agents learn in partially observable stochastic gam...
Luke Dickens
ECML
2007
Springer
13 years 11 months ago
Efficient Continuous-Time Reinforcement Learning with Adaptive State Graphs
Abstract. We present a new reinforcement learning approach for deterministic continuous control problems in environments with unknown, arbitrary reward functions. The difficulty of...
Gerhard Neumann, Michael Pfeiffer, Wolfgang Maass