Sciweavers

1167 search results - page 102 / 234
» Relational Markov Games
Sort
View
IAT
2003
IEEE
15 years 11 months ago
Asymmetric Multiagent Reinforcement Learning
A gradient-based method for both symmetric and asymmetric multiagent reinforcement learning is introduced in this paper. Symmetric multiagent reinforcement learning addresses the ...
Ville Könönen
ROBOCUP
2001
Springer
94views Robotics» more  ROBOCUP 2001»
15 years 10 months ago
CS Freiburg: Global View by Cooperative Sensing
Global vision systems as found in the small size league are prohibited in the middle size league. This paper presents methods for creating a global view of the world by cooperative...
Markus Dietl, Jens-Steffen Gutmann, Bernhard Nebel
AAAI
2006
15 years 7 months ago
Targeting Specific Distributions of Trajectories in MDPs
We define TTD-MDPs, a novel class of Markov decision processes where the traditional goal of an agent is changed from finding an optimal trajectory through a state space to realiz...
David L. Roberts, Mark J. Nelson, Charles Lee Isbe...
AAAI
2006
15 years 7 months ago
Point-based Dynamic Programming for DEC-POMDPs
We introduce point-based dynamic programming (DP) for decentralized partially observable Markov decision processes (DEC-POMDPs), a new discrete DP algorithm for planning strategie...
Daniel Szer, François Charpillet
ISCI
2000
98views more  ISCI 2000»
15 years 5 months ago
Quantum decision-maker
A quantum device simulating human decision making process is introduced. It consists of quantum recurrent nets generating stochastic processes which represent the motor dynamics, ...
Michail Zak