Sciweavers

ATAL
2009
Springer
14 years 6 months ago
SarsaLandmark: an algorithm for learning in POMDPs with landmarks
Reinforcement learning algorithms that use eligibility traces, such as Sarsa(λ), have been empirically shown to be effective in learning good estimated-state-based policies in pa...
Michael R. James, Satinder P. Singh
ATAL
2009
Springer
14 years 6 months ago
Context-aware multi-stage routing
In context-aware route planning, a set of agents has to plan routes on a common infrastructure and each agent has to plan a conflict-free route from a source to a destination wit...
Adriaan ter Mors, Jeroen van Belle, Cees Witteveen
ATAL
2009
Springer
14 years 6 months ago
Distributed constraint optimization with structured resource constraints
Distributed constraint optimization (DCOP) provides a framework for coordinated decision making by a team of agents. Often, during the decision making, capacity constraints on age...
Akshat Kumar, Boi Faltings, Adrian Petcu
ATAL
2009
Springer
14 years 6 months ago
The cost of stability in weighted voting games
Yoram Bachrach, Reshef Meir, Michael Zuckerman, J&...
ATAL
2009
Springer
14 years 6 months ago
On-line coordination among discrete-event agents
Manh Tung Pham, Kiam Tian Seow
ATAL
2009
Springer
14 years 6 months ago
Using opponent models for efficient negotiation
d Abstract) Koen Hindriks Man-Machine Interaction Group Delft University of Technology Mekelweg 4, Delft, The Netherlands
Koen V. Hindriks, Catholijn M. Jonker, Dmytro Tykh...
ATAL
2009
Springer
14 years 6 months ago
Introducing social groups and group exchanges in the PopOrg model
Antônio Carlos da Rocha Costa, Graçal...
ATAL
2009
Springer
14 years 6 months ago
Adaptive price update in distributed Lagrangian relaxation protocol
Distributed Lagrangian Relaxation Protocol (DisLRP) has been proposed to solve a distributed combinatorial maximization problem called the Generalized Mutual Assignment Problem (G...
Katsutoshi Hirayama, Toshihiro Matsui, Makoto Yoko...
ATAL
2009
Springer
14 years 6 months ago
AOR-Simulation.org: cognitive agent simulation
In this paper, we describe an ontologically well-founded agentbased discrete event simulation framework with a high-level rulemulation language and an abstract simulator architect...
Gerd Wagner, Ion-Mircea Diaconescu
ATAL
2009
Springer
14 years 6 months ago
Adversarial search with procedural knowledge heuristic
We introduce an adversarial planning algorithm based on game tree search, which is applicable in large-scale multiplayer domains. In order to tackle the scalability issues of game...
Viliam Lisý, Branislav Bosanský, Mic...