Agents in dynamic environments have to deal with world representations that change over time. In or...
To appear in: RoboCup 2005: Robot Soccer World Cup IX, © Springer-Verlag, 2006
Andreas D. Lattner, Andrea Miene, Ubbo Visser, Ott...
A key problem in reinforcement learning is finding a good balance between the need to explore the environment and the need to gain rewards by exploiting existing knowledge. Much ...
In this paper, we study a sequential decision making problem. The objective is to maximize the total reward while satisfying constraints, which are defined at every time step. The...
In this paper, we study a sequential decision making problem. The objective is to maximize the average reward accumulated over time subject to temporal cost constraints. The novel...
We present an integrated approach for reasoning about and learning conversation patterns in multiagent communication. The approach is based on the assumption that information abou...
Michael Rovatsos, Felix A. Fischer, Gerhard Weiß...