This work presents a new algorithm, called Heuristically Accelerated Minimax-Q (HAMMQ), that allows the use of heuristics to speed up the wellknown Multiagent Reinforcement Learni...
Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna...
Software architecture practice is highly complex. Software architects interact with business as well as technical aspects of systems, often embedded in large and changing organiza...
An explicit exploration strategy is necessary in reinforcement learning (RL) to balance the need to reduce the uncertainty associated with the expected outcome of an action and the...
Much recent research in decision theoretic planning has adopted Markov decision processes (MDPs) as the model of choice, and has attempted to make their solution more tractable by...
Autonomy is a prime issue on robotics field and it is closely related to decision making. Last researches on decision making for social robots are focused on imitating humans’ m...