We present a learning framework for Markovian decision processes that is based on optimization in the policy space. Instead of using relatively slow gradient-based optimization al...
Time varying environments or model selection problems lead to crucial dilemmas in identification and control science. In this paper, we propose a modular prediction scheme consisti...
A distributed robot control system is proposed based on a temporal self-organizing neural network, called competitive and temporal Hebbian (CTH) network. The CTH network can learn ...
To increase the assurance with which agents can be deployed in operational settings, we have been developing the KAoS policy and domain services. In conjunction with Nomads strong...
Jeffrey M. Bradshaw, Andrzej Uszok, Renia Jeffers,...
Abstract. While traditional approaches to machine learning are sensitive to highdimensional state and action spaces, this paper demonstrates how an indirectly encoded neurocontroll...