One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...
Lihong Li, Michael L. Littman, Christopher R. Mans...
In the future, robotic surgical assistants may assist surgeons by performing specific subtasks such as retraction and suturing to reduce surgeon tedium and reduce the duration of s...
Jur van den Berg, Stephen Miller, Daniel Duckworth...
This paper proposes a type-2 self-organizing neural fuzzy system (T2SONFS) and its hardware implementation. The antecedent parts in each T2SONFS fuzzy rule are interval type-2 fuzz...
Abstract--Since the fuzzy cerebellar model articulation controller (FCMAC) uses linguistic variables, it is highly intuitive and easily comprehended. Despite the FCMAC's good ...
Wen Yu, Floriberto Ortiz Rodriguez, Marco A. Moren...
In this paper we consider approximate policy-iteration-based reinforcement learning algorithms. In order to implement a flexible function approximation scheme we propose the use o...
Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...