Sciweavers

1235 search results - page 16 / 247
» Reinforcement learning in a nutshell
Sort
View
NIPS
2004
13 years 9 months ago
Brain Inspired Reinforcement Learning
Successful application of reinforcement learning algorithms often involves considerable hand-crafting of the necessary non-linear features to reduce the complexity of the value fu...
François Rivest, Yoshua Bengio, John Kalask...
IJCAI
2007
13 years 9 months ago
Deictic Option Schemas
Deictic representation is a representational paradigm, based on selective attention and pointers, that allows an agent to learn and reason about rich complex environments. In this...
Balaraman Ravindran, Andrew G. Barto, Vimal Mathew
JMLR
2012
11 years 10 months ago
Contextual Bandit Learning with Predictable Rewards
Contextual bandit learning is a reinforcement learning problem where the learner repeatedly receives a set of features (context), takes an action and receives a reward based on th...
Alekh Agarwal, Miroslav Dudík, Satyen Kale,...
ICCBR
2009
Springer
14 years 2 months ago
Improving Reinforcement Learning by Using Case Based Heuristics
This work presents a new approach that allows the use of cases in a case base as heuristics to speed up Reinforcement Learning algorithms, combining Case Based Reasoning (CBR) and ...
Reinaldo A. C. Bianchi, Raquel Ros, Ramon Ló...