Sciweavers

779 search results - page 15 / 156
» Reinforcement Using Supervised Learning for Policy Generaliz...
Sort
View
ICMLA
2008
13 years 9 months ago
Basis Function Construction in Reinforcement Learning Using Cascade-Correlation Learning Architecture
In reinforcement learning, it is a common practice to map the state(-action) space to a different one using basis functions. This transformation aims to represent the input data i...
Sertan Girgin, Philippe Preux
NECO
2010
97views more  NECO 2010»
13 years 6 months ago
Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning
Most conventional Policy Gradient Reinforcement Learning (PGRL) algorithms neglect (or do not explicitly make use of) a term in the average reward gradient with respect to the pol...
Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto...
KES
2007
Springer
14 years 1 months ago
Making Financial Trading by Recurrent Reinforcement Learning
In this paper we propose a financial trading system whose strategy is developed by means of an artificial neural network approach based on a recurrent reinforcement learning algo...
Francesco Bertoluzzo, Marco Corazza
INTERSPEECH
2010
13 years 2 months ago
Still talking to machines (cognitively speaking)
This overview article reviews the structure of a fully statistical spoken dialogue system (SDS), using as illustration, various systems and components built at Cambridge over the ...
Steve Young