Sciweavers

223 search results - page 18 / 45
» Least-Squares Temporal Difference Learning
Sort
View
ICML
2009
IEEE
14 years 9 months ago
Proto-predictive representation of states with simple recurrent temporal-difference networks
We propose a new neural network architecture, called Simple Recurrent Temporal-Difference Networks (SR-TDNs), that learns to predict future observations in partially observable en...
Takaki Makino
ICDM
2008
IEEE
190views Data Mining» more  ICDM 2008»
14 years 3 months ago
Simultaneous Co-segmentation and Predictive Modeling for Large, Temporal Marketing Data
Several marketing problems involve prediction of customer purchase behavior and forecasting future preferences. We consider predictive modeling of large scale, bi-modal or multimo...
Meghana Deodhar, Joydeep Ghosh
GECCO
2009
Springer
124views Optimization» more  GECCO 2009»
14 years 1 months ago
Reinforcement learning for games: failures and successes
We apply CMA-ES, an evolution strategy with covariance matrix adaptation, and TDL (Temporal Difference Learning) to reinforcement learning tasks. In both cases these algorithms se...
Wolfgang Konen, Thomas Bartz-Beielstein
ML
1998
ACM
136views Machine Learning» more  ML 1998»
13 years 8 months ago
Co-Evolution in the Successful Learning of Backgammon Strategy
Following Tesauro’s work on TD-Gammon, we used a 4000 parameter feed-forward neural network to develop a competitive backgammon evaluation function. Play proceeds by a roll of t...
Jordan B. Pollack, Alan D. Blair
ICONIP
2009
13 years 6 months ago
Tracking in Reinforcement Learning
Reinforcement learning induces non-stationarity at several levels. Adaptation to non-stationary environments is of course a desired feature of a fair RL algorithm. Yet, even if the...
Matthieu Geist, Olivier Pietquin, Gabriel Fricout