Search Sciweavers | Sciweavers

» Least-Squares Temporal Difference Learning

ICML 2009, IEEE, Machine Learning

Proto-predictive representation of states with simple recurrent temporal-difference networks

16 years 8 months ago

Download www.snowelm.com

We propose a new neural network architecture, called Simple Recurrent Temporal-Difference Networks (SR-TDNs), that learns to predict future observations in partially observable en...

Takaki Makino

ICDM 2008, IEEE, Data Mining

Simultaneous Co-segmentation and Predictive Modeling for Large, Temporal Marketing Data

16 years 1 month ago

Download users.ece.utexas.edu

Several marketing problems involve prediction of customer purchase behavior and forecasting future preferences. We consider predictive modeling of large scale, bi-modal or multimo...

Meghana Deodhar, Joydeep Ghosh

GECCO 2009, Springer, Optimization

Reinforcement learning for games: failures and successes

16 years 19 hours ago

Download www.gm.fh-koeln.de

We apply CMA-ES, an evolution strategy with covariance matrix adaptation, and TDL (Temporal Difference Learning) to reinforcement learning tasks. In both cases these algorithms se...

Wolfgang Konen, Thomas Bartz-Beielstein

ML 1998, ACM, Machine Learning

Co-Evolution in the Successful Learning of Backgammon Strategy

15 years 7 months ago

Download www.demo.cs.brandeis.edu

Following Tesauro’s work on TD-Gammon, we used a 4000 parameter feed-forward neural network to develop a competitive backgammon evaluation function. Play proceeds by a roll of t...

Jordan B. Pollack, Alan D. Blair

ICONIP 2009, Information Technology

Tracking in Reinforcement Learning

15 years 5 months ago

Download www.metz.supelec.fr

Reinforcement learning induces non-stationarity at several levels. Adaptation to non-stationary environments is of course a desired feature of a fair RL algorithm. Yet, even if the...

Matthieu Geist, Olivier Pietquin, Gabriel Fricout
