Search Sciweavers | Sciweavers

114 search results - page 10 / 23

» Temporal Difference Updating without a Learning Rate

VLDB
1998
ACM

134 views, Database

Design, Implementation, and Performance of the LHAM Log-Structured History Data Access Method

15 years 11 months ago

Download www.vldb.org

Numerous applications such as stock market or medical information systems require that both historical and current data be logically integrated into a temporal database. The under...

Peter Muth, Patrick E. O'Neil, Achim Pick, Gerhard...

NIPS
2008

165 views, Information Technology

Regularized Policy Iteration

15 years 9 months ago

Download webdocs.cs.ualberta.ca

In this paper we consider approximate policy-iteration-based reinforcement learning algorithms. In order to implement a flexible function approximation scheme we propose the use o...

Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...

ICML
2000
IEEE

153 views, Machine Learning

Eligibility Traces for Off-Policy Policy Evaluation

16 years 8 months ago

Download www.cs.ualberta.ca

Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...

Doina Precup, Richard S. Sutton, Satinder P. Singh

IJCNN
2007
IEEE

169 views, Neural Networks

Random Feature Subset Selection for Analysis of Data with Missing Features

16 years 1 month ago

Download users.rowan.edu

Abstract - We discuss an ensemble-of-classifiers based algorithm for the missing feature problem. The proposed approach is inspired in part by the random subspace method, and in pa...

Joseph DePasquale, Robi Polikar

FLAIRS
2008

133 views, Artificial Intelligence

Reinforcement of Local Pattern Cases for Playing Tetris

15 years 9 months ago

Download www.aaai.org

In the paper, we investigate the use of reinforcement learning in CBR for estimating and managing a legacy case base for playing the game of Tetris. Each case corresponds to a loc...

Houcine Romdhane, Luc Lamontagne
