Search Sciweavers | Sciweavers

223 search results - page 7 / 45

» Least-Squares Temporal Difference Learning

205

click to vote

SIGIR
1995
ACM

212views Information Technology» more SIGIR 1995»

Noise Reduction in a Statistical Approach to Text Categorization

15 years 10 months ago

Download reference.kfupm.edu.sa

This paper studies noise reduction for computational efﬁciency improvements in a statistical learning method for text categorization, the Linear Least Squares Fit (LLSF) mapping...

Yiming Yang

claim paper

Read More »

156

click to vote

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

Relational temporal difference learning

16 years 7 months ago

Download cll.stanford.edu

We introduce relational temporal difference learning as an effective approach to solving multi-agent Markov decision problems with large state spaces. Our algorithm uses temporal ...

Nima Asgharbeygi, David J. Stracuzzi, Pat Langley

claim paper

Read More »

201

click to vote

CIG
2006
IEEE

202views Applied Computing» more CIG 2006»

Temporal Difference Learning Versus Co-Evolution for Acquiring Othello Position Evaluation

16 years 1 months ago

Download algoval.essex.ac.uk

Abstract— This paper compares the use of temporal difference learning (TDL) versus co-evolutionary learning (CEL) for acquiring position evaluation functions for the game of Othe...

Simon M. Lucas, Thomas Philip Runarsson

claim paper

Read More »

223

click to vote

ICML
2003
IEEE

150views Machine Learning» more ICML 2003»

The Significance of Temporal-Difference Learning in Self-Play Training TD-Rummy versus EVO-rummy

16 years 5 days ago

Download www.hpl.hp.com

Reinforcement learning has been used for training game playing agents. The value function for a complex game must be approximated with a continuous function because the number of ...

Clifford Kotnik, Jugal K. Kalita

claim paper

Read More »

193

click to vote

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

16 years 7 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

« Prev « First page 7 / 45 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers