Search Sciweavers | Sciweavers

1340 search results - page 10 / 268

» Kalman Temporal Differences

GECCO
2006
Springer

208 views, Optimization

Comparing evolutionary and temporal difference methods in a reinforcement learning domain

15 years 10 months ago

Download www.cs.bham.ac.uk

Both genetic algorithms (GAs) and temporal difference (TD) methods have proven effective at solving reinforcement learning (RL) problems. However, since few rigorous empirical com...

Matthew E. Taylor, Shimon Whiteson, Peter Stone

NIPS
1993

123 views, Information Technology

Temporal Difference Learning of Position Evaluation in the Game of Go

15 years 8 months ago

Download www.gatsby.ucl.ac.uk

The game of Go has a high branching factor that defeats the tree search approach used in computer chess, and long-range spatiotemporal interactions that make position evaluation e...

Nicol N. Schraudolph, Peter Dayan, Terrence J. Sej...

ICIP
2000
IEEE

155 views, Image Processing

Video Dissolve and Wipe Detection via Spatio-Temporal Images of Chromatic Histogram Differences

15 years 11 months ago

Download www.cs.sfu.ca

Gradual transitions represent a challenging problem for temporal segmentation of video. Here we present two new features for detecting these. Recently, Ngo et al. set out a method...

Mark S. Drew, Ze-Nian Li, Xiang Zhong

ECAI
2000
Springer

90 views, Artificial Intelligence

Efficient Asymptotic Approximation in Temporal Difference Learning

15 years 10 months ago

Download www.inra.fr

Abstract. TD(

Frédérick Garcia, Florent Serre

ML
2002
ACM

168 views, Machine Learning

On Average Versus Discounted Reward Temporal-Difference Learning

15 years 6 months ago

Download web.mit.edu

We provide an analytical comparison between discounted and average reward temporal-difference (TD) learning with linearly parameterized approximations. We first consider the asympt...

John N. Tsitsiklis, Benjamin Van Roy
