Search Sciweavers | Sciweavers

1118 search results - page 6 / 224

» Relational temporal difference learning

142

click to vote

CORR
2010
Springer

204views Education» more CORR 2010»

Predictive State Temporal Difference Learning

15 years 28 days ago

Download www.cs.cmu.edu

We propose a new approach to value function approximation which combines linear temporal difference reinforcement learning with subspace identiﬁcation. In practical applications...

Byron Boots, Geoffrey J. Gordon

claim paper

Read More »

115

click to vote

ICCBR
2010
Springer

274views Automated Reasoning» more ICCBR 2010»

Reducing the Memory Footprint of Temporal Difference Learning over Finitely Many States by Using Case-Based Generalization

15 years 6 months ago

Download www.cse.lehigh.edu

In this paper we present an approach for reducing the memory footprint requirement of temporal difference methods in which the set of states is finite. We use case-based generaliza...

Matt Dilts, Héctor Muñoz-Avila

claim paper

Read More »

116

Voted

ML
2002
ACM

154views Machine Learning» more ML 2002»

Technical Update: Least-Squares Temporal Difference Learning

15 years 2 months ago

Download www.research.rutgers.edu

TD() is a popular family of algorithms for approximate policy evaluation in large MDPs. TD() works by incrementally updating the value function after each observed transition. It h...

Justin A. Boyan

claim paper

Read More »

128

click to vote

ECAI
2000
Springer

90views Artificial Intelligence» more ECAI 2000»

Efficient Asymptotic Approximation in Temporal Difference Learning

15 years 6 months ago

Download www.inra.fr

Abstract. TD(

Frédérick Garcia, Florent Serre

claim paper

Read More »

120

click to vote

NIPS
1993

123views Information Technology» more NIPS 1993»

Temporal Difference Learning of Position Evaluation in the Game of Go

15 years 3 months ago

Download www.gatsby.ucl.ac.uk

The game of Go has a high branching factor that defeats the tree search approach used in computer chess, and long-range spatiotemporal interactions that make position evaluation e...

Nicol N. Schraudolph, Peter Dayan, Terrence J. Sej...

claim paper

Read More »

« Prev « First page 6 / 224 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers